Search CORE

11 research outputs found

Sample Reproducibility of Genetic Association Using Different Multimarker TDTs in Genome-Wide Association Studies: Characterization and a New Approach

Author: A Alcina
A Rinaldo
C Bourgain
D Clayton
D Schaid
DA Hinds
DC Crawford
DH ‘International Multiple Sclerosis Genetics Consortium’
F Lundmark
F Matesanz
Fuencisla Matesanz
G Hellenthal
G Kristjansdottir
GR Abecasis
H BickeBöller
H Seltman
J Gregory
J Haer
J Lam
J Marchini
J Ott
J Tzeng
J Zhao
JD Rioux
Joel S. Bader
K Yu
K Yu
M Daly
M Zoledziewska
Mara M. Abad-Grau
MM Abad-Grau
NL Kaplan
Nuria Medina-Medina
P Scheet
P Sevon
PC Sham
PC Sham
PD Jager
R Hudson
R Montes
R Tang
RA Betensky
Rosana Montes-Soldado
RS Spielman
RZ Fan
S Gabriel
S Ramagopalan
S Zhang
TI HapMap-Consortium
Vineet Bafna
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

Multimarker Transmission/Disequilibrium Tests (TDTs) are very robust association tests to population admixture and structure which may be used to identify susceptibility loci in genome-wide association studies. Multimarker TDTs using several markers may increase power by capturing high-degree associations. However, there is also a risk of spurious associations and power reduction due to the increase in degrees of freedom. In this study we show that associations found by tests built on simple null hypotheses are highly reproducible in a second independent data set regardless the number of markers. As a test exhibiting this feature to its maximum, we introduce the multimarker -Groups TDT (), a test which under the hypothesis of no linkage, asymptotically follows a distribution with degree of freedom regardless the number of markers. The statistic requires the division of parental haplotypes into two groups: disease susceptibility and disease protective haplotype groups. We assessed the test behavior by performing an extensive simulation study as well as a real-data study using several data sets of two complex diseases. We show that test is highly efficient and it achieves the highest power among all the tests used, even when the null hypothesis is tested in a second independent data set. Therefore, turns out to be a very promising multimarker TDT to perform genome-wide searches for disease susceptibility loci that may be used as a preprocessing step in the construction of more accurate genetic models to predict individual susceptibility to complex diseases

Public Library of Science (PLOS)

CiteSeerX

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Directory of Open Access Journals

PubMed Central

Repositorio Institucional Universidad de Granada

Digital.CSIC

FigShare

Sliding window maps for the -affected data set.

Author: Fuencisla Matesanz (166725)
Mara M. Abad-Grau (183389)
Nuria Medina-Medina (183391)
Rosana Montes-Soldado (183395)
Vineet Bafna (44575)
Publication venue
Publication date
Field of study

Window size is TDTs used were (red diamonds), (green squares), (purple circles) and (blue triangles).</p

FigShare

The table used by .

Author: Fuencisla Matesanz (166725)
Mara M. Abad-Grau (183389)
Nuria Medina-Medina (183391)
Rosana Montes-Soldado (183395)
Vineet Bafna (44575)
Publication venue
Publication date
Field of study

Only those parental genotypes with one haplotype in each group are used by The counts refer to the number of times haplotypes in one group are transmitted by heterozygous parents to their affected offspring.</p

FigShare

Type I error rates in presence of population stratification and admixture for and .

Author: Fuencisla Matesanz (166725)
Mara M. Abad-Grau (183389)
Nuria Medina-Medina (183391)
Rosana Montes-Soldado (183395)
Vineet Bafna (44575)
Publication venue
Publication date
Field of study

Results for different minor allele frequencies (MAFs) in the second subpopulation (q) and different proportion of trios from the first subpopulation (pp), obtained by (top half) and (bottom half) for nominal levels and and haplotypes of length , , , and (columns to respectively).</p

FigShare

Sliding window maps for the -affected data set.

Author: Fuencisla Matesanz (166725)
Mara M. Abad-Grau (183389)
Nuria Medina-Medina (183391)
Rosana Montes-Soldado (183395)
Vineet Bafna (44575)
Publication venue
Publication date
Field of study

Window size is TDTs used were (red diamonds), (green squares), (purple circles) and (blue triangles).</p

FigShare

Sliding window maps for the -affected data set.

Author: Fuencisla Matesanz (166725)
Mara M. Abad-Grau (183389)
Nuria Medina-Medina (183391)
Rosana Montes-Soldado (183395)
Vineet Bafna (44575)
Publication venue
Publication date
Field of study

Window size is TDTs used were (red diamonds), (green squares), (purple circles) and (blue triangles).</p

FigShare

Genotype counts and their transmissions used by .

Author: Fuencisla Matesanz (166725)
Mara M. Abad-Grau (183389)
Nuria Medina-Medina (183391)
Rosana Montes-Soldado (183395)
Vineet Bafna (44575)
Publication venue
Publication date
Field of study

Haplotypes in rows represent those transmitted haplotypes at each genotype. Haplotypes in columns represent those nontransmitted haplotypes at each genotype. Homozygous genotype counts (diagonal) are crossed off the tables as they are not used to compute Left grid: genotype counts from the training data set (see <a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0029613#pone-0029613-t002" target="_blank">Table 2</a>) used to make up groups and in Groups are: with those haplotypes with counts larger than counts (Ab: versus and aB: versus ) and with counts larger than counts ( versus ). Right grid: genotype counts from the test data set used to compute the statistic. As the length similarity measure is used to assign an haplotype to a group, and the two most similar haplotypes to haplotype belongs to group is assigned to All the haplotypes belonging to the same group are considered of having an equivalent effect and are collapsed. Therefore, parental genotypes in the test data set with haplotypes belonging to the same group are considered as homozygous and not used by (they are crossed off the table too).</p

FigShare

An example of parental genotype counts showing transmitted and nontransmitted haplotypes in a training and test data sets of nuclear families and haplotypes of length ( different haplotypes: AB, AB, aB and ab).

Author: Fuencisla Matesanz (166725)
Mara M. Abad-Grau (183389)
Nuria Medina-Medina (183391)
Rosana Montes-Soldado (183395)
Vineet Bafna (44575)
Publication venue
Publication date
Field of study

The total number of trios is ( parents) so that half of them ( trios, parents) were randomly assigned to the training data set and the others to the test data set. Each row shows counts for a possible configuration (there are possible configurations for haplotypes of length ) of the transmitted (second column) and nontransmitted (third column) haplotypes in a parental genotype.</p

FigShare

Association rates of and using a second data set to test reproducibility.

Author: Fuencisla Matesanz (166725)
Mara M. Abad-Grau (183389)
Nuria Medina-Medina (183391)
Rosana Montes-Soldado (183395)
Vineet Bafna (44575)
Publication venue
Publication date
Field of study

Results for 100 simulations of 250 family trios as a function of the recombination rate using the dominant and one-locus genetic model and haplotypes of lengths (left plot), (plot in the middle) and (right plot). A nominal level of and a relative risk of were used for all plots. Results for and are plotted in purple circles and blue triangles respectively. Dashed lines show results for the data subset (125 trios randomly chosen) used to build the model while solid lines show results for a second data subset (the remaining 125 trios) used to test reproducibility.</p

FigShare

Association rates for different proportions of missing haplotypes.

Author: Fuencisla Matesanz (166725)
Mara M. Abad-Grau (183389)
Nuria Medina-Medina (183391)
Rosana Montes-Soldado (183395)
Vineet Bafna (44575)
Publication venue
Publication date
Field of study

Results for 100 simulations of family trios as a function of the proportion of missing haplotypes using the additive and one-locus genetic model and haplotypes of lengths (left plot), (plot in the middle) and (right plot). A nominal level of and a relative risk of were used for all plots. Results for and i.e. all tests were applied under the holdout approach, are plotted in purple circles, blue triangles, green squares and red diamonds respectively.</p

FigShare