Search CORE

10 research outputs found

Power estimates for multiple genes case-control studies with causal variants from disease etiologies randomly sampled from nine multinomial distributions (Figure S3).

Author: Fernando S. Goes (276510)
Hannah Carter (276508)
James B. Potash (171581)
Jennifer Parla (276509)
Mehdi Pirooznia (23876)
Melissa Kramer (257074)
Peter P. Zandi (215438)
Rachel Karchin (61264)
W. Richard McCombie (26862)
Yun-Ching Chen (276507)
Publication venue
Publication date
Field of study

Power estimates for BOMP, VT, SKAT, KBAC (KBAC1P = minor allele frequency defined as , KBAC5P = minor allele frequency defined as ). Each vertical line represents power estimates for each method, based on 250 simulated case-control studies. The genomic individuals each had nine genes, of which three contained causal variants and six did not. The disease etiologies for the three genes with causal variants were randomly sampled from nine multinomial distributions (<a href="http://www.plosgenetics.org/article/info:doi/10.1371/journal.pgen.1003224#pgen.1003224.s003" target="_blank">Figure S3</a>). AA = African-American simple bottleneck demographic model. EA = European-American exponential growth demographic model.</p

FigShare

Eight disease etiologies used in simulation experiments.

Author: Fernando S. Goes (276510)
Hannah Carter (276508)
James B. Potash (171581)
Jennifer Parla (276509)
Mehdi Pirooznia (23876)
Melissa Kramer (257074)
Peter P. Zandi (215438)
Rachel Karchin (61264)
W. Richard McCombie (26862)
Yun-Ching Chen (276507)
Publication venue
Publication date
Field of study

Rare variant = disease caused by multiple rare deleterious variants. Low frequency variant = disease caused by multiple low frequency deleterious variants. Key Region variant = rare deleterious variants are localized to key regions. Common variant = disease caused by a single deleterious common variant. The etiologies Rare+Protect, LowFreq+Protect, KeyRegion+Protect and Common+Protect were identical to the first four except that they include protective variants.1Minor allele frequency of deleterious causal variants,2Selection coefficients of deleterious causal variants,3Effect size of deleterious causal variants,4Selection coefficient of protective causal variants,5Effect size of protective modifier variants,6Required functional role of causal and protective variants, NS = coding non-synonymous, AA = African-American simple bottleneck demographic model <a href="http://www.plosgenetics.org/article/info:doi/10.1371/journal.pgen.1003224#pgen.1003224-Boyko1" target="_blank">[44]</a>, EA = European-American exponential growth demographic model <a href="http://www.plosgenetics.org/article/info:doi/10.1371/journal.pgen.1003224#pgen.1003224-Kryukov1" target="_blank">[19]</a>).* for protective modifier variants with AF5%, for protective modifier variants with AF5%.</p

FigShare

Power estimates for multiple gene case-control studies with causal variants equally likely to be from any disease etiology dominated by rare variants.

Author: Fernando S. Goes (276510)
Hannah Carter (276508)
James B. Potash (171581)
Jennifer Parla (276509)
Mehdi Pirooznia (23876)
Melissa Kramer (257074)
Peter P. Zandi (215438)
Rachel Karchin (61264)
W. Richard McCombie (26862)
Yun-Ching Chen (276507)
Publication venue
Publication date
Field of study

A,B. X-axis shows number of candidate genes in 250 simulated case-control studies (approximately one-third each from disease etiologies Rare, LowFreq and KeyRegion). All genes contain causal variants. For each method, average power is shown. Power increases for all methods as the number of candidate genes with causal variants increases. C,D. X-axis shows the number of candidate genes and the ratio of genes containing causal variants to those that do not contain causal variants. As the ratio decreases, the power of the tested methods also decreases. (Tested methods are BOMP, VT, SKAT and KBAC1P = minor allele frequency defined as , KBAC5P = minor allele frequency defined as ). AA = the case-control studies were drawn from gene populations generated with an African-American simple bottleneck demographic model. EA = the case-control studies were drawn from gene populations generated with a European-American exponential growth demographic model.)</p

FigShare

Analytical comparison of SKAT, BOMP, and VT on a toy example.

Author: Fernando S. Goes (276510)
Hannah Carter (276508)
James B. Potash (171581)
Jennifer Parla (276509)
Mehdi Pirooznia (23876)
Melissa Kramer (257074)
Peter P. Zandi (215438)
Rachel Karchin (61264)
W. Richard McCombie (26862)
Yun-Ching Chen (276507)
Publication venue
Publication date
Field of study

Genotypes of 8 cases and 8 controls at 10 positions. Matrix column colors: controls = light blue, cases = light red. Position distribution bar colors: controls = blue, cases = red. Detailed description is in the section “Toy example with analytical calculations” (<a href="http://www.plosgenetics.org/article/info:doi/10.1371/journal.pgen.1003224#pgen.1003224.s013" target="_blank">Text S1</a>).</p

FigShare

Single gene methods power comparison.

Author: Fernando S. Goes (276510)
Hannah Carter (276508)
James B. Potash (171581)
Jennifer Parla (276509)
Mehdi Pirooznia (23876)
Melissa Kramer (257074)
Peter P. Zandi (215438)
Rachel Karchin (61264)
W. Richard McCombie (26862)
Yun-Ching Chen (276507)
Publication venue
Publication date
Field of study

Power estimates for BOMP, VT, SKAT, KBAC (KBAC1P = minor allele frequency defined as , KBAC5P = minor allele frequency defined as ). Each vertical line represents power estimates for each method, based on 250 simulated case-control studies. AA = the case-control studies were drawn from gene populations generated with an African-American simple bottleneck demographic model. EA = the case-control studies were drawn from gene populations generated with a European-American exponential growth demographic model. The eight variant causality (disease etiology) models are defined in <a href="http://www.plosgenetics.org/article/info:doi/10.1371/journal.pgen.1003224#pgen-1003224-t001" target="_blank">Table 1</a>. Since the European-American demographic model does not account for common or protective variants, etiologies involving common or protective variants were only considered for the African-American demographic model.</p

FigShare

Dallas Heart Study.

Author: Fernando S. Goes (276510)
Hannah Carter (276508)
James B. Potash (171581)
Jennifer Parla (276509)
Mehdi Pirooznia (23876)
Melissa Kramer (257074)
Peter P. Zandi (215438)
Rachel Karchin (61264)
W. Richard McCombie (26862)
Yun-Ching Chen (276507)
Publication venue
Publication date
Field of study

P-values of association between dichotomized triglyceride levels and variation in three ANGPTL family genes sequenced in Dallas Heart Study. ANGPTL - multiple gene set including ANGPTL3, ANGPTL4, and ANGPTL5. The most significant P-value for each is highlighted in bold. BOMP = combined Burden and Position statistics VT = variable threshold burden test <a href="http://www.plosgenetics.org/article/info:doi/10.1371/journal.pgen.1003224#pgen.1003224-Price1" target="_blank">[10]</a> SKAT = sequence kernel association test (linear weighting version) <a href="http://www.plosgenetics.org/article/info:doi/10.1371/journal.pgen.1003224#pgen.1003224-Wu1" target="_blank">[12]</a>, KBAC = Kernel-based adaptive cluster <a href="http://www.plosgenetics.org/article/info:doi/10.1371/journal.pgen.1003224#pgen.1003224-Liu2" target="_blank">[20]</a> (1D = single direction, 2D = two direction, 1P = rare variants defined as MAF, 5P = rare variants defined as MAF). VEST = BOMP and VT with VEST score variant weighting.</p

FigShare

BOMP P-values for gene sets in Bipolar case-control study.

Author: Fernando S. Goes (276510)
Hannah Carter (276508)
James B. Potash (171581)
Jennifer Parla (276509)
Mehdi Pirooznia (23876)
Melissa Kramer (257074)
Peter P. Zandi (215438)
Rachel Karchin (61264)
W. Richard McCombie (26862)
Yun-Ching Chen (276507)
Publication venue
Publication date
Field of study

The gene sets were selected for testing because they contained genes and were the most significantly enriched by synaptic genes <a href="http://www.plosgenetics.org/article/info:doi/10.1371/journal.pgen.1003224#pgen.1003224-Pirooznia1" target="_blank">[26]</a>. Seven of the genes sets were nominally associated with bipolar disorder (P0.05) and have FDR0.1.*FDR computed with the Benjamini-Hochberg algorithm <a href="http://www.plosgenetics.org/article/info:doi/10.1371/journal.pgen.1003224#pgen.1003224-Benjamini1" target="_blank">[45]</a>.**Wall-clock time in minutes.</p

FigShare

BOMP burden and position statistics complement each other.

Author: Fernando S. Goes (276510)
Hannah Carter (276508)
James B. Potash (171581)
Jennifer Parla (276509)
Mehdi Pirooznia (23876)
Melissa Kramer (257074)
Peter P. Zandi (215438)
Rachel Karchin (61264)
W. Richard McCombie (26862)
Yun-Ching Chen (276507)
Publication venue
Publication date
Field of study

Breakdown of contribution of BOMP mutation burden (BOMP_B) and BOMP position distribution (BOMP_P) statistics averaged over single candidate gene power estimates (<a href="http://www.plosgenetics.org/article/info:doi/10.1371/journal.pgen.1003224#pgen-1003224-g001" target="_blank">Figure 1</a>) and multiple candidate gene power estimates (nine genes, 3 with causal variants and 6 with no causal variants) (<a href="http://www.plosgenetics.org/article/info:doi/10.1371/journal.pgen.1003224#pgen-1003224-g003" target="_blank">Figure 3</a>) for case-control study sizes of 200, 1000, 2000, and 5000. Combining the two statistics consistently yielded improved power with respect to each statistic on its own. The BOMP burden statistic had more power than BOMP position for the simulations based on a single candidate gene, and vice versa in the simulations with nine candidate genes and 3∶6 causal to non-causal ratio.</p

FigShare

Components of BOMP Hybrid Likelihood Model compared.

Author: Fernando S. Goes (276510)
Hannah Carter (276508)
James B. Potash (171581)
Jennifer Parla (276509)
Mehdi Pirooznia (23876)
Melissa Kramer (257074)
Peter P. Zandi (215438)
Rachel Karchin (61264)
W. Richard McCombie (26862)
Yun-Ching Chen (276507)
Publication venue
Publication date
Field of study

A. Mutation burden statistic. The Mutation burden statistic uses the aggregated burden for cases, , and controls . B. Mutation position distribution statistic. Aggregated window mutation counts are calculated for cases, , controls, , and cases and controls combined, , across windows.</p

FigShare

Example variation pattern in which position distribution outperforms burden tests.

Author: Fernando S. Goes (276510)
Hannah Carter (276508)
James B. Potash (171581)
Jennifer Parla (276509)
Mehdi Pirooznia (23876)
Melissa Kramer (257074)
Peter P. Zandi (215438)
Rachel Karchin (61264)
W. Richard McCombie (26862)
Yun-Ching Chen (276507)
Publication venue
Publication date
Field of study

A toy example of a genomic region containing variants (blue squares) in cases and controls. We assume that the region is important for phenotype. Variant counts in cases (red). Variant counts in controls (purple). Cases and controls each have a total of 9 variants in this region, so Burden statistics (e.g., VT or BOMP burden) will not be able to detect that the region is important for phenotype. BOMP's position distribution statistic collapses variants into short, localized windows (red dashed lines) and detects that the number of variants seen in cases and controls is different within the windows. We note that a method that does not collapse variants, such as SKAT, does not have much power to detect the difference between cases and controls, because at each position the number of variants in cases and controls is similar.</p

FigShare