
To Control False Positives in Gene-Gene Interaction Analysis: Two Novel Conditional Entropy-Based Approaches

Author: An Fan (495459)
Haoli Li (495460)
Jiheng Qin (495462)
Meihua Lin (82065)
Shaoqi Rao (14985)
Xiaolei Zhao (495461)
Xiaoyu Zuo (199420)
Publication date: 01/01/2013

Genome-wide analysis of gene-gene interactions has been recognized as a powerful avenue for identifying the missing genetic components that cannot be detected by current single-point association analysis. Recently, several model-free methods (e.g., the commonly used information-based metrics and several logistic regression-based metrics) were developed for detecting non-linear dependence between genetic loci, but they are at risk of inflated false positive error, in particular when the main effects at one or both loci are salient. In this study, we proposed two conditional entropy-based metrics to address this limitation. Extensive simulations demonstrated that the two proposed metrics, provided the disease is rare, could maintain a consistently correct false positive rate. In the scenarios for a common disease, our proposed metrics achieved better or comparable control of false positive error compared with four previously proposed model-free metrics. In terms of power, our methods outperformed several competing metrics in a range of common disease models. Furthermore, in real data analyses, both metrics succeeded in detecting interactions and were competitive with the originally reported results or the logistic regression approaches. In conclusion, the proposed conditional entropy-based metrics are promising as alternatives to current model-based approaches for detecting genuine epistatic effects.

CiteSeerX

Public Library of Science (PLOS)

Directory of Open Access Journals

PubMed Central

FigShare

Chi-squared Q-Q plots for the additive-additive model with main effect at both loci (Schema 3).

Author: An Fan (495459)
Haoli Li (495460)
Jiheng Qin (495462)
Meihua Lin (82065)
Shaoqi Rao (14985)
Xiaolei Zhao (495461)
Xiaoyu Zuo (199420)

Top panels: A. GenoMI; B. GenoCMI; C. GameteCMI. Middle panels: D. original Wu et al. statistic; E. adjusted Wu statistic; F. joint effect statistic. Bottom panels: G. logistic regression model with 1 df test; H. logistic regression model with 4 df test.

FigShare

Comparison of P-values in testing gene-gene interaction between hemoglobin (Hb) gene and α+-thalassemia gene.

Author: An Fan (495459)
Haoli Li (495460)
Jiheng Qin (495462)
Meihua Lin (82065)
Shaoqi Rao (14985)
Xiaolei Zhao (495461)
Xiaoyu Zuo (199420)

a Frequencies are shown as No. of cases/No. of controls.
b P-values reported by Williams et al.
c The lowest P-value among logistic regression models assuming additive × additive, dominant × dominant and recessive × recessive interaction models, respectively.
d Obtained by a logistic regression model coding genotypes as factors.

FigShare

Description of simulation schemas.

Author: An Fan (495459)
Haoli Li (495460)
Jiheng Qin (495462)
Meihua Lin (82065)
Shaoqi Rao (14985)
Xiaolei Zhao (495461)
Xiaoyu Zuo (199420)

a In each schema, three two-locus interaction models (additive × additive, dominant × dominant and recessive × recessive) were evaluated.
b OR_G, OR_H, and OR_GH denote the main effect for locus G, the main effect for locus H, and their interaction effect, respectively. “√” indicates that the effect is present. “–” indicates that the effect is absent.
c Disease prevalence (baseline penetrance).
d For Schemas 8 and 9, the interaction effect OR_GH was increased from 1.0 to a value at which the power of the optimal metric achieved 100% at significance level 0.01.

FigShare

Null distribution of the GenoCMI and GameteCMI metrics.

Author: An Fan (495459)
Haoli Li (495460)
Jiheng Qin (495462)
Meihua Lin (82065)
Shaoqi Rao (14985)
Xiaolei Zhao (495461)
Xiaoyu Zuo (199420)

A. The empirical null distribution of GenoCMI, compared to its theoretical distribution χ²(8). B. The empirical null distribution of GameteCMI, compared to its theoretical distribution χ²(2).
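This listing does not reproduce the formulas for GenoCMI or GameteCMI, so the following is only a minimal sketch of a generic conditional mutual information statistic, not the authors' implementation. It assumes the statistic takes the form 2N · I(G; H | D), where G and H are the genotypes at the two loci and D is disease status. The function name `conditional_mi_statistic` and the `counts[d][g][h]` layout are hypothetical. Under that assumption, two 3 × 3 genotype tables (cases and controls) give 2 × (3 − 1) × (3 − 1) = 8 degrees of freedom, and two 2 × 2 gamete-level tables give 2, which is consistent with the χ²(8) and χ²(2) reference distributions named in the caption.

```python
import math

def conditional_mi_statistic(counts):
    """Return (statistic, df) for a stratified contingency table.

    counts[d][g][h] is the number of individuals with disease status d,
    genotype g at the first locus and genotype h at the second locus
    (hypothetical layout, chosen for illustration).
    The statistic is 2 * N * I(G; H | D), with the information in nats.
    """
    n_total = sum(sum(sum(row) for row in table) for table in counts)
    cmi = 0.0
    df = 0
    for table in counts:
        n_d = sum(sum(row) for row in table)
        if n_d == 0:
            continue  # an empty stratum contributes nothing
        row_sums = [sum(row) for row in table]
        col_sums = [sum(col) for col in zip(*table)]
        for g, row in enumerate(table):
            for h, n in enumerate(row):
                if n > 0:
                    # p(g,h,d) * log( p(g,h|d) / (p(g|d) * p(h|d)) )
                    ratio = n * n_d / (row_sums[g] * col_sums[h])
                    cmi += (n / n_total) * math.log(ratio)
        # each stratum adds (rows - 1) * (columns - 1) degrees of freedom
        df += (len(row_sums) - 1) * (len(col_sums) - 1)
    return 2.0 * n_total * cmi, df

# Made-up counts: loci independent within each stratum (no interaction signal).
independent = [[r * c for c in (1, 2, 1)] for r in (10, 20, 10)]
stat, df = conditional_mi_statistic([independent, independent])
print(stat, df)
```

Comparing the statistic against a chi-squared distribution with `df` degrees of freedom would then give a p-value; that step is left out here because it needs a statistics library.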

FigShare

Application of entropy-based statistics for testing gene-gene interaction between SNP309 in MDM2 gene and codon72 polymorphism in p53 gene.

Author: An Fan (495459)
Haoli Li (495460)
Jiheng Qin (495462)
Meihua Lin (82065)
Shaoqi Rao (14985)
Xiaolei Zhao (495461)
Xiaoyu Zuo (199420)

a Frequencies are shown as the number of individuals genotyped as TT/TG/GG at MDM2 309T>G in cases.
b Frequencies are shown as the number of individuals genotyped as TT/TG/GG at MDM2 309T>G in controls.
1 Obtained by a logistic regression model assuming an additive × additive model.
2 Obtained by a logistic regression model assuming a dominant × dominant model.
3 Obtained by a logistic regression model assuming a recessive × recessive model.
4 Obtained by a logistic regression model coding genotypes as factors.
GCC: gastric cardia cancer; LC: lung cancer; HCC: hepatocellular cancer; BC: breast cancer.

FigShare

Chi-squared Q-Q plots for the recessive-recessive model with main effect at both loci, when case/control ratios varied (Schema 7).

Author: An Fan (495459)
Haoli Li (495460)
Jiheng Qin (495462)
Meihua Lin (82065)
Shaoqi Rao (14985)
Xiaolei Zhao (495461)
Xiaoyu Zuo (199420)

Assuming main effects at both loci (OR_G = OR_H = 2.0) and disease prevalence 0.02. Top panels: A. GenoMI; B. GenoCMI; C. GameteCMI. Middle panels: D. original Wu et al. statistic; E. adjusted Wu statistic; F. joint effect statistic. Bottom panels: G. logistic regression model with 1 df test; H. logistic regression model with 4 df test.

FigShare

Chi-squared Q-Q plots for the global null hypothesis (Schema 1).

Author: An Fan (495459)
Haoli Li (495460)
Jiheng Qin (495462)
Meihua Lin (82065)
Shaoqi Rao (14985)
Xiaolei Zhao (495461)
Xiaoyu Zuo (199420)

FigShare

Chi-squared Q-Q plots for the dominant-dominant model with main effect at both loci (Schema 3).

Author: An Fan (495459)
Haoli Li (495460)
Jiheng Qin (495462)
Meihua Lin (82065)
Shaoqi Rao (14985)
Xiaolei Zhao (495461)
Xiaoyu Zuo (199420)

FigShare

False positive rates (type 1 error rates) for testing interaction in common disease with main effect at one locus (Schema 2).

Author: An Fan (495459)
Haoli Li (495460)
Jiheng Qin (495462)
Meihua Lin (82065)
Shaoqi Rao (14985)
Xiaolei Zhao (495461)
Xiaoyu Zuo (199420)

a Logistic regression model with 1 df test for the correct genetic model.
b Logistic regression model with 4 df test coding genotypes as factors.
The disease prevalence is assumed to be 0.02. The significance level is set at 0.01.

FigShare
