53 research outputs found

    A General Framework for Formal Tests of Interaction after Exhaustive Search Methods with Applications to MDR and MDR-PDT

    Get PDF
    The initial presentation of multifactor dimensionality reduction (MDR) featured cross-validation to mitigate over-fitting, computationally efficient searches of the epistatic model space, and variable construction with constructive induction to alleviate the curse of dimensionality. However, the method was unable to differentiate association signals arising from true interactions from those due to independent main effects at individual loci. This issue leads to problems in inference and interpretability for the results from MDR and the family-based compliment the MDR-pedigree disequilibrium test (PDT). A suggestion from previous work was to fit regression models post hoc to specifically evaluate the null hypothesis of no interaction for MDR or MDR-PDT models. We demonstrate with simulation that fitting a regression model on the same data as that analyzed by MDR or MDR-PDT is not a valid test of interaction. This is likely to be true for any other procedure that searches for models, and then performs an uncorrected test for interaction. We also show with simulation that when strong main effects are present and the null hypothesis of no interaction is true, that MDR and MDR-PDT reject at far greater than the nominal rate. We also provide a valid regression-based permutation test procedure that specifically tests the null hypothesis of no interaction, and does not reject the null when only main effects are present. The regression-based permutation test implemented here conducts a valid test of interaction after a search for multilocus models, and can be applied to any method that conducts a search to find a multilocus model representing an interaction

    Genetic Epidemiology of Tuberculosis Susceptibility: Impact of Study Design

    Get PDF
    Several candidate gene studies have provided evidence for a role of host genetics in susceptibility to tuberculosis (TB). However, the results of these studies have been very inconsistent, even within a study population. Here, we review the design of these studies from a genetic epidemiological perspective, illustrating important differences in phenotype definition in both cases and controls, consideration of latent M. tuberculosis infection versus active TB disease, population genetic factors such as population substructure and linkage disequilibrium, polymorphism selection, and potential global differences in M. tuberculosis strain. These considerable differences between studies should be accounted for when examining the current literature. Recommendations are made for future studies to further clarify the host genetics of TB

    Interleukin 12B (IL12B) Genetic Variation and Pulmonary Tuberculosis: A Study of Cohorts from The Gambia, Guinea-Bissau, United States and Argentina

    Get PDF
    We examined whether polymorphisms in interleukin-12B (IL12B) associate with susceptibility to pulmonary tuberculosis (PTB) in two West African populations (from The Gambia and Guinea-Bissau) and in two independent populations from North and South America. Nine polymorphisms (seven SNPs, one insertion/deletion, one microsatellite) were analyzed in 321 PTB cases and 346 controls from Guinea-Bissau and 280 PTB cases and 286 controls from The Gambia. For replication we studied 281 case and 179 control African-American samples and 221 cases and 144 controls of European ancestry from the US and Argentina. First-stage single locus analyses revealed signals of association at IL12B 3′ UTR SNP rs3212227 (unadjusted allelic p = 0.04; additive genotypic p = 0.05, OR = 0.78, 95% CI [0.61–0.99]) in Guinea-Bissau and rs11574790 (unadjusted allelic p = 0.05; additive genotypic p = 0.05, OR = 0.76, 95% CI [0.58–1.00]) in The Gambia. Association of rs3212227 was then replicated in African-Americans (rs3212227 allelic p = 0.002; additive genotypic p = 0.05, OR = 0.78, 95% CI [0.61–1.00]); most importantly, in the African-American cohort, multiple significant signals of association (seven of the nine polymorphisms tested) were detected throughout the gene. These data suggest that genetic variation in IL12B, a highly relevant candidate gene, is a risk factor for PTB in populations of African ancestry, although further studies will be required to confirm this association and identify the precise mechanism underlying it

    Genome-wide association and epidemiological analyses reveal common genetic origins between uterine leiomyomata and endometriosis.

    Get PDF
    Uterine leiomyomata (UL) are the most common neoplasms of the female reproductive tract and primary cause for hysterectomy, leading to considerable morbidity and high economic burden. Here we conduct a GWAS meta-analysis in 35,474 cases and 267,505 female controls of European ancestry, identifying eight novel genome-wide significant (P < 5 × 10-8) loci, in addition to confirming 21 previously reported loci, including multiple independent signals at 10 loci. Phenotypic stratification of UL by heavy menstrual bleeding in 3409 cases and 199,171 female controls reveals genome-wide significant associations at three of the 29 UL loci: 5p15.33 (TERT), 5q35.2 (FGFR4) and 11q22.3 (ATM). Four loci identified in the meta-analysis are also associated with endometriosis risk; an epidemiological meta-analysis across 402,868 women suggests at least a doubling of risk for UL diagnosis among those with a history of endometriosis. These findings increase our understanding of genetic contribution and biology underlying UL development, and suggest overlapping genetic origins with endometriosis.This study was supported by the U.S. National Institutes of Health (NIH)/Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) grant HD060530 to C.C.M. C.C.M. is also supported by the NIHR Manchester Biomedical Research Centre. N.M. acknowledges support from the Academy of Finland (295693) and Orion Research Foundation. H.R.H. is supported by NIH K22 CA193860. T.F. is supported by the NIHR Biomedical Research Centre, Oxford. S.E.M. is supported by the National Health and Medical Research Council (NHMRC) Fellowship Scheme (1103623)

    Using genetic variation and environmental risk factor data to identify individuals at high risk for age-related macular degeneration

    Get PDF
    A major goal of personalized medicine is to pre-symptomatically identify individuals at high risk for disease using knowledge of each individual's particular genetic profile and constellation of environmental risk factors. With the identification of several well-replicated risk factors for age-related macular degeneration (AMD), the leading cause of legal blindness in older adults, this previously unreachable goal is beginning to seem less elusive. However, recently developed algorithms have either been much less accurate than expected, given the strong effects of the identified risk factors, or have not been applied to independent datasets, leaving unknown how well they would perform in the population at large. We sought to increase accuracy by using novel modeling strategies, including multifactor dimensionality reduction (MDR) and grammatical evolution of neural networks (GENN), in addition to the traditional logistic regression approach. Furthermore, we rigorously designed and tested our models in three distinct datasets: a Vanderbilt-Miami (VM) clinic-based case-control dataset, a VM family dataset, and the population-based Age-related Maculopathy Ancillary (ARMA) Study cohort. Using a consensus approach to combine the results from logistic regression and GENN models, our algorithm was successful in differentiating between high- and low-risk groups (sensitivity 77.0%, specificity 74.1%). In the ARMA cohort, the positive and negative predictive values were 63.3% and 70.7%, respectively. We expect that future efforts to refine this algorithm by increasing the sample size available for model building, including novel susceptibility factors as they are discovered, and by calibrating the model for diverse populations will improve accuracy

    Genome-wide association and epidemiological analyses reveal common genetic origins between uterine leiomyomata and endometriosis

    Get PDF
    Uterine leiomyomata (UL) are the most common neoplasms of the female reproductive tract and primary cause for hysterectomy, leading to considerable morbidity and high economic burden. Here we conduct a GWAS meta-analysis in 35,474 cases and 267,505 female controls of European ancestry, identifying eight novel genome-wide significant (P < 5 × 10−8) loci, in addition to confirming 21 previously reported loci, including multiple independent signals at 10 loci. Phenotypic stratification of UL by heavy menstrual bleeding in 3409 cases and 199,171 female controls reveals genome-wide significant associations at three of the 29 UL loci: 5p15.33 (TERT), 5q35.2 (FGFR4) and 11q22.3 (ATM). Four loci identified in the meta-analysis are also associated with endometriosis risk; an epidemiological meta-analysis across 402,868 women suggests at least a doubling of risk for UL diagnosis among those with a history of endometriosis. These findings increase our understanding of genetic contribution and biology underlying UL development, and suggest overlapping genetic origins with endometriosis

    A transcriptome-wide association study among 97,898 women to identify candidate susceptibility genes for epithelial ovarian cancer risk

    Get PDF
    Large-scale genome-wide association studies (GWAS) have identified approximately 35 loci associated with epithelial ovarian cancer (EOC) risk. The majority of GWAS-identified disease susceptibility variants are located in non-coding regions, and causal genes underlying these associations remain largely unknown. Here we performed a transcriptome-wide association study to search for novel genetic loci and plausible causal genes at known GWAS loci. We used RNA sequencing data (68 normal ovarian-tissue samples from 68 individuals and 6,124 cross-tissue samples from 369 individuals) and high-density genotyping data from European descendants of the Genotype-Tissue Expression (GTEx V6) project to build ovarian and cross-tissue models of genetically regulated expression using elastic net methods. We evaluated 17,121 genes for their cis-predicted gene expression in relation to EOC risk using summary statistics data from GWAS of 97,898 women, including 29,396 EOC cases. With a Bonferroni-corrected significance level of P<2.2×10-6, we identified 35 genes including FZD4 at 11q14.2 (Z=5.08, P=3.83×10-7, the cross-tissue model; 1 Mb away from any GWAS-identified EOC risk variant), a potential novel locus for EOC risk. All other 34 significantly-associated genes were located within 1 Mb of known GWAS-identified loci, including 23 genes at 6 loci not previously linked to EOC risk. Upon conditioning on nearby known EOC GWAS-identified variants, the associations for 31 genes disappeared and 3 genes remained (P<1.47 x 10-3). These data identify one novel locus (FZD4) and 34 genes at 13 known EOC risk loci associated with EOC risk, providing new insights into EOC carcinogenesis

    Identification of 12 new susceptibility loci for different histotypes of epithelial ovarian cancer.

    Get PDF
    To identify common alleles associated with different histotypes of epithelial ovarian cancer (EOC), we pooled data from multiple genome-wide genotyping projects totaling 25,509 EOC cases and 40,941 controls. We identified nine new susceptibility loci for different EOC histotypes: six for serous EOC histotypes (3q28, 4q32.3, 8q21.11, 10q24.33, 18q11.2 and 22q12.1), two for mucinous EOC (3q22.3 and 9q31.1) and one for endometrioid EOC (5q12.3). We then performed meta-analysis on the results for high-grade serous ovarian cancer with the results from analysis of 31,448 BRCA1 and BRCA2 mutation carriers, including 3,887 mutation carriers with EOC. This identified three additional susceptibility loci at 2q13, 8q24.1 and 12q24.31. Integrated analyses of genes and regulatory biofeatures at each locus predicted candidate susceptibility genes, including OBFC1, a new candidate susceptibility gene for low-grade and borderline serous EOC
    corecore