25 research outputs found

    Comprehensive analyses of DNA repair pathways, smoking and bladder cancer risk in Los Angeles and Shanghai

    Get PDF
    Tobacco smoking is a bladder cancer risk factor and a source of carcinogens that induce DNA damage to urothelial cells. Using data and samples from 988 cases and 1,004 controls enrolled in the Los Angeles County Bladder Cancer Study and the Shanghai Bladder Cancer Study, we investigated associations between bladder cancer risk and 632 tagSNPs that comprehensively capture genetic variation in 28 DNA repair genes from four DNA repair pathways: base excision repair (BER), nucleotide excision repair (NER), non-homologous end-joining (NHEJ) and homologous recombination repair (HHR). Odds ratios (ORs) and 95% confidence intervals (CIs) for each tagSNP were corrected for multiple testing for all SNPs within each gene using pACT and for genes within each pathway and across pathways with Bonferroni. Gene and pathway summary estimates were obtained using ARTP. We observed an association between bladder cancer and POLB rs7832529 (BER) (pACT = 0.003; ppathway = 0.021) among all, and SNPs in XPC (NER) and OGG1 (BER) among Chinese men and women, respectively. The NER pathway showed an overall association with risk among Chinese males (ARTP NER p = 0.034). The XRCC6 SNP rs2284082 (NHEJ), also in LD with SREBF2, showed an interaction with smoking (smoking status interaction pgene = 0.001, ppathway = 0.008, poverall = 0.034). Our findings support a role in bladder carcinogenesis for regions that map close to or within BER (POLB, OGG1) and NER genes (XPC). A SNP that tags both the XRCC6 and SREBF2 genes strongly modifies the association between bladder cancer risk and smoking

    LD-Spline: Mapping SNPs on genotyping platforms to genomic regions using patterns of linkage disequilibrium

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Gene-centric analysis tools for genome-wide association study data are being developed both to annotate single locus statistics and to prioritize or group single nucleotide polymorphisms (SNPs) prior to analysis. These approaches require knowledge about the relationships between SNPs on a genotyping platform and genes in the human genome. SNPs in the genome can represent broader genomic regions via linkage disequilibrium (LD), and population-specific patterns of LD can be exploited to generate a data-driven map of SNPs to genes.</p> <p>Methods</p> <p>In this study, we implemented LD-Spline, a database routine that defines the genomic boundaries a particular SNP represents using linkage disequilibrium statistics from the International HapMap Project. We compared the LD-Spline haplotype block partitioning approach to that of the four gamete rule and the Gabriel et al. approach using simulated data; in addition, we processed two commonly used genome-wide association study platforms.</p> <p>Results</p> <p>We illustrate that LD-Spline performs comparably to the four-gamete rule and the Gabriel et al. approach; however as a SNP-centric approach LD-Spline has the added benefit of systematically identifying a genomic boundary for each SNP, where the global block partitioning approaches may falter due to sampling variation in LD statistics.</p> <p>Conclusion</p> <p>LD-Spline is an integrated database routine that quickly and effectively defines the genomic region marked by a SNP using linkage disequilibrium, with a SNP-centric block definition algorithm.</p

    Functional informed genome-wide interaction analysis of body mass index, diabetes and colorectal cancer risk

    Get PDF
    BACKGROUND: Body mass index (BMI) and diabetes are established risk factors for colorectal cancer (CRC), likely through perturbations in metabolic traits (e.g. insulin resistance and glucose homeostasis). Identification of interactions between variation in genes and these metabolic risk factors may identify novel biologic insights into CRC etiology. METHODS: To improve statistical power and interpretation for gene-environment interaction (G × E) testing, we tested genetic variants that regulate expression of a gene together for interaction with BMI (kg/m2 ) and diabetes on CRC risk among 26 017 cases and 20 692 controls. Each variant was weighted based on PrediXcan analysis of gene expression data from colon tissue generated in the Genotype-Tissue Expression Project for all genes with heritability ≥1%. We used a mixed-effects model to jointly measure the G × E interaction in a gene by partitioning the interactions into the predicted gene expression levels (fixed effects), and residual G × E effects (random effects). G × BMI analyses were stratified by sex as BMI-CRC associations differ by sex. We used false discovery rates to account for multiple comparisons and reported all results with FDR <0.2. RESULTS: Among 4839 genes tested, genetically predicted expressions of FOXA1 (P = 3.15 × 10-5 ), PSMC5 (P = 4.51 × 10-4 ) and CD33 (P = 2.71 × 10-4 ) modified the association of BMI on CRC risk for men; KIAA0753 (P = 2.29 × 10-5 ) and SCN1B (P = 2.76 × 10-4 ) modified the association of BMI on CRC risk for women; and PTPN2 modified the association between diabetes and CRC risk in both sexes (P = 2.31 × 10-5 ). CONCLUSIONS: Aggregating G × E interactions and incorporating functional information, we discovered novel genes that may interact with BMI and diabetes on CRC risk

    A Novel Kernel for Correcting Size Bias in the Logistic Kernel Machine Test with an Application to Rheumatoid Arthritis

    Full text link
    OBJECTIVES: The logistic kernel machine test (LKMT) is a testing procedure tailored towards high-dimensional genetic data. Its use in pathway analyses of GWA case-control studies results from its computational efficiency and flexibility of incorporating additional information via the kernel. The kernel can be any positive definite function; unfortunately its form strongly influences the power and bias. Most authors have recommended the use of the simple linear kernel. We demonstrate via a simulation that the probability of rejecting the null hypothesis of no association just by chance increases with the number of SNPs or genes in the pathway when applying this kernel. METHODS: We propose a novel kernel that includes an appropriate standardization, in order to protect against any inflation of false positive results. Moreover, our novel kernel contains information on gene membership of SNPs in the pathway. RESULTS: In an application to data from the NARAC Rheumatoid Arthritis Consortium, we find that even this basic genomic structure can improve the ability of the LKMT to identify meaningful associations. We also demonstrate that the standardization effectively eliminates problems with size bias. CONCLUSION: We recommend the use of our standardized kernel and urge caution when using non-adjusted kernels in the LKMT to conduct pathway analysis
    corecore