51 research outputs found

    I am hiQ—a novel pair of accuracy indices for imputed genotypes

    Get PDF
    Background: Imputation of untyped markers is a standard tool in genome-wide association studies to close the gap between directly genotyped and other known DNA variants. However, high accuracy with which genotypes are imputed is fundamental. Several accuracy measures have been proposed and some are implemented in imputation software, unfortunately diversely across platforms. In the present paper, we introduce Iam hiQ, an independent pair of accuracy measures that can be applied to dosage files, the output of all imputation software. Iam (imputation accuracy measure) quantifies the average amount of individual-specific versus population-specific genotype information in a linear manner. hiQ (heterogeneity in quantities of dosages) addresses the inter-individual heterogeneity between dosages of a marker across the sample at hand. Results: Applying both measures to a large case–control sample of the International Lung Cancer Consortium (ILCCO), comprising 27,065 individuals, we found meaningful thresholds for Iam and hiQ suitable to classify markers of poor accuracy. We demonstrate how Manhattan-like plots and moving averages of Iam and hiQ can be useful to identify regions enriched with less accurate imputed markers, whereas these regions would by missed when applying the accuracy measure info (implemented in IMPUTE2). Conclusion: We recommend using Iam hiQ additional to other accuracy scores for variant filtering before stepping into the analysis of imputed GWAS data

    Gene–gene interaction of AhRwith and within the Wntcascade affects susceptibility to lung cancer

    Get PDF
    Background: Aberrant Wnt signalling, regulating cell development and stemness, influences the development of many cancer types. The Aryl hydrocarbon receptor (AhR) mediates tumorigenesis of environmental pollutants. Complex interaction patterns of genes assigned to AhR/Wnt-signalling were recently associated with lung cancer susceptibility. Aim: To assess the association and predictive ability of AhR/Wnt-genes with lung cancer in cases and controls of European descent. Methods: Odds ratios (OR) were estimated for genomic variants assigned to the Wnt agonist and the antagonistic genes DKK2, DKK3, DKK4, FRZB, SFRP4 and Axin2. Logistic regression models with variable selection were trained, validated and tested to predict lung cancer, at which other previously identified SNPs that have been robustly associated with lung cancer risk could also enter the model. Furthermore, decision trees were created to investigate variant × variant interaction. All analyses were performed for overall lung cancer and for subgroups. Results: No genome-wide significant association of AhR/Wnt-genes with overall lung cancer was observed, but within the subgroups of ever smokers (e.g., maker rs2722278 SFRP4; OR = 1.20; 95% CI 1.13–1.27; p = 5.6 × 10–10) and never smokers (e.g., maker rs1133683 Axin2; OR = 1.27; 95% CI 1.19–1.35; p = 1.0 × 10–12). Although predictability is poor, AhR/Wnt-variants are unexpectedly overrepresented in optimized prediction scores for overall lung cancer and for small cell lung cancer. Remarkably, the score for never-smokers contained solely two AhR/Wnt-variants. The optimal decision tree for never smokers consists of 7 AhR/Wnt-variants and only two lung cancer variants. Conclusions: The role of variants belonging to Wnt/AhR-pathways in lung cancer susceptibility may be underrated in main-effects association analysis. Complex interaction patterns in individuals of European descent have moderate predictive capacity for lung cancer or subgroups thereof, especially in never smokers

    Gene–gene interaction of AhRwith and within the Wntcascade affects susceptibility to lung cancer

    Get PDF
    Background Aberrant Wnt signalling, regulating cell development and stemness, influences the development of many cancer types. The Aryl hydrocarbon receptor (AhR) mediates tumorigenesis of environmental pollutants. Complex interaction patterns of genes assigned to AhR/Wnt-signalling were recently associated with lung cancer susceptibility. Aim To assess the association and predictive ability of AhR/Wnt-genes with lung cancer in cases and controls of European descent. Methods Odds ratios (OR) were estimated for genomic variants assigned to the Wnt agonist and the antagonistic genes DKK2, DKK3, DKK4, FRZB, SFRP4 and Axin2. Logistic regression models with variable selection were trained, validated and tested to predict lung cancer, at which other previously identified SNPs that have been robustly associated with lung cancer risk could also enter the model. Furthermore, decision trees were created to investigate variant x variant interaction. All analyses were performed for overall lung cancer and for subgroups. Results No genome-wide significant association of AhR/Wnt-genes with overall lung cancer was observed, but within the subgroups of ever smokers (e.g., maker rs2722278 SFRP4; OR = 1.20; 95% CI 1.13-1.27; p = 5.6 x 10(-10)) and never smokers (e.g., maker rs1133683 Axin2; OR = 1.27; 95% CI 1.19-1.35; p = 1.0 x 10(-12)). Although predictability is poor, AhR/Wnt-variants are unexpectedly overrepresented in optimized prediction scores for overall lung cancer and for small cell lung cancer. Remarkably, the score for never-smokers contained solely two AhR/Wnt-variants. The optimal decision tree for never smokers consists of 7 AhR/Wnt-variants and only two lung cancer variants. Conclusions The role of variants belonging to Wnt/AhR-pathways in lung cancer susceptibility may be underrated in main-effects association analysis. Complex interaction patterns in individuals of European descent have moderate predictive capacity for lung cancer or subgroups thereof, especially in never smokers

    Impact on Disease Development, Genomic Location and Biological Function of Copy Number Alterations in Non-Small Cell Lung Cancer

    Get PDF
    Lung cancer, of which more than 80% is non-small cell, is the leading cause of cancer-related death in the United States. Copy number alterations (CNAs) in lung cancer have been shown to be positionally clustered in certain genomic regions. However, it remains unclear whether genes with copy number changes are functionally clustered. Using a dense single nucleotide polymorphism array, we performed genome-wide copy number analyses of a large collection of non-small cell lung tumors (n = 301). We proposed a formal statistical test for CNAs between different groups (e.g., non-involved lung vs. tumors, early vs. late stage tumors). We also customized the gene set enrichment analysis (GSEA) algorithm to investigate the overrepresentation of genes with CNAs in predefined biological pathways and gene sets (i.e., functional clustering). We found that CNAs events increase substantially from germline, early stage to late stage tumor. In addition to genomic position, CNAs tend to occur away from the gene locations, especially in germline, non-involved tissue and early stage tumors. Such tendency decreases from germline to early stage and then to late stage tumors, suggesting a relaxation of selection during tumor progression. Furthermore, genes with CNAs in non-small cell lung tumors were enriched in certain gene sets and biological pathways that play crucial roles in oncogenesis and cancer progression, demonstrating the functional aspect of CNAs in the context of biological pathways that were overlooked previously. We conclude that CNAs increase with disease progression and CNAs are both positionally and functionally clustered. The potential functional capabilities acquired via CNAs may be sufficient for normal cells to transform into malignant cells

    Immune-mediated genetic pathways resulting in pulmonary function impairment increase lung cancer susceptibility

    Get PDF
    Impaired lung function is often caused by cigarette smoking, making it challenging to disentangle its role in lung cancer susceptibility. Investigation of the shared genetic basis of these phenotypes in the UK Biobank and International Lung Cancer Consortium (29,266 cases, 56,450 controls) shows that lung cancer is genetically correlated with reduced forced expiratory volume in one second (FEV1: r(g) = 0.098, p = 2.3 x 10(-8)) and the ratio of FEV1 to forced vital capacity (FEV1/FVC: r(g) = 0.137, p = 2.0 x 10(-12)). Mendelian randomization analyses demonstrate that reduced FEV1 increases squamous cell carcinoma risk (odds ratio (OR) = 1.51, 95% confidence intervals: 1.21-1.88), while reduced FEV1/FVC increases the risk of adenocarcinoma (OR = 1.17, 1.01-1.35) and lung cancer in never smokers (OR = 1.56, 1.05-2.30). These findings support a causal role of pulmonary impairment in lung cancer etiology. Integrative analyses reveal that pulmonary function instruments, including 73 novel variants, influence lung tissue gene expression and implicate immune-related pathways in mediating the observed effects on lung carcinogenesis

    Informed Conditioning on Clinical Covariates Increases Power in Case-Control Association Studies

    Get PDF
    Genetic case-control association studies often include data on clinical covariates, such as body mass index (BMI), smoking status, or age, that may modify the underlying genetic risk of case or control samples. For example, in type 2 diabetes, odds ratios for established variants estimated from low–BMI cases are larger than those estimated from high–BMI cases. An unanswered question is how to use this information to maximize statistical power in case-control studies that ascertain individuals on the basis of phenotype (case-control ascertainment) or phenotype and clinical covariates (case-control-covariate ascertainment). While current approaches improve power in studies with random ascertainment, they often lose power under case-control ascertainment and fail to capture available power increases under case-control-covariate ascertainment. We show that an informed conditioning approach, based on the liability threshold model with parameters informed by external epidemiological information, fully accounts for disease prevalence and non-random ascertainment of phenotype as well as covariates and provides a substantial increase in power while maintaining a properly controlled false-positive rate. Our method outperforms standard case-control association tests with or without covariates, tests of gene x covariate interaction, and previously proposed tests for dealing with covariates in ascertained data, with especially large improvements in the case of case-control-covariate ascertainment. We investigate empirical case-control studies of type 2 diabetes, prostate cancer, lung cancer, breast cancer, rheumatoid arthritis, age-related macular degeneration, and end-stage kidney disease over a total of 89,726 samples. In these datasets, informed conditioning outperforms logistic regression for 115 of the 157 known associated variants investigated (P-value = 1×10−9). The improvement varied across diseases with a 16% median increase in χ2 test statistics and a commensurate increase in power. This suggests that applying our method to existing and future association studies of these diseases may identify novel disease loci

    Replication of Lung Cancer Susceptibility Loci at Chromosomes 15q25, 5p15, and 6p21: A Pooled Analysis From the International Lung Cancer Consortium

    Get PDF
    Background Genome-wide association studies have identified three chromosomal regions at 15q25, 5p15, and 6p21 as being associated with the risk of lung cancer. To confirm these associations in independent studies and investigate heterogeneity of these associations within specific subgroups, we conducted a coordinated genotyping study within the International Lung Cancer Consortium based on independent studies that were not included in previous genome-wide association studies. Methods Genotype data for single-nucleotide polymorphisms at chromosomes 15q25 (rs16969968, rs8034191), 5p15 (rs2736100, rs402710), and 6p21 (rs2256543, rs4324798) from 21 case-control studies for 11 645 lung cancer case patients and 14 954 control subjects, of whom 85% were white and 15% were Asian, were pooled. Associations between the variants and the risk of lung cancer were estimated by logistic regression models. All statistical tests were two-sided. Results Associations between 15q25 and the risk of lung cancer were replicated in white ever-smokers (rs16969968: odds ratio [OR] = 1.26, 95% confidence interval [CI] = 1.21 to 1.32, Ptrend = 2 × 10−26), and this association was stronger for those diagnosed at younger ages. There was no association in never-smokers or in Asians between either of the 15q25 variants and the risk of lung cancer. For the chromosome 5p15 region, we confirmed statistically significant associations in whites for both rs2736100 (OR = 1.15, 95% CI = 1.10 to 1.20, Ptrend = 1 × 10−10) and rs402710 (OR = 1.14, 95% CI = 1.09 to 1.19, Ptrend = 5 × 10−8) and identified similar associations in Asians (rs2736100: OR = 1.23, 95% CI = 1.12 to 1.35, Ptrend = 2 × 10−5; rs402710: OR = 1.15, 95% CI = 1.04 to 1.27, Ptrend = .007). The associations between the 5p15 variants and lung cancer differed by histology; odds ratios for rs2736100 were highest in adenocarcinoma and for rs402710 were highest in adenocarcinoma and squamous cell carcinomas. This pattern was observed in both ethnic groups. Neither of the two variants on chromosome 6p21 was associated with the risk of lung cancer. Conclusions In this international genetic association study of lung cancer, previous associations found in white populations were replicated and new associations were identified in Asian populations. Future genetic studies of lung cancer should include detailed stratification by histolog

    Lung Cancer Risk in Never-Smokers of European Descent is Associated With Genetic Variation in the 5(p)15.33 TERT-CLPTM1Ll Region

    Get PDF
    Introduction: Inherited susceptibility to lung cancer risk in never-smokers is poorly understood. The major reason for this gap in knowledge is that this disease is relatively uncommon (except in Asians), making it difficult to assemble an adequate study sample. In this study we conducted a genome-wide association study on the largest, to date, set of European-descent never-smokers with lung cancer. Methods: We conducted a two-phase (discovery and replication) genome-wide association study in never-smokers of European descent. We further augmented the sample by performing a meta-analysis with never-smokers from the recent OncoArray study, which resulted in a total of 3636 cases and 6295 controls. We also compare our findings with those in smokers with lung cancer. Results: We detected three genome-wide statistically significant single nucleotide polymorphisms rs31490 (odds ratio [OR]: 0.769, 95% confidence interval [CI]: 0.722-0.820; p value 5.31 x 10(-16)), rs380286 (OR: 0.770, 95% CI: 0.723-0.820; p value 4.32 x 10(-16)), and rs4975616 OR: 0.778, 95% CI: 0.730-0.829; p value 1.04 x 10(-14)). All three mapped to Chromosome 5 CLPTM1L-TERT region, previously shown to be associated with lung cancer risk in smokers and in never-smoker Asian women, and risk of other cancers including breast, ovarian, colorectal, and prostate. Conclusions: We found that genetic susceptibility to lung cancer in never-smokers is associated to genetic variants with pan-cancer risk effects. The comparison with smokers shows that top variants previously shown to be associated with lung cancer risk only confer risk in the presence of tobacco exposure, underscoring the importance of gene-environment interactions in the etiology of this disease. (C) 2019 International Association for the Study of Lung Cancer. Published by Elsevier Inc. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

    CYP2A6 activity and cigarette consumption interact in smoking-related lung cancer susceptibility.

    Get PDF
    Cigarette smoke, containing both nicotine and carcinogens, causes lung cancer. However, not all smokers develop lung cancer, highlighting the importance of the interaction between host susceptibility and environmental exposure in tumorigenesis. Here, we aimed to delineate the interaction between metabolizing ability of tobacco carcinogens and smoking intensity in mediating genetic susceptibility to smoking-related lung tumorigenesis. Single-variant and gene-based associations of 43 tobacco carcinogen-metabolizing genes with lung cancer were analyzed using summary statistics and individual-level genetic data, followed by causal inference of Mendelian randomization, mediation analysis, and structural equation modeling. Cigarette smoke-exposed cell models were used to detect gene expression patterns in relation to specific alleles. Data from the International Lung Cancer Consortium (29,266 cases and 56,450 controls) and UK Biobank (2,155 cases and 376,329 controls) indicated that the genetic variant rs56113850 C>T located in intron 4 of CYP2A6 was significantly associated with decreased lung cancer risk among smokers [odds ratio (OR) = 0.88, 95% confidence interval = 0.85-0.91, P = 2.18×10-16], which might interact (Pinteraction = 0.028) with and partially be mediated (ORindirect = 0.987) by smoking status. Smoking intensity accounted for 82.3% of the effect of CYP2A6 activity on lung cancer risk but entirely mediated the genetic effect of rs56113850. Mechanistically, the rs56113850 T allele rescued the downregulation of CYP2A6 caused by cigarette smoke exposure, potentially through preferential recruitment of transcription factor HLTF. Together, this study provides additional insights into the interplay between host susceptibility and carcinogen exposure in smoking-related lung tumorigenesis
    • …
    corecore