1,011 research outputs found

    Expansion of airway basal epithelial cells from primary human non-small cell lung cancer tumors

    Get PDF
    Pre-clinical non-small cell lung cancer (NSCLC) models are poorly representative of the considerable inter- and intra-tumor heterogeneity of the disease in patients. Primary cell-based in vitro models of NSCLC are therefore desirable for novel therapy development and personalized cancer medicine. Methods have been described to generate rapidly proliferating epithelial cell cultures from multiple human epithelia using 3T3-J2 feeder cell culture in the presence of Y-27632, a RHO-associated protein kinase (ROCK) inhibitor, in what are known as "conditional reprograming conditions" (CRC) or 3T3+Y. In some cancer studies, variations of this methodology have allowed primary tumor cell expansion across a number of cancer types but other studies have demonstrated the preferential expansion of normal epithelial cells from tumors in such conditions. Here, we report our experience regarding the derivation of primary NSCLC cell cultures from 12 lung adenocarcinoma patients enrolled in the Tracking Cancer Evolution through Therapy (TRACERx) clinical study and discuss these in the context of improving the success rate for in vitro cultivation of cells from NSCLC tumors. This article is protected by copyright. All rights reserved

    Quantifying single nucleotide variant detection sensitivity in exome sequencing

    Get PDF
    BACKGROUND: The targeted capture and sequencing of genomic regions has rapidly demonstrated its utility in genetic studies. Inherent in this technology is considerable heterogeneity of target coverage and this is expected to systematically impact our sensitivity to detect genuine polymorphisms. To fully interpret the polymorphisms identified in a genetic study it is often essential to both detect polymorphisms and to understand where and with what probability real polymorphisms may have been missed. RESULTS: Using down-sampling of 30 deeply sequenced exomes and a set of gold-standard single nucleotide variant (SNV) genotype calls for each sample, we developed an empirical model relating the read depth at a polymorphic site to the probability of calling the correct genotype at that site. We find that measured sensitivity in SNV detection is substantially worse than that predicted from the naive expectation of sampling from a binomial. This calibrated model allows us to produce single nucleotide resolution SNV sensitivity estimates which can be merged to give summary sensitivity measures for any arbitrary partition of the target sequences (nucleotide, exon, gene, pathway, exome). These metrics are directly comparable between platforms and can be combined between samples to give “power estimates” for an entire study. We estimate a local read depth of 13X is required to detect the alleles and genotype of a heterozygous SNV 95% of the time, but only 3X for a homozygous SNV. At a mean on-target read depth of 20X, commonly used for rare disease exome sequencing studies, we predict 5–15% of heterozygous and 1–4% of homozygous SNVs in the targeted regions will be missed. CONCLUSIONS: Non-reference alleles in the heterozygote state have a high chance of being missed when commonly applied read coverage thresholds are used despite the widely held assumption that there is good polymorphism detection at these coverage levels. Such alleles are likely to be of functional importance in population based studies of rare diseases, somatic mutations in cancer and explaining the “missing heritability” of quantitative traits

    Assessing the causal role of epigenetic clocks in the development of multiple cancers: a Mendelian randomization study

    Get PDF
    Background: Epigenetic clocks have been associated with cancer risk in several observational studies. Nevertheless, it is unclear whether they play a causal role in cancer risk or if they act as a non-causal biomarker. Methods: We conducted a two-sample Mendelian randomization (MR) study to examine the genetically predicted effects of epigenetic age acceleration as measured by HannumAge (nine single-nucleotide polymorphisms (SNPs)), Horvath Intrinsic Age (24 SNPs), PhenoAge (11 SNPs), and GrimAge (4 SNPs) on multiple cancers (i.e. breast, prostate, colorectal, ovarian and lung cancer). We obtained genome-wide association data for biological ageing from a meta-analysis (N = 34,710), and for cancer from the UK Biobank (N cases = 2671-13,879; N controls = 173,493-372,016), FinnGen (N cases = 719-8401; N controls = 74,685-174,006) and several international cancer genetic consortia (N cases = 11,348-122,977; N controls = 15,861-105,974). Main analyses were performed using multiplicative random effects inverse variance weighted (IVW) MR. Individual study estimates were pooled using fixed effect meta-analysis. Sensitivity analyses included MR-Egger, weighted median, weighted mode and Causal Analysis using Summary Effect Estimates (CAUSE) methods, which are robust to some of the assumptions of the IVW approach. Results: Meta-analysed IVW MR findings suggested that higher GrimAge acceleration increased the risk of colorectal cancer (OR = 1.12 per year increase in GrimAge acceleration, 95% CI 1.04-1.20, p = 0.002). The direction of the genetically predicted effects was consistent across main and sensitivity MR analyses. Among subtypes, the genetically predicted effect of GrimAge acceleration was greater for colon cancer (IVW OR = 1.15, 95% CI 1.09-1.21, p = 0.006), than rectal cancer (IVW OR = 1.05, 95% CI 0.97-1.13, p = 0.24). Results were less consistent for associations between other epigenetic clocks and cancers. Conclusions: GrimAge acceleration may increase the risk of colorectal cancer. Findings for other clocks and cancers were inconsistent. Further work is required to investigate the potential mechanisms underlying the results. Funding: FMB was supported by a Wellcome Trust PhD studentship in Molecular, Genetic and Lifecourse Epidemiology (224982/Z/22/Z which is part of grant 218495/Z/19/Z). KKT was supported by a Cancer Research UK (C18281/A29019) programme grant (the Integrative Cancer Epidemiology Programme) and by the Hellenic Republic's Operational Programme 'Competitiveness, Entrepreneurship & Innovation' (OΠΣ 5047228). PH was supported by Cancer Research UK (C18281/A29019). RMM was supported by the NIHR Biomedical Research Centre at University Hospitals Bristol and Weston NHS Foundation Trust and the University of Bristol and by a Cancer Research UK (C18281/A29019) programme grant (the Integrative Cancer Epidemiology Programme). RMM is a National Institute for Health Research Senior Investigator (NIHR202411). The views expressed are those of the author(s) and not necessarily those of the NIHR or the Department of Health and Social Care. GDS and CLR were supported by the Medical Research Council (MC_UU_00011/1 and MC_UU_00011/5, respectively) and by a Cancer Research UK (C18281/A29019) programme grant (the Integrative Cancer Epidemiology Programme). REM was supported by an Alzheimer's Society project grant (AS-PG-19b-010) and NIH grant (U01 AG-18-018, PI: Steve Horvath). RCR is a de Pass Vice Chancellor's Research Fellow at the University of Bristol

    Promoter-anchored chromatin interactions predicted from genetic analysis of epigenomic data

    Get PDF
    Promoter-anchored chromatin interactions (PAIs) play a pivotal role in transcriptional regulation. Current high-throughput technologies for detecting PAIs, such as promoter capture Hi-C, are not scalable to large cohorts. Here, we present an analytical approach that uses summary-level data from cohort-based DNA methylation (DNAm) quantitative trait locus (mQTL) studies to predict PAIs. Using mQTL data from human peripheral blood ([Formula: see text]), we predict 34,797 PAIs which show strong overlap with the chromatin contacts identified by previous experimental assays. The promoter-interacting DNAm sites are enriched in enhancers or near expression QTLs. Genes whose promoters are involved in PAIs are more actively expressed, and gene pairs with promoter-promoter interactions are enriched for co-expression. Integration of the predicted PAIs with GWAS data highlight interactions among 601 DNAm sites associated with 15 complex traits. This study demonstrates the use of mQTL data to predict PAIs and provides insights into the role of PAIs in complex trait variation

    Genetic determinants of co-accessible chromatin regions in activated T cells across humans.

    Get PDF
    Over 90% of genetic variants associated with complex human traits map to non-coding regions, but little is understood about how they modulate gene regulation in health and disease. One possible mechanism is that genetic variants affect the activity of one or more cis-regulatory elements leading to gene expression variation in specific cell types. To identify such cases, we analyzed ATAC-seq and RNA-seq profiles from stimulated primary CD4+ T cells in up to 105 healthy donors. We found that regions of accessible chromatin (ATAC-peaks) are co-accessible at kilobase and megabase resolution, consistent with the three-dimensional chromatin organization measured by in situ Hi-C in T cells. Fifteen percent of genetic variants located within ATAC-peaks affected the accessibility of the corresponding peak (local-ATAC-QTLs). Local-ATAC-QTLs have the largest effects on co-accessible peaks, are associated with gene expression and are enriched for autoimmune disease variants. Our results provide insights into how natural genetic variants modulate cis-regulatory elements, in isolation or in concert, to influence gene expression

    Alu pair exclusions in the human genome

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The human genome contains approximately one million <it>Alu </it>elements which comprise more than 10% of human DNA by mass. <it>Alu </it>elements possess direction, and are distributed almost equally in positive and negative strand orientations throughout the genome. Previously, it has been shown that closely spaced <it>Alu </it>pairs in opposing orientation (inverted pairs) are found less frequently than <it>Alu </it>pairs having the same orientation (direct pairs). However, this imbalance has only been investigated for <it>Alu </it>pairs separated by 650 or fewer base pairs (bp) in a study conducted prior to the completion of the draft human genome sequence.</p> <p>Results</p> <p>We performed a comprehensive analysis of all (> 800,000) full-length <it>Alu </it>elements in the human genome. This large sample size permits detection of small differences in the ratio between inverted and direct <it>Alu </it>pairs (I:D). We have discovered a significant depression in the full-length <it>Alu </it>pair I:D ratio that extends to repeat pairs separated by ≤ 350,000 bp. Within this imbalance bubble (those <it>Alu </it>pairs separated by ≤ 350,000 bp), direct pairs outnumber inverted pairs. Using PCR, we experimentally verified several examples of inverted <it>Alu </it>pair exclusions that were caused by deletions.</p> <p>Conclusions</p> <p>Over 50 million full-length <it>Alu </it>pairs reside within the I:D imbalance bubble. Their collective impact may represent one source of <it>Alu </it>element-related human genomic instability that has not been previously characterized.</p

    MegaSNPHunter: a learning approach to detect disease predisposition SNPs and high level interactions in genome wide association study

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The interactions of multiple single nucleotide polymorphisms (SNPs) are highly hypothesized to affect an individual's susceptibility to complex diseases. Although many works have been done to identify and quantify the importance of multi-SNP interactions, few of them could handle the genome wide data due to the combinatorial explosive search space and the difficulty to statistically evaluate the high-order interactions given limited samples.</p> <p>Results</p> <p>Three comparative experiments are designed to evaluate the performance of MegaSNPHunter. The first experiment uses synthetic data generated on the basis of epistasis models. The second one uses a genome wide study on Parkinson disease (data acquired by using Illumina HumanHap300 SNP chips). The third one chooses the rheumatoid arthritis study from Wellcome Trust Case Control Consortium (WTCCC) using Affymetrix GeneChip 500K Mapping Array Set. MegaSNPHunter outperforms the best solution in this area and reports many potential interactions for the two real studies.</p> <p>Conclusion</p> <p>The experimental results on both synthetic data and two real data sets demonstrate that our proposed approach outperforms the best solution that is currently available in handling large-scale SNP data both in terms of speed and in terms of detection of potential interactions that were not identified before. To our knowledge, MegaSNPHunter is the first approach that is capable of identifying the disease-associated SNP interactions from WTCCC studies and is promising for practical disease prognosis.</p
    corecore