120 research outputs found

    Distribution-based measures of tumor heterogeneity are sensitive to mutation calling and lack strong clinical predictive power.

    Get PDF
    Mutant allele frequency distributions in cancer samples have been used to estimate intratumoral heterogeneity and its implications for patient survival. However, mutation calls are sensitive to the calling algorithm. It remains unknown whether the relationship of heterogeneity and clinical outcome is robust to these variations. To resolve this question, we studied the robustness of allele frequency distributions to the mutation callers MuTect, SomaticSniper, and VarScan in 4722 cancer samples from The Cancer Genome Atlas. We observed discrepancies among the results, particularly a pronounced difference between allele frequency distributions called by VarScan and SomaticSniper. Survival analysis showed little robust predictive power for heterogeneity as measured by Mutant-Allele Tumor Heterogeneity (MATH) score, with the exception of uterine corpus endometrial carcinoma. However, we found that variations in mutant allele frequencies were mediated by variations in copy number. Our results indicate that the clinical predictions associated with MATH score are primarily caused by copy number aberrations that alter mutant allele frequencies. Finally, we present a mathematical model of linear tumor evolution demonstrating why MATH score is insufficient for distinguishing different scenarios of tumor growth. Our findings elucidate the importance of allele frequency distributions as a measure for tumor heterogeneity and their prognostic role

    Pan-cancer classifications of tumor histological images using deep learning

    Get PDF
    Histopathological images are essential for the diagnosis of cancer type and selection of optimal treatment. However, the current clinical process of manual inspection of images is time consuming and prone to intra- and inter-observer variability. Here we show that key aspects of cancer image analysis can be performed by deep convolutional neural networks (CNNs) across a wide spectrum of cancer types. In particular, we implement CNN architectures based on Google Inception v3 transfer learning to analyze 27815 H&E slides from 23 cohorts in The Cancer Genome Atlas in studies of tumor/normal status, cancer subtype, and mutation status. For 19 solid cancer types we are able to classify tumor/normal status of whole slide images with extremely high AUCs (0.995±0.008). We are also able to classify cancer subtypes within 10 tissue types with AUC values well above random expectations (micro-average 0.87±0.1). We then perform a cross-classification analysis of tumor/normal status across tumor types. We find that classifiers trained on one type are often effective in distinguishing tumor from normal in other cancer types, with the relationships among classifiers matching known cancer tissue relationships. For the more challenging problem of mutational status, we are able to classify TP53 mutations in three cancer types with AUCs from 0.65-0.80 using a fully-trained CNN, and with similar cross-classification accuracy across tissues. These studies demonstrate the power of CNNs for not only classifying histopathological images in diverse cancer types, but also for revealing shared biology between tumors. We have made software available at: https://github.com/javadnoorb/HistCNNFirst author draf

    Mutations in DNA repair genes are associated with increased neo-antigen load and activated T cell infiltration in lung adenocarcinoma.

    Get PDF
    Mutations in DNA repair genes lead to increased genomic instability and mutation frequency. These mutations represent potential biomarkers for cancer immunotherapy efficacy, as high tumor mutational burden has been associated with increased neo-antigens and tumor infiltrating lymphocytes. While mismatch repair mutations have successfully predicted response to anti-PD-1 therapy in colorectal and other cancers, they have not yet been tested for lung cancer, and few have investigated genes from other DNA repair pathways. We utilized TCGA samples to comprehensively immunophenotype lung tumors and analyze the links between DNA repair mutations, neo-antigen and total mutational burden, and tumor immune infiltration. Overall, 73% of lung tumors contained infiltration by at least one T cell subset, with high mutational burden tumors containing significantly increased infiltration by activated CD4 and CD8 T cells. Further, mutations in mismatch repair genes, homologous recombination genes, or POLE accurately predicted increased tumor mutational burden, neo-antigen load, and T cell infiltration. Finally, neo-antigen load correlated with expression of M1-polarized macrophage genes, PD-1, PD-L1, IFNγ, GZMB, and FASLG, among other immune-related genes. Overall, after defining the immune infiltrate in lung tumors, we demonstrate the potential value of utilizing gene mutations from multiple DNA repair pathways as biomarkers for lung cancer immunotherapy. Oncotarget 2018; 9(8):7949-796

    Fostering bioinformatics education through skill development of professors: Big Genomic Data Skills Training for Professors.

    Get PDF
    Bioinformatics has become an indispensable part of life science over the past 2 decades. However, bioinformatics education is not well integrated at the undergraduate level, especially in liberal arts colleges and regional universities in the United States. One significant obstacle pointed out by the Network for Integrating Bioinformatics into Life Sciences Education is the lack of faculty in the bioinformatics area. Most current life science professors did not acquire bioinformatics analysis skills during their own training. Consequently, a great number of undergraduate and graduate students do not get the chance to learn bioinformatics or computational biology skills within a structured curriculum during their education. To address this gap, we developed a module-based, week-long short course to train small college and regional university professors with essential bioinformatics skills. The bioinformatics modules were built to be adapted by the professor-trainees afterward and used in their own classes. All the course materials can be accessed at https://github.com/TheJacksonLaboratory/JAXBD2K-ShortCourse

    Mutations in DNA repair genes are associated with increased neoantigen burden and a distinct immunophenotype in lung squamous cell carcinoma.

    Get PDF
    Deficiencies in DNA repair pathways, including mismatch repair (MMR), have been linked to higher tumor mutation burden and improved response to immune checkpoint inhibitors. However, the significance of MMR mutations in lung cancer has not been well characterized, and the relevance of other processes, including homologous recombination (HR) and polymerase epsilon (POLE) activity, remains unclear. Here, we analyzed a dataset of lung squamous cell carcinoma samples from The Cancer Genome Atlas. Variants in DNA repair genes were associated with increased tumor mutation and neoantigen burden, which in turn were linked with greater tumor infiltration by activated T cells. The subset of tumors with DNA repair gene variants but without T cell infiltration exhibited upregulation of TGF-β and Wnt pathway genes, and a combined score incorporating these genes and DNA repair status accurately predicted immune cell infiltration. Finally, high neoantigen burden was positively associated with genes related to cytolytic activity and immune checkpoints. These findings provide evidence that DNA repair pathway defects and immunomodulatory genes together lead to specific immunophenotypes in lung squamous cell carcinoma and could potentially serve as biomarkers for immunotherapy
    corecore