4,521 research outputs found

    Typing tumors using pathways selected by somatic evolution.

    Get PDF
    Many recent efforts to analyze cancer genomes involve aggregation of mutations within reference maps of molecular pathways and protein networks. Here, we find these pathway studies are impeded by molecular interactions that are functionally irrelevant to cancer or the patient's tumor type, as these interactions diminish the contrast of driver pathways relative to individual frequently mutated genes. This problem can be addressed by creating stringent tumor-specific networks of biophysical protein interactions, identified by signatures of epistatic selection during tumor evolution. Using such an evolutionarily selected pathway (ESP) map, we analyze the major cancer genome atlases to derive a hierarchical classification of tumor subtypes linked to characteristic mutated pathways. These pathways are clinically prognostic and predictive, including the TP53-AXIN-ARHGEF17 combination in liver and CYLC2-STK11-STK11IP in lung cancer, which we validate in independent cohorts. This ESP framework substantially improves the definition of cancer pathways and subtypes from tumor genome data

    Absence of an embryonic stem cell DNA methylation signature in human cancer.

    Get PDF
    BackgroundDifferentiated cells that arise from stem cells in early development contain DNA methylation features that provide a memory trace of their fetal cell origin (FCO). The FCO signature was developed to estimate the proportion of cells in a mixture of cell types that are of fetal origin and are reminiscent of embryonic stem cell lineage. Here we implemented the FCO signature estimation method to compare the fraction of cells with the FCO signature in tumor tissues and their corresponding nontumor normal tissues.MethodsWe applied our FCO algorithm to discovery data sets obtained from The Cancer Genome Atlas (TCGA) and replication data sets obtained from the Gene Expression Omnibus (GEO) data repository. Wilcoxon rank sum tests, linear regression models with adjustments for potential confounders and non-parametric randomization-based tests were used to test the association of FCO proportion between tumor tissues and nontumor normal tissues. P-values of < 0.05 were considered statistically significant.ResultsAcross 20 different tumor types we observed a consistently lower FCO signature in tumor tissues compared with nontumor normal tissues, with 18 observed to have significantly lower FCO fractions in tumor tissue (total n = 6,795 tumor, n = 922 nontumor, P < 0.05). We replicated our findings in 15 tumor types using data from independent subjects in 15 publicly available data sets (total n = 740 tumor, n = 424 nontumor, P < 0.05).ConclusionsThe results suggest that cancer development itself is substantially devoid of recapitulation of normal embryologic processes. Our results emphasize the distinction between DNA methylation in normal tightly regulated stem cell driven differentiation and cancer stem cell reprogramming that involves altered methylation in the service of great cell heterogeneity and plasticity

    Pan-cancer classifications of tumor histological images using deep learning

    Get PDF
    Histopathological images are essential for the diagnosis of cancer type and selection of optimal treatment. However, the current clinical process of manual inspection of images is time consuming and prone to intra- and inter-observer variability. Here we show that key aspects of cancer image analysis can be performed by deep convolutional neural networks (CNNs) across a wide spectrum of cancer types. In particular, we implement CNN architectures based on Google Inception v3 transfer learning to analyze 27815 H&E slides from 23 cohorts in The Cancer Genome Atlas in studies of tumor/normal status, cancer subtype, and mutation status. For 19 solid cancer types we are able to classify tumor/normal status of whole slide images with extremely high AUCs (0.995±0.008). We are also able to classify cancer subtypes within 10 tissue types with AUC values well above random expectations (micro-average 0.87±0.1). We then perform a cross-classification analysis of tumor/normal status across tumor types. We find that classifiers trained on one type are often effective in distinguishing tumor from normal in other cancer types, with the relationships among classifiers matching known cancer tissue relationships. For the more challenging problem of mutational status, we are able to classify TP53 mutations in three cancer types with AUCs from 0.65-0.80 using a fully-trained CNN, and with similar cross-classification accuracy across tissues. These studies demonstrate the power of CNNs for not only classifying histopathological images in diverse cancer types, but also for revealing shared biology between tumors. We have made software available at: https://github.com/javadnoorb/HistCNNFirst author draf

    NELFE-Dependent MYC Signature Identifies a Unique Cancer Subtype in Hepatocellular Carcinoma.

    Get PDF
    The MYC oncogene is dysregulated in approximately 30% of liver cancer. In an effort to exploit MYC as a therapeutic target, including in hepatocellular carcinoma (HCC), strategies have been developed on the basis of MYC amplification or gene translocation. Due to the failure of these strategies to provide accurate diagnostics and prognostic value, we have developed a Negative Elongation Factor E (NELFE)-Dependent MYC Target (NDMT) gene signature. This signature, which consists of genes regulated by MYC and NELFE, an RNA binding protein that enhances MYC-induced hepatocarcinogenesis, is predictive of NELFE/MYC-driven tumors that would otherwise not be identified by gene amplification or translocation alone. We demonstrate the utility of the NDMT gene signature to predict a unique subtype of HCC, which is associated with a poor prognosis in three independent cohorts encompassing diverse etiologies, demographics, and viral status. The application of gene signatures, such as the NDMT signature, offers patients access to personalized risk assessments, which may be utilized to direct future care
    corecore