86 research outputs found

    Compositional data analysis with Red-R

    Get PDF
    The compositional analyst must use a series of software to transform raw compositional data and run statistical analyses on them. Tools for compositional data analysis are available in R, an open source widely-used statistical computing environment. However, using R requires prior programming knowledge. Red-R is an open-source, user-friendly visual data flow interface based on R. The interface uses principles of pipeline programming where functions are represented as icons, termed widgets, and data flows from one function to another by drawing lines between them on a canvas. Red-R is able to perform common data analysis tasks (hypothesis tests, analysis of variance, regressions, principal component analysis, data cloud plots, bar plots, biplots, etc.). We have developed a novel Red-R package which implements the compositions package in R. Our compositions package can be used to perform compositional data operations over raw data (closure, additive, centered and isometric log ratio transformations, perturbations and powering, etc.), and create compositional plots (ternary diagrams, ilrdendrograms, etc.) without prior programming knowledge, after few basic operations. The objective of this work is to present Red-R and its compositions package using an application example for geochemical data. The network of widgets provides an easy-tofollow step-by-step procedure to run a large number of operations available in R, hence facilitating the tasks of the compositional data analyst. Furthermore, the entire analysis network can be saved and reloaded. Reports can be generated from the widget network to document and share results. Non-programmers can have an easy access to the advanced tools available in compositions analysis

    Ampullary cancers harbor ELF3 tumor suppressor gene mutations and exhibit frequent WNT dysregulation

    Get PDF
    The ampulla of Vater is a complex cellular environment from which adenocarcinomas arise to form a group of histopathologically heterogenous tumors. To evaluate the molecular features of these tumors, 98 ampullary adenocarcinomas were evaluated and compared to 44 distal bile duct and 18 duodenal adenocarcinomas. Genomic analyses revealed mutations in the WNT signaling pathway among half of the patients and in all three adenocarcinomas irrespective of their origin and histological morphology. These tumors were characterized by a high frequency of inactivating mutations of ELF3, a high rate of microsatellite instability, and common focal deletions and amplifications, suggesting common attributes in the molecular pathogenesis are at play in these tumors. The high frequency of WNT pathway activating mutation, coupled with small-molecule inhibitors of β-catenin in clinical trials, suggests future treatment decisions for these patients may be guided by genomic analysis

    Enhanced metastatic risk assessment in cutaneous squamous cell carcinoma with the 40-gene expression profile test

    Get PDF
    Aim: To clinically validate the 40-gene expression profile (40-GEP) test for cutaneous squamous cell carcinoma patients and evaluate coupling the test with individual clinicopathologic risk factor-based assessment methods. Patients & methods: In a 33-site study, primary tumors with known patient outcomes were assessed under clinical testing conditions (n = 420). The 40-GEP results were integrated with clinicopathologic risk factors. Kaplan–Meier and Cox regression analyses were performed for metastasis. Results: The 40-GEP test demonstrated significant prognostic value. Risk classification was improved via integration of 40-GEP results with clinicopathologic risk factor-based assessment, with metastasis rates near the general cutaneous squamous cell carcinoma population for Class 1 and ≥50% for Class 2B. Conclusion: Combining molecular profiling with clinicopathologic risk factor assessment enhances stratification of cutaneous squamous cell carcinoma patients and may inform decision-making for risk-appropriate management strategies

    Genomic, Pathway Network, and Immunologic Features Distinguishing Squamous Carcinomas

    Get PDF
    This integrated, multiplatform PanCancer Atlas study co-mapped and identified distinguishing molecular features of squamous cell carcinomas (SCCs) from five sites associated with smokin

    Pan-Cancer Analysis of lncRNA Regulation Supports Their Targeting of Cancer Genes in Each Tumor Context

    Get PDF
    Long noncoding RNAs (lncRNAs) are commonly dys-regulated in tumors, but only a handful are known toplay pathophysiological roles in cancer. We inferredlncRNAs that dysregulate cancer pathways, onco-genes, and tumor suppressors (cancer genes) bymodeling their effects on the activity of transcriptionfactors, RNA-binding proteins, and microRNAs in5,185 TCGA tumors and 1,019 ENCODE assays.Our predictions included hundreds of candidateonco- and tumor-suppressor lncRNAs (cancerlncRNAs) whose somatic alterations account for thedysregulation of dozens of cancer genes and path-ways in each of 14 tumor contexts. To demonstrateproof of concept, we showed that perturbations tar-geting OIP5-AS1 (an inferred tumor suppressor) andTUG1 and WT1-AS (inferred onco-lncRNAs) dysre-gulated cancer genes and altered proliferation ofbreast and gynecologic cancer cells. Our analysis in-dicates that, although most lncRNAs are dysregu-lated in a tumor-specific manner, some, includingOIP5-AS1, TUG1, NEAT1, MEG3, and TSIX, synergis-tically dysregulate cancer pathways in multiple tumorcontexts

    Spatial Organization and Molecular Correlation of Tumor-Infiltrating Lymphocytes Using Deep Learning on Pathology Images

    Get PDF
    Beyond sample curation and basic pathologic characterization, the digitized H&E-stained images of TCGA samples remain underutilized. To highlight this resource, we present mappings of tumorinfiltrating lymphocytes (TILs) based on H&E images from 13 TCGA tumor types. These TIL maps are derived through computational staining using a convolutional neural network trained to classify patches of images. Affinity propagation revealed local spatial structure in TIL patterns and correlation with overall survival. TIL map structural patterns were grouped using standard histopathological parameters. These patterns are enriched in particular T cell subpopulations derived from molecular measures. TIL densities and spatial structure were differentially enriched among tumor types, immune subtypes, and tumor molecular subtypes, implying that spatial infiltrate state could reflect particular tumor cell aberration states. Obtaining spatial lymphocytic patterns linked to the rich genomic characterization of TCGA samples demonstrates one use for the TCGA image archives with insights into the tumor-immune microenvironment

    Pan-cancer Alterations of the MYC Oncogene and Its Proximal Network across the Cancer Genome Atlas

    Get PDF
    Although theMYConcogene has been implicated incancer, a systematic assessment of alterations ofMYC, related transcription factors, and co-regulatoryproteins, forming the proximal MYC network (PMN),across human cancers is lacking. Using computa-tional approaches, we define genomic and proteo-mic features associated with MYC and the PMNacross the 33 cancers of The Cancer Genome Atlas.Pan-cancer, 28% of all samples had at least one ofthe MYC paralogs amplified. In contrast, the MYCantagonists MGA and MNT were the most frequentlymutated or deleted members, proposing a roleas tumor suppressors.MYCalterations were mutu-ally exclusive withPIK3CA,PTEN,APC,orBRAFalterations, suggesting that MYC is a distinct onco-genic driver. Expression analysis revealed MYC-associated pathways in tumor subtypes, such asimmune response and growth factor signaling; chro-matin, translation, and DNA replication/repair wereconserved pan-cancer. This analysis reveals insightsinto MYC biology and is a reference for biomarkersand therapeutics for cancers with alterations ofMYC or the PMN

    The repertoire of mutational signatures in human cancer.

    Get PDF
    Somatic mutations in cancer genomes are caused by multiple mutational processes, each of which generates a characteristic mutational signature1. Here, as part of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium2 of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA), we characterized mutational signatures using 84,729,690 somatic mutations from 4,645 whole-genome and 19,184 exome sequences that encompass most types of cancer. We identified 49 single-base-substitution, 11 doublet-base-substitution, 4 clustered-base-substitution and 17 small insertion-and-deletion signatures. The substantial size of our dataset, compared with previous analyses3-15, enabled the discovery of new signatures, the separation of overlapping signatures and the decomposition of signatures into components that may represent associated-but distinct-DNA damage, repair and/or replication mechanisms. By estimating the contribution of each signature to the mutational catalogues of individual cancer genomes, we revealed associations of signatures to exogenous or endogenous exposures, as well as to defective DNA-maintenance processes. However, many signatures are of unknown cause. This analysis provides a systematic perspective on the repertoire of mutational processes that contribute to the development of human cancer
    • …
    corecore