21 research outputs found

    mPies: a novel metaproteomics tool for the creation of relevant protein databases and automatized protein annotation

    Get PDF
    Metaproteomics allows to decipher the structure and functionality of microbial communities. Despite its rapid development, crucial steps such as the creation of standardized protein search databases and reliable protein annotation remain challenging. To overcome those critical steps, we developed a new program named mPies (metaProteomics in environmental sciences). mPies allows the creation of protein databases derived from assembled or unassembled metagenomes, and/or public repositories based on taxon IDs, gene or protein names. For the first time, mPies facilitates the automatization of reliable taxonomic and functional consensus annotations at the protein group level, minimizing the well-known protein inference issue, which is commonly encountered in metaproteomics. mPies’ workflow is highly customizable with regards to input data, workflow steps, and parameter adjustment. mPies is implemented in Python 3/Snakemake and freely available on GitHub: https://github.com/johanneswerner/mPies/

    The genomic and transcriptional landscape of primary central nervous system lymphoma

    Get PDF
    Primary lymphomas of the central nervous system (PCNSL) are mainly diffuse large B-cell lymphomas (DLBCLs) confined to the central nervous system (CNS). Molecular drivers of PCNSL have not been fully elucidated. Here, we profile and compare the whole-genome and transcriptome landscape of 51 CNS lymphomas (CNSL) to 39 follicular lymphoma and 36 DLBCL cases outside the CNS. We find recurrent mutations in JAK-STAT, NFkB, and B-cell receptor signaling pathways, including hallmark mutations in MYD88 L265P (67%) and CD79B (63%), and CDKN2A deletions (83%). PCNSLs exhibit significantly more focal deletions of HLA-D (6p21) locus as a potential mechanism of immune evasion. Mutational signatures correlating with DNA replication and mitosis are significantly enriched in PCNSL. TERT gene expression is significantly higher in PCNSL compared to activated B-cell (ABC)-DLBCL. Transcriptome analysis clearly distinguishes PCNSL and systemic DLBCL into distinct molecular subtypes. Epstein-Barr virus (EBV)+ CNSL cases lack recurrent mutational hotspots apart from IG and HLA-DRB loci. We show that PCNSL can be clearly distinguished from DLBCL, having distinct expression profiles, IG expression and translocation patterns, as well as specific combinations of genetic alterations

    Retrospective evaluation of whole exome and genome mutation calls in 746 cancer samples

    No full text
    Funder: NCI U24CA211006Abstract: The Cancer Genome Atlas (TCGA) and International Cancer Genome Consortium (ICGC) curated consensus somatic mutation calls using whole exome sequencing (WES) and whole genome sequencing (WGS), respectively. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2,658 cancers across 38 tumour types, we compare WES and WGS side-by-side from 746 TCGA samples, finding that ~80% of mutations overlap in covered exonic regions. We estimate that low variant allele fraction (VAF < 15%) and clonal heterogeneity contribute up to 68% of private WGS mutations and 71% of private WES mutations. We observe that ~30% of private WGS mutations trace to mutations identified by a single variant caller in WES consensus efforts. WGS captures both ~50% more variation in exonic regions and un-observed mutations in loci with variable GC-content. Together, our analysis highlights technological divergences between two reproducible somatic variant detection efforts

    Whole genome sequencing puts forward hypotheses on metastasis evolution and therapy in colorectal cancer

    Get PDF
    The evolution and genetic nature of metastatic lesions is not completely characterized. Here the authors perform a comprehensive whole-genome study of colorectal metastases in comparison to matched primary tumors and define a multistage progression model and metastasis-specific changes that, in part, are therapeutically actionable

    Mutational mechanisms shaping the coding and noncoding genome of germinal center derived B-cell lymphomas

    Get PDF
    B cells have the unique property to somatically alter their immunoglobulin (IG) genes by V(D)J recombination, somatic hypermutation (SHM) and class-switch recombination (CSR). Aberrant targeting of these mechanisms is implicated in lymphomagenesis, but the mutational processes are poorly understood. By performing whole genome and transcriptome sequencing of 181 germinal center derived B-cell lymphomas (gcBCL) we identified distinct mutational signatures linked to SHM and CSR. We show that not only SHM, but presumably also CSR causes off-target mutations in non-IG genes. Kataegis clusters with high mutational density mainly affected early replicating regions and were enriched for SHM- and CSR-mediated off-target mutations. Moreover, they often co-occurred in loci physically interacting in the nucleus, suggesting that mutation hotspots promote increased mutation targeting of spatially co-localized loci (termed hypermutation by proxy). Only around 1% of somatic small variants were in protein coding sequences, but in about half of the driver genes, a contribution of B-cell specific mutational processes to their mutations was found. The B-cell-specific mutational processes contribute to both lymphoma initiation and intratumoral heterogeneity. Overall, we demonstrate that mutational processes involved in the development of gcBCL are more complex than previously appreciated, and that B cell-specific mutational processes contribute via diverse mechanisms to lymphomagenesis

    Germline elongator mutations in Sonic Hedgehog medulloblastoma

    No full text
    Cancer genomics has revealed many genes and core molecular processes that contribute to human malignancies, but the genetic and molecular bases of many rare cancers remains unclear. Genetic predisposition accounts for 5 to 10% of cancer diagnoses in children(1,2), and genetic events that cooperate with known somatic driver events are poorly understood. Pathogenic germline variants in established cancer predisposition genes have been recently identified in 5% of patients with the malignant brain tumour medulloblastoma(3). Here, by analysing all protein-coding genes, we identify and replicate rare germline loss-of-function variants across ELP1 in 14% of paediatric patients with the medulloblastoma subgroup Sonic Hedgehog (MBSHH)(.)ELP1 was the most common medulloblastoma predisposition gene and increased the prevalence of genetic predisposition to 40% among paediatric patients with MBSHH. Parent-offspring and pedigree analyses identified two families with a history of paediatric medulloblastoma. ELP1-associated medulloblastomas were restricted to the molecular SHH alpha subtype(4) and characterized by universal biallelic inactivation of ELP1 owing to somatic loss of chromosome arm 9q. Most ELP1-associated medulloblastomas also exhibited somatic alterations in PTCH1, which suggests that germline ELP1 loss-of-function variants predispose individuals to tumour development in combination with constitutive activation of SHH signalling. ELP1 is the largest subunit of the evolutionarily conserved Elongator complex, which catalyses translational elongation through tRNA modifications at the wobble (U-34) position(5,6). Tumours from patients with ELP1-associated MBSHH were characterized by a destabilized Elongator complex, loss of Elongator-dependent tRNA modifications, codon-dependent translational reprogramming, and induction of the unfolded protein response, consistent with loss of protein homeostasis due to Elongator deficiency in model systems(7-9). Thus, genetic predisposition to proteome instability may be a determinant in the pathogenesis of paediatric brain cancers. These results support investigation of the role of protein homeostasis in other cancer types and potential for therapeutic interference.Germline mutations in the Elongator complex gene ELP1 predispose individuals to the development of childhood medulloblastoma

    Sex differences in oncogenic mutational processes

    Get PDF
    Sex differences have been observed in multiple facets of cancer epidemiology, treatment and biology, and in most cancers outside the sex organs. Efforts to link these clinical differences to specific molecular features have focused on somatic mutations within the coding regions of the genome. Here we report a pan-cancer analysis of sex differences in whole genomes of 1983 tumours of 28 subtypes as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium. We both confirm the results of exome studies, and also uncover previously undescribed sex differences. These include sex-biases in coding and non-coding cancer drivers, mutation prevalence and strikingly, in mutational signatures related to underlying mutational processes. These results underline the pervasiveness of molecular sex differences and strengthen the call for increased consideration of sex in molecular cancer research.Sex differences have been observed in multiple facets of cancer epidemiology, treatment and biology, and in most cancers outside the sex organs. Efforts to link these clinical differences to specific molecular features have focused on somatic mutations within the coding regions of the genome. Here we report a pan-cancer analysis of sex differences in whole genomes of 1983 tumours of 28 subtypes as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium. We both confirm the results of exome studies, and also uncover previously undescribed sex differences. These include sex-biases in coding and non-coding cancer drivers, mutation prevalence and strikingly, in mutational signatures related to underlying mutational processes. These results underline the pervasiveness of molecular sex differences and strengthen the call for increased consideration of sex in molecular cancer research.Peer reviewe
    corecore