227 research outputs found

    Nonnegative/binary matrix factorization with a D-Wave quantum annealer

    Full text link
    D-Wave quantum annealers represent a novel computational architecture and have attracted significant interest, but have been used for few real-world computations. Machine learning has been identified as an area where quantum annealing may be useful. Here, we show that the D-Wave 2X can be effectively used as part of an unsupervised machine learning method. This method can be used to analyze large datasets. The D-Wave only limits the number of features that can be extracted from the dataset. We apply this method to learn the features from a set of facial images

    The repertoire of mutational signatures in human cancer

    Get PDF
    Somatic mutations in cancer genomes are caused by multiple mutational processes, each of which generates a characteristic mutational signature(1). Here, as part of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium(2) of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA), we characterized mutational signatures using 84,729,690 somatic mutations from 4,645 whole-genome and 19,184 exome sequences that encompass most types of cancer. We identified 49 single-base-substitution, 11 doublet-base-substitution, 4 clustered-base-substitution and 17 small insertion-and-deletion signatures. The substantial size of our dataset, compared with previous analyses(3-15), enabled the discovery of new signatures, the separation of overlapping signatures and the decomposition of signatures into components that may represent associated-but distinct-DNA damage, repair and/or replication mechanisms. By estimating the contribution of each signature to the mutational catalogues of individual cancer genomes, we revealed associations of signatures to exogenous or endogenous exposures, as well as to defective DNA-maintenance processes. However, many signatures are of unknown cause. This analysis provides a systematic perspective on the repertoire of mutational processes that contribute to the development of human cancer.Peer reviewe

    An overview of mutational and copy number signatures in human cancer

    Get PDF
    The genome of each cell in the human body is constantly under assault from a plethora of exogenous and endogenous processes that can damage DNA. If not successfully repaired, DNA damage generally becomes permanently imprinted in cells, and all their progenies, as somatic mutations. In most cases, the patterns of these somatic mutations contain the tell-tale signs of the mutagenic processes that have imprinted and are termed mutational signatures. Recent pan-cancer genomic analyses have elucidated the compendium of mutational signatures for all types of small mutational events, including: (i) single base substitutions; (ii) doublet base substitutions; and (iii) small insertions/deletions. In contrast to small mutational events, where, in most cases, DNA damage is a prerequisite, aneuploidy, which refers to the abnormal number of chromosomes in a cell, usually develops from mistakes during DNA replication. Such mistakes include DNA replication stress, mitotic errors caused by faulty microtubule dynamics, or cohesion defects that contribute to chromosomal breakage and can lead to copy number alterations or even to structural rearrangements. These aberrations also leave behind genomic scars which can be inferred from sequencing as copy number signatures and rearrangement signatures. The analyses of mutational signatures of small mutational events have been extensively reviewed [1-3], so we will not comprehensively re-examine them here. Rather, our focus will be on summarizing the existing knowledge for mutational signatures of copy number alterations. As studying copy number signatures is an emerging field, we briefly summarize the utility that mutational signatures of small mutational events have provided in basic science, cancer treatment, and cancer prevention and we emphasize the future role that copy number signatures may play in each of these fields

    Deciphering signatures of mutational processes operative in human cancer.

    Get PDF
    The genome of a cancer cell carries somatic mutations that are the cumulative consequences of the DNA damage and repair processes operative during the cellular lineage between the fertilized egg and the cancer cell. Remarkably, these mutational processes are poorly characterized. Global sequencing initiatives are yielding catalogs of somatic mutations from thousands of cancers, thus providing the unique opportunity to decipher the signatures of mutational processes operative in human cancer. However, until now there have been no theoretical models describing the signatures of mutational processes operative in cancer genomes and no systematic computational approaches are available to decipher these mutational signatures. Here, by modeling mutational processes as a blind source separation problem, we introduce a computational framework that effectively addresses these questions. Our approach provides a basis for characterizing mutational signatures from cancer-derived somatic mutational catalogs, paving the way to insights into the pathogenetic mechanism underlying all cancers

    A mutational signature in gastric cancer suggests therapeutic strategies.

    Get PDF
    Targeting defects in the DNA repair machinery of neoplastic cells, for example, those due to inactivating BRCA1 and/or BRCA2 mutations, has been used for developing new therapies in certain types of breast, ovarian and pancreatic cancers. Recently, a mutational signature was associated with failure of double-strand DNA break repair by homologous recombination based on its high mutational burden in samples harbouring BRCA1 or BRCA2 mutations. In pancreatic cancer, all responders to platinum therapy exhibit this mutational signature including a sample that lacked any defects in BRCA1 or BRCA2. Here, we examine 10,250 cancer genomes across 36 types of cancer and demonstrate that, in addition to breast, ovarian and pancreatic cancers, gastric cancer is another cancer type that exhibits this mutational signature. Our results suggest that 7-12% of gastric cancers have defective double-strand DNA break repair by homologous recombination and may benefit from either platinum therapy or PARP inhibitors

    DNA dynamics play a role as a basal transcription factor in the positioning and regulation of gene transcription initiation

    Get PDF
    We assess the role of DNA breathing dynamics as a determinant of promoter strength and transcription start site (TSS) location. We compare DNA Langevin dynamic profiles of representative gene promoters, calculated with the extended non-linear PBD model of DNA with experimental data on transcription factor binding and transcriptional activity. Our results demonstrate that DNA dynamic activity at the TSS can be suppressed by mutations that do not affect basal transcription factor binding–DNA contacts. We use this effect to establish the separate contributions of transcription factor binding and DNA dynamics to transcriptional activity. Our results argue against a purely ‘transcription factor-centric’ view of transcription initiation, suggesting that both DNA dynamics and transcription factor binding are necessary conditions for transcription initiation

    The genome as a record of environmental exposure.

    Get PDF
    Whole genome sequencing of human tumours has revealed distinct patterns of mutation that hint at the causative origins of cancer. Experimental investigations of the mutations and mutation spectra induced by environmental mutagens have traditionally focused on single genes. With the advent of faster cheaper sequencing platforms, it is now possible to assess mutation spectra in experimental models across the whole genome. As a proof of principle, we have examined the whole genome mutation profiles of mouse embryo fibroblasts immortalised following exposure to benzo[a]pyrene (BaP), ultraviolet light (UV) and aristolochic acid (AA). The results reveal that each mutagen induces a characteristic mutation signature: predominantly G→T mutations for BaP, C→T and CC→TT for UV and A→T for AA. The data are not only consistent with existing knowledge but also provide additional information at higher levels of genomic organisation. The approach holds promise for identifying agents responsible for mutations in human tumours and for shedding light on the aetiology of human cancer

    Mapping clustered mutations in cancer reveals APOBEC3 mutagenesis of ecDNA

    Get PDF
    Clustered somatic mutations are common in cancer genomes and previous analyses reveal several types of clustered single-base substitutions, which include doublet- and multi-base substitutions1–5, diffuse hypermutation termed omikli6, and longer strand-coordinated events termed kataegis3,7–9. Here we provide a comprehensive characterization of clustered substitutions and clustered small insertions and deletions (indels) across 2,583 whole-genome-sequenced cancers from 30 types of cancer10. Clustered mutations were highly enriched in driver genes and associated with differential gene expression and changes in overall survival. Several distinct mutational processes gave rise to clustered indels, including signatures that were enriched in tobacco smokers and homologous-recombination-deficient cancers. Doublet-base substitutions were caused by at least 12 mutational processes, whereas most multi-base substitutions were generated by either tobacco smoking or exposure to ultraviolet light. Omikli events, which have previously been attributed to APOBEC3 activity6, accounted for a large proportion of clustered substitutions; however, only 16.2% of omikli matched APOBEC3 patterns. Kataegis was generated by multiple mutational processes, and 76.1% of all kataegic events exhibited mutational patterns that are associated with the activation-induced deaminase (AID) and APOBEC3 family of deaminases. Co-occurrence of APOBEC3 kataegis and extrachromosomal DNA (ecDNA), termed kyklonas (Greek for cyclone), was found in 31% of samples with ecDNA. Multiple distinct kyklonic events were observed on most mutated ecDNA. ecDNA containing known cancer genes exhibited both positive selection and kyklonic hypermutation. Our results reveal the diversity of clustered mutational processes in human cancer and the role of APOBEC3 in recurrently mutating and fuelling the evolution of ecDNA
    corecore