8 research outputs found

    The landscape of somatic mutations in infant MLL-rearranged acute lymphoblastic leukemias.

    Full text link
    Infant acute lymphoblastic leukemia (ALL) with MLL rearrangements (MLL-R) represents a distinct leukemia with a poor prognosis. To define its mutational landscape, we performed whole-genome, exome, RNA and targeted DNA sequencing on 65 infants (47 MLL-R and 18 non-MLL-R cases) and 20 older children (MLL-R cases) with leukemia. Our data show that infant MLL-R ALL has one of the lowest frequencies of somatic mutations of any sequenced cancer, with the predominant leukemic clone carrying a mean of 1.3 non-silent mutations. Despite this paucity of mutations, we detected activating mutations in kinase-PI3K-RAS signaling pathway components in 47% of cases. Surprisingly, these mutations were often subclonal and were frequently lost at relapse. In contrast to infant cases, MLL-R leukemia in older children had more somatic mutations (mean of 6.5 mutations/case versus 1.3 mutations/case, P = 7.15 × 10(-5)) and had frequent mutations (45%) in epigenetic regulators, a category of genes that, with the exception of MLL, was rarely mutated in infant MLL-R ALL

    The landscape of somatic mutations in infant MLL-rearranged acute lymphoblastic leukemias

    No full text
    Includes 3 unnumbered pages at the end of the article. Published online 2 March 2015Infant acute lymphoblastic leukemia (ALL) with MLL rearrangements (MLL-R) represents a distinct leukemia with a poor prognosis. To define its mutational landscape, we performed whole-genome, exome, RNA and targeted DNA sequencing on 65 infants (47 MLL-R and 18 non-MLL-R cases) and 20 older children (MLL-R cases) with leukemia. Our data show that infant MLL-R ALL has one of the lowest frequencies of somatic mutations of any sequenced cancer, with the predominant leukemic clone carrying a mean of 1.3 non-silent mutations. Despite this paucity of mutations, we detected activating mutations in kinase-PI3K-RAS signaling pathway components in 47% of cases. Surprisingly, these mutations were often subclonal and were frequently lost at relapse. In contrast to infant cases, MLL-R leukemia in older children had more somatic mutations (mean of 6.5 mutations/case versus 1.3 mutations/case, P = 7.15 × 10(-5)) and had frequent mutations (45%) in epigenetic regulators, a category of genes that, with the exception of MLL, was rarely mutated in infant MLL-R ALL.Anna K Andersson ... Charles G Mullighan ... et al. for The St. Jude Children’s Research Hospital–Washington University Pediatric Cancer Genome Projec

    The genomic landscape of core-binding factor acute myeloid leukemias

    No full text
    Acute myeloid leukemia (AML) comprises a heterogeneous group of leukemias frequently defined by recurrent cytogenetic abnormalities, including rearrangements involving the core-binding factor (CBF) transcriptional complex. To better understand the genomic landscape of CBF-AMLs, we analyzed both pediatric (n = 87) and adult (n = 78) samples, including cases with RUNX1-RUNX1T1 (n = 85) or CBFB-MYH11 (n = 80) rearrangements, by whole-genome or whole-exome sequencing. In addition to known mutations in the Ras pathway, we identified recurrent stabilizing mutations in CCND2, suggesting a previously unappreciated cooperating pathway in CBF-AML. Outside of signaling alterations, RUNX1-RUNX1T1 and CBFB-MYH11 AMLs demonstrated remarkably different spectra of cooperating mutations, as RUNX1-RUNX1T1 cases harbored recurrent mutations in DHX15 and ZBTB7A, as well as an enrichment of mutations in epigenetic regulators, including ASXL2 and the cohesin complex. This detailed analysis provides insights into the pathogenesis and development of CBF-AML, while highlighting dramatic differences in the landscapes of cooperating mutations for these related AML subtypes.Zachary J Faber ... Charles G. Mullighan ... et al

    Towards complete and error-free genome assemblies of all vertebrate species

    Get PDF
    The Vertebrate Genome Project has used an optimized pipeline to generate high-quality genome assemblies for sixteen species (representing all major vertebrate classes), which have led to new biological insights. High-quality and complete reference genome assemblies are fundamental for the application of genomics to biology, disease, and biodiversity conservation. However, such assemblies are available for only a few non-microbial species(1-4). To address this issue, the international Genome 10K (G10K) consortium(5,6) has worked over a five-year period to evaluate and develop cost-effective methods for assembling highly accurate and nearly complete reference genomes. Here we present lessons learned from generating assemblies for 16 species that represent six major vertebrate lineages. We confirm that long-read sequencing technologies are essential for maximizing genome quality, and that unresolved complex repeats and haplotype heterozygosity are major sources of assembly error when not handled correctly. Our assemblies correct substantial errors, add missing sequence in some of the best historical reference genomes, and reveal biological discoveries. These include the identification of many false gene duplications, increases in gene sizes, chromosome rearrangements that are specific to lineages, a repeated independent chromosome breakpoint in bat genomes, and a canonical GC-rich pattern in protein-coding genes and their regulatory regions. Adopting these lessons, we have embarked on the Vertebrate Genomes Project (VGP), an international effort to generate high-quality, complete reference genomes for all of the roughly 70,000 extant vertebrate species and to help to enable a new era of discovery across the life sciences

    Towards complete and error-free genome assemblies of all vertebrate species

    No full text
    High-quality and complete reference genome assemblies are fundamental for the application of genomics to biology, disease, and biodiversity conservation. However, such assemblies are available for only a few non-microbial species1–4. To address this issue, the international Genome 10K (G10K) consortium5,6 has worked over a five-year period to evaluate and develop cost-effective methods for assembling highly accurate and nearly complete reference genomes. Here we present lessons learned from generating assemblies for 16 species that represent six major vertebrate lineages. We confirm that long-read sequencing technologies are essential for maximizing genome quality, and that unresolved complex repeats and haplotype heterozygosity are major sources of assembly error when not handled correctly. Our assemblies correct substantial errors, add missing sequence in some of the best historical reference genomes, and reveal biological discoveries. These include the identification of many false gene duplications, increases in gene sizes, chromosome rearrangements that are specific to lineages, a repeated independent chromosome breakpoint in bat genomes, and a canonical GC-rich pattern in protein-coding genes and their regulatory regions. Adopting these lessons, we have embarked on the Vertebrate Genomes Project (VGP), an international effort to generate high-quality, complete reference genomes for all of the roughly 70,000 extant vertebrate species and to help to enable a new era of discovery across the life sciences

    Towards complete and error-free genome assemblies of all vertebrate species.

    Get PDF
    High-quality and complete reference genome assemblies are fundamental for the application of genomics to biology, disease, and biodiversity conservation. However, such assemblies are available for only a few non-microbial species1-4. To address this issue, the international Genome 10K (G10K) consortium5,6 has worked over a five-year period to evaluate and develop cost-effective methods for assembling highly accurate and nearly complete reference genomes. Here we present lessons learned from generating assemblies for 16 species that represent six major vertebrate lineages. We confirm that long-read sequencing technologies are essential for maximizing genome quality, and that unresolved complex repeats and haplotype heterozygosity are major sources of assembly error when not handled correctly. Our assemblies correct substantial errors, add missing sequence in some of the best historical reference genomes, and reveal biological discoveries. These include the identification of many false gene duplications, increases in gene sizes, chromosome rearrangements that are specific to lineages, a repeated independent chromosome breakpoint in bat genomes, and a canonical GC-rich pattern in protein-coding genes and their regulatory regions. Adopting these lessons, we have embarked on the Vertebrate Genomes Project (VGP), an international effort to generate high-quality, complete reference genomes for all of the roughly 70,000 extant vertebrate species and to help to enable a new era of discovery across the life sciences
    corecore