Search CORE

424 research outputs found

MuSiC: Identifying mutational significance in cancer genomes

Author: Callaway Matthew B.
Dees Nathan D.
Ding Li
Dooling David
Kandoth Cyriac
Kobolt Daniel C.
Mardis Elaine R.
Mooney Thomas B.
Schierding William
Wendl Michael C.
Wilson Richard K.
Zhang Qunyuan
Publication venue: Digital Commons@Becker
Publication date: 01/01/2012
Field of study

Massively parallel sequencing technology and the associated rapidly decreasing sequencing costs have enabled systemic analyses of somatic mutations in large cohorts of cancer cases. Here we introduce a comprehensive mutational analysis pipeline that uses standardized sequence-based inputs along with multiple types of clinical data to establish correlations among mutation sites, affected genes and pathways, and to ultimately separate the commonly abundant passenger mutations from the truly significant events. In other words, we aim to determine the Mutational Significance in Cancer (MuSiC) for these large data sets. The integration of analytical operations in the MuSiC framework is widely applicable to a broad set of tumor types and offers the benefits of automation as well as standardization. Herein, we describe the computational structure and statistical underpinnings of the MuSiC pipeline and demonstrate its performance using 316 ovarian cancer samples from the TCGA ovarian cancer project. MuSiC correctly confirms many expected results, and identifies several potentially novel avenues for discovery

Crossref

Digital Commons@Becker

PubMed Central

Final report on project SP1210: Lowland peatland systems in England and Wales – evaluating greenhouse gas fluxes and carbon balances

Author: Baird Andrew
Brown Emma
Burden Annette
Callaghan Nathan
Chapman Pippa
Cumming Alex
Dean Hannah
Dixon Simon
Dooling Gemma
Evans Chris
Evans Jonathan
Gauci Vincent
Grayson Richard
Haddaway Neal
He Yufeng
Heppell Kate
Holden Joseph
Hughes Steve
Jones Davey
Kaduk Jörg
Matthews Rachel
Menichino Nina
Misselbrook Tom
Morrison Ross
Page Sue
Pan Gong
Peacock Michael
Rayment Mark
Ridley Luke
Robinson Inma
Rylett Dan
Scowen Matthew
Stanley Kieran
Williamson Jenny
Worrall Fred
Publication venue: Centre for Ecology and Hydrology
Publication date: 01/01/2016
Field of study

Lowland peatlands represent one of the most carbon-rich ecosystems in the UK. As a result of widespread habitat modification and drainage to support agriculture and peat extraction, they have been converted from natural carbon sinks into major carbon sources, and are now amongst the largest sources of greenhouse gas (GHG) emissions from the UK land-use sector. Despite this, they have previously received relatively little policy attention, and measures to reduce GHG emissions either through re-wetting and restoration or improved management of agricultural land remain at a relatively early stage. In part, this has stemmed from a lack of reliable measurements on the carbon and GHG balance of UK lowland peatlands. This project aimed to address this evidence gap via an unprecedented programme of consistent, multi year field measurements at a total of 15 lowland peatland sites in England and Wales, ranging from conservation managed ‘near-natural’ ecosystems to intensively managed agricultural and extraction sites. The use of standardised measurement and data analysis protocols allowed the magnitude of GHG emissions and removals by peatlands to be quantified across this heterogeneous data set, and for controlling factors to be identified. The network of seven flux towers established during the project is believed to be unique on peatlands globally, and has provided new insights into the processes the control GHG fluxes in lowland peatlands. The work undertaken is intended to support the future development and implementation of agricultural management and restoration measures aimed at reducing the contribution of these important ecosystems to UK GHG emissions

University of Birmingham Research Portal

Open Research Online (The Open University)

Recommended from our members

A novel retinoblastoma therapy from genomic and epigenetic analyses.

Author: Bahrami Armita
Benavente Claudia A
Brennan Rachel
Chen Xiang
Ding Li
Dooling David J
Downing James R
Dyer Michael A
Dyson Nicholas J
Easton John
Ellison David
Flores-Otero Jacqueline
Fulton Lucinda L
Fulton Robert S
Gupta Pankaj
Hong Xin
Lu Charles
Ma Jing
Manning Amity L
Mardis Elaine R
McEvoy Justina
Mukatira Suraj
Mullighan Charles
Naeve Clayton
Neale Geoff
Ochoa Kerri
Pounds Stanley
Rusch Michael
Shurtleff Sheila
Ulyanov Anatoly
Wang Jianmin
Wilson Matthew
Wilson Richard K
Wu Gang
Zhang Jinghui
Zhao David
Publication venue: eScholarship, University of California
Publication date: 01/01/2012
Field of study

Retinoblastoma is an aggressive childhood cancer of the developing retina that is initiated by the biallelic loss of RB1. Tumours progress very quickly following RB1 inactivation but the underlying mechanism is not known. Here we show that the retinoblastoma genome is stable, but that multiple cancer pathways can be epigenetically deregulated. To identify the mutations that cooperate with RB1 loss, we performed whole-genome sequencing of retinoblastomas. The overall mutational rate was very low; RB1 was the only known cancer gene mutated. We then evaluated the role of RB1 in genome stability and considered non-genetic mechanisms of cancer pathway deregulation. For example, the proto-oncogene SYK is upregulated in retinoblastoma and is required for tumour cell survival. Targeting SYK with a small-molecule inhibitor induced retinoblastoma tumour cell death in vitro and in vivo. Thus, retinoblastomas may develop quickly as a result of the epigenetic deregulation of key cancer pathways as a direct or indirect result of RB1 loss

eScholarship - University of California

A vertebrate case study of the quality of assemblies derived from next-generation sequences

Author: Chen Lei
Dooling David J
Haub Kevin V
Hillier LaDeana W
Locke Devin P
Mardis Elaine R
Martin John C
Miller Jason R
Minx Patrick
Mitreva Makedonka
Thane Nay
Warren Wesley C
Weinstock George M
Wilson Richard K
Ye Liang
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

The unparalleled efficiency of next-generation sequencing (NGS) has prompted widespread adoption, but significant problems remain in the use of NGS data for whole genome assembly. We explore the advantages and disadvantages of chicken genome assemblies generated using a variety of sequencing and assembly methodologies. NGS assemblies are equivalent in some ways to a Sanger-based assembly yet deficient in others. Nonetheless, these assemblies are sufficient for the identification of the majority of genes and can reveal novel sequences when compared to existing assembly references

Springer - Publisher Connector

PubMed Central

Digital Commons@Becker

UGD Academic Repository

Recommended from our members

A high-resolution map of human evolutionary constraint using 29 mammals.

Author: Alföldi Jessica
Baldwin Jen
Baylor College of Medicine Human Genome Sequencing Center Sequencing Team
Beal Kathryn
Birney Ewan
Bloom Toby
Broad Institute Sequencing Platform and Whole Genome Assembly Team
Chang Jean
Chin Chee Whye
Clamp Michele
Clawson Hiram
Cree Andrew
Cuff James
Delehaunty Kim
Di Palma Federica
Dihn Huyen H
Dooling David
Ernst Jason
Fitzgerald Stephen
Flicek Paul
Fowler Gerald
Fronik Catrina
Fulton Bob
Fulton Lucinda
Garber Manuel
Genome Institute at Washington University
Gibbs Richard A
Gnerre Sante
Goldman Nick
Graves Tina
Green Eric D
Guttman Mitchell
Haussler David
Heiman Dave
Herrero Javier
Holloway Alisha K
Hubisz Melissa J
Jaffe David B
Jhangiani Shalili
Jordan Gregory
Joshi Vandita
Jungreis Irwin
Kellis Manolis
Kent W James
Kheradpour Pouya
Kostka Dennis
Kovar Christie L
Lander Eric S
Lara Marcia
Lee Sandra
Lewis Lora R
Lin Michael F
Lindblad-Toh Kerstin
Lowe Craig B
Mardis Elaine R
Margulies Elliott H
Martins Andre L
Massingham Tim
Mauceli Evan
Minx Patrick
Moltke Ida
Muzny Donna M
Nazareth Lynne V
Nicol Robert
Nusbaum Chad
Okwuonu Geoffrey
Parker Brian J
Pedersen Jakob S
Pollard Katherine S
Raney Brian J
Rasmussen Matthew D
Robinson Jim
Santibanez Jireh
Siepel Adam
Sodergren Erica
Stark Alexander
Vilella Albert J
Ward Lucas D
Warren Wesley C
Washietl Stefan
Weinstock George M
Wen Jiayu
Wilkinson Jane
Wilson Richard K
Worley Kim C
Xie Xiaohui
Young Sarah
Zody Michael C
Zuk Or
Publication venue: eScholarship, University of California
Publication date: 01/10/2011
Field of study

The comparison of related genomes has emerged as a powerful lens for genome interpretation. Here we report the sequencing and comparative analysis of 29 eutherian genomes. We confirm that at least 5.5% of the human genome has undergone purifying selection, and locate constrained elements covering ∼4.2% of the genome. We use evolutionary signatures and comparisons with experimental data sets to suggest candidate functions for ∼60% of constrained bases. These elements reveal a small number of new coding exons, candidate stop codon readthrough events and over 10,000 regions of overlapping synonymous constraint within protein-coding exons. We find 220 candidate RNA structural families, and nearly a million elements overlapping potential promoter, enhancer and insulator regions. We report specific amino acid residues that have undergone positive selection, 280,000 non-coding elements exapted from mobile elements and more than 1,000 primate- and human-accelerated elements. Overlap with disease-associated variants indicates that our findings will be relevant for studies of human biology, health and disease

eScholarship - University of California

Clonal architecture of secondary acute myeloid leukemia

Author: Abbott Rachel
Chen Ken
Ding Li
DiPersio John F
Dooling David
Eades William
Fan Xian
Frater John L
Fulton Robert
Graubert Timothy A
Grillot Marcus
Heath Sharon
Kalicki-Veizer Joelle
Koboldt Daniel C
Larson David E
Ley Timothy J
Link Daniel C
Magrini Vincent
Mardis Elaine R
McLellan Michael D
O\u27Laughlin Michelle
Schmidt Heather
Shao Jin
Shen Dong
Tomasson Michael
Walter Matthew J
Westervelt Peter
Wilson Richard K
Witowski Sarah
Publication venue: Digital Commons@Becker
Publication date: 01/01/2012
Field of study

BACKGROUND: The myelodysplastic syndromes are a group of hematologic disorders that often evolve into secondary acute myeloid leukemia (AML). The genetic changes that underlie progression from the myelodysplastic syndromes to secondary AML are not well understood. METHODS: We performed whole-genome sequencing of seven paired samples of skin and bone marrow in seven subjects with secondary AML to identify somatic mutations specific to secondary AML. We then genotyped a bone marrow sample obtained during the antecedent myelodysplastic-syndrome stage from each subject to determine the presence or absence of the specific somatic mutations. We identified recurrent mutations in coding genes and defined the clonal architecture of each pair of samples from the myelodysplastic-syndrome stage and the secondary-AML stage, using the allele burden of hundreds of mutations. RESULTS: Approximately 85% of bone marrow cells were clonal in the myelodysplastic-syndrome and secondary-AML samples, regardless of the myeloblast count. The secondary-AML samples contained mutations in 11 recurrently mutated genes, including 4 genes that have not been previously implicated in the myelodysplastic syndromes or AML. In every case, progression to acute leukemia was defined by the persistence of an antecedent founding clone containing 182 to 660 somatic mutations and the outgrowth or emergence of at least one subclone, harboring dozens to hundreds of new mutations. All founding clones and subclones contained at least one mutation in a coding gene. CONCLUSIONS: Nearly all the bone marrow cells in patients with myelodysplastic syndromes and secondary AML are clonally derived. Genetic evolution of secondary AML is a dynamic process shaped by multiple cycles of mutation acquisition and clonal selection. Recurrent gene mutations are found in both founding clones and daughter subclones. (Funded by the National Institutes of Health and others.

Crossref

Digital Commons@Becker

PubMed Central

Water-level dynamics in natural and artificial pools in blanket peatlands

Author: Andersen Roxane
Baird Andy J.
Billett Mike F.
Chapman Pippa J.
Dinsmore Kerry J.
Dooling Gemma
Gee Clare
Grayson Richard P.
Holden Joseph
McKenzie Rebecca
Moody Catherine S.
Turner T. Edward
Publication venue: 'Wiley'
Publication date: 01/02/2018
Field of study

Perennial pools are common natural features of peatlands and their hydrological functioning and turnover may be important for carbon fluxes, aquatic ecology and downstream water quality. Peatland restoration methods such as ditch blocking result in many new pools. However, little is known about the hydrological function of either pool type. We monitored six natural and six artificial pools on a Scottish blanket peatland. Pool water levels were more variable in all seasons in artificial pools having greater water level increases and faster recession responses to storms than natural pools. Pools overflowed by a median of 9 and 54 times pool volume per year for natural and artificial pools respectively but this varied widely because some large pools had small upslope catchments and vice versa. Mean peat water-table depths were similar between natural and artificial pool sites but much more variable over time at the artificial pool site, possibly due to a lower bulk specific yield across this site. Pool levels and pool-level fluctuations were not the same as those of local water tables in the adjacent peat. Pool level time-series were much smoother, with more damped rainfall or recession responses than those for peat water tables. There were strong hydraulic gradients between the peat and pools, with absolute water tables often being 20-30 cm higher or lower than water levels in pools only 1-4 m away. However, as peat hydraulic conductivity was very low (median of 1.5×10-5 and 1.4×10-6 cm s-1 at 30 and 50 cm depths at the natural pool site) there was little deep subsurface flow interaction. We conclude that: 1) for peat restoration projects, a larger total pool surface area is likely to result in smaller flood peaks downstream, at least during summer months, because peatland bulk specific yield will be greater; and 2) surface and near-surface connectivity during storm events and topographic context, rather than pool size alone, must be taken into account in future peatland pool and stream chemistry studies

Crossref

Stirling Online Research Repository (RIOXX)

Stirling Online Research Repository

White Rose Research Online

NERC Open Research Archive

Design and implementation of a generalized laboratory data model

Abstract Background Investigators in the biological sciences continue to exploit laboratory automation methods and have dramatically increased the rates at which they can generate data. In many environments, the methods themselves also evolve in a rapid and fluid manner. These observations point to the importance of robust information management systems in the modern laboratory. Designing and implementing such systems is non-trivial and it appears that in many cases a database project ultimately proves unserviceable. Results We describe a general modeling framework for laboratory data and its implementation as an information management system. The model utilizes several abstraction techniques, focusing especially on the concepts of inheritance and meta-data. Traditional approaches commingle event-oriented data with regular entity data in <it>ad hoc </it>ways. Instead, we define distinct regular entity and event schemas, but fully integrate these via a standardized interface. The design allows straightforward definition of a "processing pipeline" as a sequence of events, obviating the need for separate workflow management systems. A layer above the event-oriented schema integrates events into a workflow by defining "processing directives", which act as automated project managers of items in the system. Directives can be added or modified in an almost trivial fashion, i.e., without the need for schema modification or re-certification of applications. Association between regular entities and events is managed via simple "many-to-many" relationships. We describe the programming interface, as well as techniques for handling input/output, process control, and state transitions. Conclusion The implementation described here has served as the Washington University Genome Sequencing Center's primary information system for several years. It handles all transactions underlying a throughput rate of about 9 million sequencing reactions of various kinds per month and has handily weathered a number of major pipeline reconfigurations. The basic data model can be readily adapted to other high-volume processing environments.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Genome modeling system: A knowledge management platform for genomics

Author: Abbott Benjamin S
Abbott Travis E
Ainscough Benjamin J
Belter Edward A
Brummett Anthony M
Burnett Mark M
Callaway Matthew B
Carmichael Lynn K
Chen Ken
Clark Eric
Coffman Adam C
Das Indraniel
Dees Nathan D
Derickson Brian R
Ding Li
Dooling David J
Du Feiyu
Dukes Adam
Eldred James M
Fan Xian
Ferguson Ian T
Griffith Malachi
Griffith Obi L
Harris Christopher C
Hawkins Amy E
Helper Todd G
Hundal Jasreet
Kandoth Cyriac
Kim Kyung H
Kiwala Michael J
Koboldt Daniel C
Larson David E
Leonard Shawn M
Lolofie Justin T
Long Robert L
Lu Charles
Magrini Vincent J
Maher Christopher A
Maher Nicole
Mardis Elaine R
McLellan Michael D
McMichael Joshua F
Miller Christopher A
Mooney Thomas P
Morton David L
Nutter Nathaniel G
Oberkfell Ben J
Peck Joshua B
Pohl Craig S
Ramu Avinash
Regier Allison A
Sanderson Gabriel E
Schierding William S
Schroeder William E
Shi Xiaoqi
Skidmore Zachary L
Smith Scott M
Stiehr Gary
Walker Jason R
Weible James V
Weil Matthew R
Wilson Richard K
Wohlstadter Richard W
Wylie Todd N
Publication venue: Digital Commons@Becker
Publication date: 01/01/2015
Field of study

In this work, we present the Genome Modeling System (GMS), an analysis information management system capable of executing automated genome analysis pipelines at a massive scale. The GMS framework provides detailed tracking of samples and data coupled with reliable and repeatable analysis pipelines. The GMS also serves as a platform for bioinformatics development, allowing a large team to collaborate on data analysis, or an individual researcher to leverage the work of others effectively within its data management system. Rather than separating ad-hoc analysis from rigorous, reproducible pipelines, the GMS promotes systematic integration between the two. As a demonstration of the GMS, we performed an integrated analysis of whole genome, exome and transcriptome sequencing data from a breast cancer cell line (HCC1395) and matched lymphoblastoid line (HCC1395BL). These data are available for users to test the software, complete tutorials and develop novel GMS pipeline configurations. The GMS is available at https://github.com/genome/gms

Crossref

Directory of Open Access Journals

Digital Commons@Becker

PubMed Central

FigShare

The Origin and Evolution of Mutations in Acute Myeloid Leukemia

Author: Alldredge Patricia A.
Baty Jack D.
Chen Ken
Cook Lisa L.
Delehaunty Kim D.
Demeter Ryan T.
Ding Li
DiPersio John F.
Dooling David J.
Fulton Lucinda A.
Fulton Robert S.
Graubert Timothy A.
Harris Christopher C.
Heath Sharon E.
Hundal Jasreet
Kalicki-Veizer Joelle M.
Kandoth Cyriac
Klco Jeffery M.
Koboldt Daniel C.
Kulkarni Shashikant
Lamprecht Tamara L.
Larson David E.
Ley Timothy J.
Lin Ling
Link Daniel C.
Liu Fulu
Lu Charles
Magrini Vincent J.
Mardis Elaine R.
McGrath Sean D.
McLellan Michael D.
McMichael Joshua F.
Miller Christopher A.
Nagarajan Rakesh
O’Laughlin Michelle D.
Payton Jacqueline E.
Reed Jerry P.
Schmidt Heather K.
Shannon William D.
Swift Gary W.
Tomasson Michael H.
Varghese Nobish
Vickery Tammi L.
Walker Jason R.
Wallis John W.
Walter Matthew J.
Wartman Lukas D.
Watson Mark A.
Welch John S.
Westervelt Peter
Wilson Richard K.
Wylie Todd N.
Xia Jun
Zhang Qunyuan
Publication venue: Elsevier Inc.
Publication date: 20/07/2012
Field of study

SummaryMost mutations in cancer genomes are thought to be acquired after the initiating event, which may cause genomic instability and drive clonal evolution. However, for acute myeloid leukemia (AML), normal karyotypes are common, and genomic instability is unusual. To better understand clonal evolution in AML, we sequenced the genomes of M3-AML samples with a known initiating event (PML-RARA) versus the genomes of normal karyotype M1-AML samples and the exomes of hematopoietic stem/progenitor cells (HSPCs) from healthy people. Collectively, the data suggest that most of the mutations found in AML genomes are actually random events that occurred in HSPCs before they acquired the initiating mutation; the mutational history of that cell is “captured” as the clone expands. In many cases, only one or two additional, cooperating mutations are needed to generate the malignant founding clone. Cells from the founding clone can acquire additional cooperating mutations, yielding subclones that can contribute to disease progression and/or relapse

Elsevier - Publisher Connector

PubMed Central