Search CORE

197 research outputs found

Capturing the dynamics of genome replication on individual ultra-long nanopore sequence reads.

Author: Boemo Michael A
Kessler Benedikt M
Kriaucionis Skirmantas
Müller Carolin A
Nieduszynski Conrad A
Simpson Jared T
Spingardi Paolo
Publication venue: Nat Methods
Publication date: 22/04/2019
Field of study

Replication of eukaryotic genomes is highly stochastic, making it difficult to determine the replication dynamics of individual molecules with existing methods. We report a sequencing method for the measurement of replication fork movement on single molecules by detecting nucleotide analog signal currents on extremely long nanopore traces (D-NAscent). Using this method, we detect 5-bromodeoxyuridine (BrdU) incorporated by Saccharomyces cerevisiae to reveal, at a genomic scale and on single molecules, the DNA sequences replicated during a pulse-labeling period. Under conditions of limiting BrdU concentration, D-NAscent detects the differences in BrdU incorporation frequency across individual molecules to reveal the location of active replication origins, fork direction, termination sites, and fork pausing/stalling events. We used sequencing reads of 20-160 kilobases to generate a whole-genome single-molecule map of DNA replication dynamics and discover a class of low-frequency stochastic origins in budding yeast. The D-NAscent software is available at https://github.com/MBoemo/DNAscent.git

Crossref

Oxford University Research Archive

Apollo (Cambridge)

University of East Anglia digital repository

Using reference-free compressed data structures to analyze sequencing reads from thousands of human genomes.

Author: Cotten Matthew
Dolle Dirk D
Durbin Richard
Iqbal Zamin
Keane Thomas M
Liu Zhicheng
McCarthy Shane A
Simpson Jared T
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 22/06/2016
Field of study

We are rapidly approaching the point where we have sequenced millions of human genomes. There is a pressing need for new data structures to store raw sequencing data and efficient algorithms for population scale analysis. Current reference-based data formats do not fully exploit the redundancy in population sequencing nor take advantage of shared genetic variation. In recent years, the Burrows-Wheeler transform (BWT) and FM-index have been widely employed as a full-text searchable index for read alignment and de novo assembly. We introduce the concept of a population BWT and use it to store and index the sequencing reads of 2705 samples from the 1000 Genomes Project. A key feature is that, as more genomes are added, identical read sequences are increasingly observed, and compression becomes more efficient. We assess the support in the 1000 Genomes read data for every base position of two human reference assembly versions, identifying that 3.2 Mbp with population support was lost in the transition from GRCh37 with 13.7 Mbp added to GRCh38. We show that the vast majority of variant alleles can be uniquely described by overlapping 31-mers and show how rapid and accurate SNP and indel genotyping can be carried out across the genomes in the population BWT. We use the population BWT to carry out nonreference queries to search for the presence of all known viral genomes and discover human T-lymphotropic virus 1 integrations in six samples in a recognized epidemiological distribution

Crossref

LSHTM Research Online

PubMed Central

Oxford University Research Archive

Enlighten

MinION Analysis and Reference Consortium: Phase 1 data release and analysis

Author: Benedict Paten
Bonnie L. Brown
Camilla L.C. Ip
David A. Eccles
David Buck
Elizabeth M. Batty
Ewan Birney
Hans J. Jansen
Hugh E. Olsen
Jared T. Simpson
John M. Urban
John R. Tyson
Justin O'Grady
Mariateresa de Cesare
Matthew Loose
MinION Analysis and Reference Consortium
Miten Jain
Paolo Piazza
Richard M. Leggett
Rory J. Bowden
Sara Goodwin
Solomon Mwaigwisya
Terrance P. Snutch
Vadim Zalunin
Publication venue: 'F1000 Research Ltd'
Publication date: 01/10/2015
Field of study

The advent of a miniaturized DNA sequencing device with a high-throughput contextual sequencing capability embodies the next generation of large scale sequencing tools. The MinION™ Access Programme (MAP) was initiated by Oxford Nanopore Technologies™ in April 2014, giving public access to their USB-attached miniature sequencing device. The MinION Analysis and Reference Consortium (MARC) was formed by a subset of MAP participants, with the aim of evaluating and providing standard protocols and reference data to the community. Envisaged as a multi-phased project, this study provides the global community with the Phase 1 data from MARC, where the reproducibility of the performance of the MinION was evaluated at multiple sites. Five laboratories on two continents generated data using a control strain of Escherichia coli K-12, preparing and sequencing samples according to a revised ONT protocol. Here, we provide the details of the protocol used, along with a preliminary analysis of the characteristics of typical runs including the consistency, rate, volume and quality of data produced. Further analysis of the Phase 1 data presented here, and additional experiments in Phase 2 of E. coli from MARC are already underway to identify ways to improve and enhance MinION performance

Crossref

Cold Spring Harbor Laboratory Institutional Repository

Directory of Open Access Journals

PubMed Central

University of East Anglia digital repository

Deep short-read sequencing of chromosome 17 from the mouse strains A/J and CAST/Ei identifies significant germline variation and candidate genes that regulate liver triglyceride levels

Author: Adams David J
Brown Steve D
Croniger Colleen M
Durbin Richard
Hurles Matthew E
Keane Thomas
Li Heng
Lynch Dee
Nadeau Joseph H
Ning Zemin
Rust Alistair G
Simpson Jared T
Stalker Jim
Sudbery Ian
Teboul Lydia
Walter Klaudia
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Methods for accurate identification of nucleotide and structural variation using de novo short read sequencing of mouse chromosomes are described

Crossref

Springer - Publisher Connector

PubMed Central

White Rose Research Online

Author Correction: Retrospective evaluation of whole exome and genome mutation calls in 746 cancer samples

Author: Bailey Matthew H
Ding Li
Dong Guanlan
Dursi Lewis Jonathan
Ellrott Kyle
Gerstein Mark B
Getz Gad
Kelso Sean
Li Shantao
Li Yize
Liang Wen-Wei
MC3 Working Group
Meyerson William U
PCAWG Consortium
PCAWG novel somatic mutation calling methods working group
Saksena Gordon
Simpson Jared T
Wang Liang-Bo
Weerasinghe Amila
Wendl Michael C
Wheeler David A
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 30/11/2020
Field of study

Correction to this paper has been published: https://doi.org/10.1038/s41467-020-20128-w

ZORA

Retrospective evaluation of whole exome and genome mutation calls in 746 cancer samples

Author: Bailey Matthew H
Ding Li
Dong Guanlan
Dursi Lewis Jonathan
Ellrott Kyle
Gerstein Mark B
Getz Gad
Kelso Sean
Li Shantao
Li Yize
Liang Wen-Wei
MC3 Working Group
Meyerson William U
PCAWG Consortium
PCAWG novel somatic mutation calling methods working group
Saksena Gordon
Simpson Jared T
Wang Liang-Bo
Weerasinghe Amila
Wendl Michael C
Wheeler David A
Publication venue: Nature Publishing Group
Publication date: 21/09/2020
Field of study

The Cancer Genome Atlas (TCGA) and International Cancer Genome Consortium (ICGC) curated consensus somatic mutation calls using whole exome sequencing (WES) and whole genome sequencing (WGS), respectively. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2,658 cancers across 38 tumour types, we compare WES and WGS side-by-side from 746 TCGA samples, finding that ~80% of mutations overlap in covered exonic regions. We estimate that low variant allele fraction (VAF < 15%) and clonal heterogeneity contribute up to 68% of private WGS mutations and 71% of private WES mutations. We observe that ~30% of private WGS mutations trace to mutations identified by a single variant caller in WES consensus efforts. WGS captures both ~50% more variation in exonic regions and un-observed mutations in loci with variable GC-content. Together, our analysis highlights technological divergences between two reproducible somatic variant detection efforts

ZORA

Nanopore native RNA sequencing of a human poly(A) transcriptome

Author: Akeson Mark
Brooks Angela N.
de Jesus Jaqueline Goes
Gilpatrick Timothy
Holmes Nadine
Jain Miten
Jones Karen L.
Loman Nicholas
Loose Matthew
Olsen Hugh E.
Paten Benedict
Payne Alexander
Quick Joshua
Razaghi Roham
Sadowski Norah
Simpson Jared T.
Snutch Terrance P.
Soulette Cameron M.
Tang Alison D.
Tang Paul S.
Timp Winston
Tyson John R.
Workman Rachael E.
Zuzarte Philip C.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 18/11/2019
Field of study

High-throughput complementary DNA sequencing technologies have advanced our understanding of transcriptome complexity and regulation. However, these methods lose information contained in biological RNA because the copied reads are often short and modifications are not retained. We address these limitations using a native poly(A) RNA sequencing strategy developed by Oxford Nanopore Technologies. Our study generated 9.9 million aligned sequence reads for the human cell line GM12878, using thirty MinION flow cells at six institutions. These native RNA reads had a median length of 771 bases, and a maximum aligned length of over 21,000 bases. Mitochondrial poly(A) reads provided an internal measure of read-length quality. We combined these long nanopore reads with higher accuracy short-reads and annotated GM12878 promoter regions to identify 33,984 plausible RNA isoforms. We describe strategies for assessing 3′ poly(A) tail length, base modifications and transcript haplotypes

Repository@Nottingham

eScholarship - University of California

Author Correction:Retrospective evaluation of whole exome and genome mutation calls in 746 cancer samples (Nature Communications, (2020), 11, 1, (4748), 10.1038/s41467-020-18151-y)

Author: Bailey Matthew H.
Ding Li
Dong Guanlan
Dursi Lewis Jonathan
Ellrott Kyle
Gerstein Mark B.
Getz Gad
Kelso Sean
Li Shantao
Li Yize
Liang Wen-Wei
Marchal Kathleen
MC3 Working Group [missing]
Meyerson William U.
PCAWG novel somatic mutation calling methods working group [missing]
PCAWG [missing]
Pulido Tamayo Sergio
Saksena Gordon
Simpson Jared T.
Verbeke Lieven
Wang Liang-Bo
Weerasinghe Amila
Wendl Michael C.
Wheeler David A.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020
Field of study

The original version of this Article omitted from the author list the 9th author Yize Li, who is from the ‘The McDonnell Genome Institute at Washington University, St. Louis, MO 63108, USA and Department of Medicine, Division of Oncology, Washington University School of Medicine, St. Louis, MO 63108, USA’. This has been corrected in both the PDF and HTML versions of the Article

Crossref

Ghent University Academic Bibliography

University of Miami: Scholarship Miami

eScholarship - University of California

University of Dundee Online Publications

Apollo (Cambridge)

Trait Variation in Yeast Is Defined by Population History

A fundamental goal in biology is to achieve a mechanistic understanding of how and to what extent ecological variation imposes selection for distinct traits and favors the fixation of specific genetic variants. Key to such an understanding is the detailed mapping of the natural genomic and phenomic space and a bridging of the gap that separates these worlds. Here we chart a high-resolution map of natural trait variation in one of the most important genetic model organisms, the budding yeast Saccharomyces cerevisiae, and its closest wild relatives and trace the genetic basis and timing of major phenotype changing events in its recent history. We show that natural trait variation in S. cerevisiae exceeds that of its relatives, despite limited genetic variation, and follows the population history rather than the source environment. In particular, the West African population is phenotypically unique, with an extreme abundance of low-performance alleles, notably a premature translational termination signal in GAL3 that cause inability to utilize galactose. Our observations suggest that many S. cerevisiae traits may be the consequence of genetic drift rather than selection, in line with the assumption that natural yeast lineages are remnants of recent population bottlenecks. Disconcertingly, the universal type strain S288C was found to be highly atypical, highlighting the danger of extrapolating gene-trait connections obtained in mosaic, lab-domesticated lineages to the species as a whole. Overall, this study represents a step towards an in-depth understanding of the causal relationship between co-variation in ecology, selection pressure, natural traits, molecular mechanism, and alleles in a key model organism

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Leicester Research Archive