9 research outputs found

Genome Reference and Sequence Variation in the Large Repetitive Central Exon of Human MUC5AC

Author: Boellmann Frank
Corcoran David L
Dang Hong
Fedrigo Olivier
Guo Xueliang
Haridass Prashamsha
Jones Corbin D
Knowles Michael R
O'Neal Wanda K
Pace Rhonda G
Ranade Swati S
Seibold Max A
Stonebraker Jaclyn R
Voynow Judith A
Yuan George
Zheng Shuo
Publication date: 01/01/2013

Despite modern sequencing efforts, the difficulty in assembly of highly repetitive sequences has prevented resolution of human genome gaps, including some in the coding regions of genes with important biological functions. One such gene, MUC5AC, encodes a large, secreted mucin, which is one of the two major secreted mucins in human airways. The MUC5AC region contains a gap in the human genome reference (hg19) across the large, highly repetitive, and complex central exon. This exon is predicted to contain imperfect tandem repeat sequences and multiple conserved cysteine-rich (CysD) domains. To resolve the MUC5AC genomic gap, we used high-fidelity long PCR followed by single molecule real-time (SMRT) sequencing. This technology yielded long sequence reads and robust coverage that allowed for de novo sequence assembly spanning the entire repetitive region. Furthermore, we used SMRT sequencing of PCR amplicons covering the central exon to identify genetic variation in four individuals. The results demonstrated the presence of segmental duplications of CysD domains, insertions/deletions (indels) of tandem repeats, and single nucleotide variants. Additional studies demonstrated that one of the identified tandem repeat insertions is tagged by nonexonic single nucleotide polymorphisms. Taken together, these data illustrate the successful utility of SMRT sequencing long reads for de novo assembly of large repetitive sequences to fill the gaps in the human genome. Characterization of the MUC5AC gene and the sequence variation in the central exon will facilitate genetic and functional studies for this critical airway mucin.

PubMed Central

Carolina Digital Repository

Genome Reference and Sequence Variation in the Large Repetitive Central Exon of Human MUC5AC

Author: Corbin D Jones
David L Corcoran
Frank Boellmann
George Yuan
Hong Dang
Jaclyn R Stonebraker
Judith A Voynow
Max A Seibold
Michael R Knowles
Olivier Fedrigo
Prashamsha Haridass
Rhonda G Pace
Shuo Zheng
Swati S Ranade
Wanda K O'Neal
Xueliang Guo
Publication venue: American Thoracic Society
Publication date: 01/01/2013

Crossref

PubMed Central

Carolina Digital Repository

Rapid whole-genome mutational profiling using next-generation sequencing technologies

Author: Blanchard Alan P.
Chapman Jarrod
Chen Feng
Coleman Brittney E.
Donahue William F.
Hillman David
Jeffries Thomas W.
Lee Clarence C.
Makowsky Kathryn
Malek Joel A.
Marth Gabor T.
McKernan Kevin J.
McLaughlin Stephen F.
Peckham Heather E.
Quinlan Aaron R.
Ranade Swati S.
Richardson Paul M.
Rokhsar Daniel S.
Shen Lei
Smith Douglas R.
Sorenson Jon M.
Stewart Donald A.
Stromberg Michael P.
Tao Wei
Tusneem Nadeem
Warner Jason B.
Woolf Betty
Zhang Lu
Zhang Zheng
Publication venue: Cold Spring Harbor Laboratory Press

Forward genetic mutational studies, adaptive evolution, and phenotypic screening are powerful tools for creating new variant organisms with desirable traits. However, mutations generated in the process cannot be easily identified with traditional genetic tools. We show that new high-throughput, massively parallel sequencing technologies can completely and accurately characterize a mutant genome relative to a previously sequenced parental (reference) strain. We studied a mutant strain of Pichia stipitis, a yeast capable of converting xylose to ethanol. This unusually efficient mutant strain was developed through repeated rounds of chemical mutagenesis, strain selection, transformation, and genetic manipulation over a period of seven years. We resequenced this strain on three different sequencing platforms. Surprisingly, we found fewer than a dozen mutations in open reading frames. All three sequencing technologies were able to identify each single nucleotide mutation given at least 10–15-fold nominal sequence coverage. Our results show that detecting mutations in evolved and engineered organisms is rapid and cost-effective at the whole-genome level using new sequencing technologies. Identification of specific mutations in strains with altered phenotypes will add insight into specific gene functions and guide further metabolic engineering efforts.

Crossref

PubMed Central

Regenerative Glycosylation under Nucleophilic Catalysis

Author: Alexei V. Demchenko
Andersson F.
Chen X.
Chu A.-H. A.
Codee J. D. C.
Crich D.
Crich D.
Danishefsky S. J.
Demchenko A. V.
Demchenko A. V.
Friesen R. W.
Fu G. C.
Garcia B. A.
Garegg P. J.
Halcomb R. L.
Hasty S. J.
Huang L.
Huang L.
Huang X.
Ichikawa Y.
Keith J. Stine
Liu C.-Y. I.
Loskot S. A.
Lu S. R.
Mydock L. K.
Nicolaou K. C.
Nicolaou K. C.
Nigudkar S. S.
Nogueira J. M.
Park J.
Ranade S. C.
Randall J. L.
Schmidt R. R.
Smoot J. T.
Swati S. Nigudkar
Thygesen M. B.
Torres J. C.
Varki A.
Vocadlo D. J.
Yasomanee J. P.
Yu B.
Yu B.
Zeng Y.
Zhu J.
Zhu X.
Publication venue: American Chemical Society (ACS)

Crossref

Stem cell transcriptome profiling via massive-scale mRNA sequencing

Author: Alan J Robertson
Alistair R R Forrest
Andrew C Perkins
Anita L Steptoe
Brooke B A Gardiner
CE Hirst
Clarence C Lee
D Karolchik
D Thierry-Mieg
Darrin F Taylor
E Birney
Gabriel Kolle
Geoffrey J Faulkner
GJ Faulkner
GK Smyth
Graeme Bethel
H Kiyosawa
H Ohtake
Heather E Peckham
J Kawai
J Reinartz
J Shendure
JB Kim
Jonathan M Manning
Kevin J McKernan
M Matz
Mellissa K Brown
MS Boguski
Nicole Cloonan
P Carninci
P Kapranov
PG Engstrom
PJ Gardina
Q Zhou
R Lister
S Cawley
Sean M Grimmond
Shivangi Wani
SJ Bruce
SJ Bruce
Stephen J Bruce
Swati S Ranade
T Shiraki
TA Clark
VE Velculescu
WJ Kent
WM Schmidt
WR Jeck
Y Lee
Y Okazaki
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2008

We developed a massive-scale RNA sequencing protocol, short quantitative random RNA libraries or SQRL, to survey the complexity, dynamics and sequence content of transcriptomes in a near-complete fashion. This method generates directional, random-primed, linear cDNA libraries that are optimized for next-generation short-tag sequencing. We surveyed the poly(A)+ transcriptomes of undifferentiated mouse embryonic stem cells (ESCs) and embryoid bodies (EBs) at an unprecedented depth (10 Gb), using the Applied Biosystems SOLiD technology. These libraries capture the genomic landscape of expression, state-specific expression, single-nucleotide polymorphisms (SNPs), the transcriptional activity of repeat elements, and both known and new alternative splicing events. We investigated the impact of transcriptional complexity on current models of key signaling pathways controlling ESC pluripotency and differentiation, highlighting how SQRL can be used to characterize transcriptome content and dynamics in a quantitative and reproducible manner, and suggesting that our understanding of transcriptional complexity is far from complete.

Crossref

Enlighten

An Inv(16)(p13.3q24.3)-Encoded CBFA2T3-GLIS2 Fusion Protein Defines an Aggressive Subtype of Pediatric Acute Megakaryoblastic Leukemia

Author: Adachi Souichi
Andersson Anna K
Ballerini Paola
Barbato Michael I
Biondi Andrea
Cazzaniga Giovanni
Chen Shann-Ching
Dang Jinjun
Ding Li
Downing James R
Döhner Hartmut
Döhner Konstanze
Easton John
Ganti Ramapriya
Gruber Tanja A
Gupta Vedant
Hayashi Yasuhide
Kantarjian Hagop
Kornblau Steven M
Koss Cary S
Larson Gedman Amanda
Ley Timothy J
Liang Der-Cherng
Ma Jing
Manne Jayanthi
Marada Suresh
Mardis Elaine R
Mulder Heather L
Nimer Stephen D
Ogden Stacey K
Parker Matthew
Pounds Stanley
Pui Ching-Hon
Radtke Ina
Ranade Swati
Ravandi Farhad
Rubnitz Jeffrey E
Rusch Michael
Shi Lei
Shih Lee-Yung
Shurtleff Sheila
Su Xiaoping
Ta Huy Q
Tawa Akio
Tomizawa Daisuke
Wang Jianmin
Wilson Richard K
Wu Gang
Zhang Jinghui
Publication venue: Elsevier Inc
Publication date: 01/11/2012

To define the mutation spectrum in non-Down syndrome acute megakaryoblastic leukemia (non-DS-AMKL), we performed transcriptome sequencing on diagnostic blasts from 14 pediatric patients and validated our findings in a recurrency/validation cohort consisting of 34 pediatric and 28 adult AMKL samples. Our analysis identified a cryptic chromosome 16 inversion (inv(16)(p13.3q24.3)) in 27% of pediatric cases, which encodes a CBFA2T3-GLIS2 fusion protein. Expression of CBFA2T3-GLIS2 in Drosophila and murine hematopoietic cells induced bone morphogenic protein (BMP) signaling and resulted in a marked increase in the self-renewal capacity of hematopoietic progenitors. These data suggest that expression of CBFA2T3-GLIS2 directly contributes to leukemogenesis.

► CBFA2T3-GLIS2 is a recurrent fusion gene in pediatric AMKL
► CBFA2T3-GLIS2 AMKL has a distinct expression profile and an inferior outcome
► CBFA2T3-GLIS2 induces BMP signaling and enhanced self-renewal of progenitor cells

Crossref

Elsevier - Publisher Connector

Lund University Publications

PubMed Central

University of Miami: Scholarship Miami

Analysis of the Otd-dependent transcriptome supports the evolutionary conservation of CRX/OTX/OTD functions in flies and vertebrates

Crossref

Sequence and structural variation in a human genome uncovered by short-read, massively parallel ligation sequencing using two-base encoding

Author: Alkan Can
Antipova Alena A.
Bafna Vineet
Bashir Ali
Beaudoin Robert E.
Blanchard Alan P.
Clouser Christopher R.
Coleman Brittany E.
Costa Gina L.
De La Vega Francisco M.
Dimalanta Eileen T.
Duncan Cisyla
Eichler Evan E.
Fu Haoning
Fu Yutao
Gottimukkala Rajesh K.
Hayashibara Kathleen C.
Hendrickson Cynthia L.
Hyland Fiona C.
Ichikawa Jeffrey K.
Kidd Jeffrey M.
Kotler Lev
Laptewicz Michael W.
Lee Clarence C.
Li Bin
Lyons Michael R.
MacBride Andrew
Malek Joel A.
Manning Jonathan M.
McKernan Kevin Judd
McLaughlin Stephen F.
Moore Michael P.
Peckham Heather E.
Perez Damon S.
Ranade Swati S.
Reese Martin G.
Rhodes Michael D.
Sannicandro Adam E.
Sheridan Andrew
Sokolsky Tanya D.
Stuart Jeremy R.
Tsung Eric F.
Yang Shan
Zhang Lei
Zhang Zheng
Publication venue: Cold Spring Harbor Laboratory Press

We describe the genome sequencing of an anonymous individual of African origin using a novel ligation-based sequencing assay that enables a unique form of error correction that improves the raw accuracy of the aligned reads to >99.9%, allowing us to accurately call SNPs with as few as two reads per allele. We collected several billion mate-paired reads yielding ∼18× haploid coverage of aligned sequence and close to 300× clone coverage. Over 98% of the reference genome is covered with at least one uniquely placed read, and 99.65% is spanned by at least one uniquely placed mate-paired clone. We identify over 3.8 million SNPs, 19% of which are novel. Mate-paired data are used to physically resolve haplotype phases of nearly two-thirds of the genotypes obtained and produce phased segments of up to 215 kb. We detect 226,529 intra-read indels, 5590 indels between mate-paired reads, 91 inversions, and four gene fusions. We use a novel approach for detecting indels between mate-paired reads that are smaller than the standard deviation of the insert size of the library and discover deletions in common with those detected with our intra-read approach. Dozens of mutations previously described in OMIM and hundreds of nonsynonymous single-nucleotide and structural variants in genes previously implicated in disease are identified in this individual. There is more genetic variation in the human genome still to be uncovered, and we provide guidance for future surveys in populations and cancer biopsies.

Crossref

PubMed Central