Search CORE

21 research outputs found

ALLPATHS 2: Small Genomes Assembled Accurately and with High Continuity from Short Paired Reads

Author: Burton Joshua
Gnerre Sante
Gnirke Andreas
Jaffe David B
MacCallum Iain
Malek Joel
McKernan Kevin
Nusbaum Chad
Przybylski Dariusz
Ranade Swati
Shea Terrance P
Shlyakhter Ilya
Williams Louise
Young Sarah
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

We demonstrate that genome sequences approaching finished quality can be generated from short paired reads. Using 36 base (fragment) and 26 base (jumping) reads from five microbial genomes of varied GC composition and sizes up to 40 Mb, ALLPATHS2 generated assemblies with long, accurate contigs and scaffolds. Velvet and EULER-SR were less accurate. For example, for Escherichia coli, the fraction of 10-kb stretches that were perfect was 99.8% (ALLPATHS2), 68.7% (Velvet), and 42.1% (EULER-SR).Organismic and Evolutionary Biolog

Crossref

Harvard University - DASH

Springer

Springer - Publisher Connector

Genome Reference and Sequence Variation in the Large Repetitive Central Exon of Human MUC5AC

Author: Boellmann Frank
Corcoran David L
Dang Hong
Fedrigo Olivier
Guo Xueliang
Haridass Prashamsha
Jones Corbin D
Knowles Michael R
O'Neal Wanda K
Pace Rhonda G
Ranade Swati S
Seibold Max A
Stonebraker Jaclyn R
Voynow Judith A
Yuan George
Zheng Shuo
Publication venue
Publication date: 01/01/2013
Field of study

Despite modern sequencing efforts, the difficulty in assembly of highly repetitive sequences has prevented resolution of human genome gaps, including some in the coding regions of genes with important biological functions. One such gene, MUC5AC, encodes a large, secreted mucin, which is one of the two major secreted mucins in human airways. The MUC5AC region contains a gap in the human genome reference (hg19) across the large, highly repetitive, and complex central exon. This exon is predicted to contain imperfect tandem repeat sequences and multiple conserved cysteine-rich (CysD) domains. To resolve the MUC5AC genomic gap, we used high-fidelity long PCR followed by single molecule real-time (SMRT) sequencing. This technology yielded long sequence reads and robust coverage that allowed for de novo sequence assembly spanning the entire repetitive region. Furthermore, we used SMRT sequencing of PCR amplicons covering the central exon to identify genetic variation in four individuals. The results demonstrated the presence of segmental duplications of CysD domains, insertions/deletions (indels) of tandem repeats, and single nucleotide variants. Additional studies demonstrated that one of the identified tandem repeat insertions is tagged by nonexonic single nucleotide polymorphisms. Taken together, these data illustrate the successful utility of SMRT sequencing long reads for de novo assembly of large repetitive sequences to fill the gaps in the human genome. Characterization of the MUC5AC gene and the sequence variation in the central exon will facilitate genetic and functional studies for this critical airway mucin

PubMed Central

Carolina Digital Repository

Reference Grade Characterization of Polymorphisms in Full-Length HLA Class I and II Genes With Short-Read Sequencing on the ION PGM System and Long-Reads Generated by Single Molecule, Real-Time Sequencing on the PacBio Platform

Author: Akira Oka
Anri Masuya
Atsuko Shigenari
Hidetoshi Inoko
Jerzy K. Kulski
Jerzy K. Kulski
John Harting
Junichi Sunaga
Ken Osaki
Miwako Kitazume
Primo Baybayan
Satoko Morishima
Sayaka Ito
Shingo Suzuki
Swati Ranade
Takashi Shiina
Yasuo Morishima
Yuko Ohnuki
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2018
Field of study

Although NGS technologies fuel advances in high-throughput HLA genotyping methods for identification and classification of HLA genes to assist with precision medicine efforts in disease and transplantation, the efficiency of these methods are impeded by the absence of adequately-characterized high-frequency HLA allele reference sequence databases for the highly polymorphic HLA gene system. Here, we report on producing a comprehensive collection of full-length HLA allele sequences for eight classical HLA loci found in the Japanese population. We augmented the second-generation short read data generated by the Ion Torrent technology with long amplicon spanning consensus reads delivered by the third-generation SMRT sequencing method to create reference grade high-quality sequences of HLA class I and II gene alleles resolved at the genomic coding and non-coding level. Forty-six DNAs were obtained from a reference set used previously to establish the HLA allele frequency data in Japanese subjects. The samples included alleles with a collective allele frequency in the Japanese population of more than 99.2%. The HLA loci were independently amplified by long-range PCR using previously designed HLA-locus specific primers and subsequently sequenced using SMRT and Ion PGM sequencers. The mapped long and short-reads were used to produce a reference library of consensus HLA allelic sequences with the help of the reference-aware software tool LAA for SMRT Sequencing. A total of 253 distinct alleles were determined for 46 healthy subjects. Of them, 137 were novel alleles: 101 SNVs and/or indels and 36 extended alleles at a partial or full-length level. Comparing the HLA sequences from the perspective of nucleotide diversity revealed that HLA-DRB1 was the most divergent among the eight HLA genes, and that the HLA-DPB1 gene sequences diverged into two distinct groups, DP2 and DP5, with evidence of independent polymorphisms generated in exon 2. We also identified two specific intronic variations in HLA-DRB1 that might be involved in rheumatoid arthritis. In conclusion, full-length HLA allele sequencing by third-generation and second-generation technologies has provided polymorphic gene reference sequences at a genomic allelic resolution including allelic variations assigned up to the field-4 level for a stronger foundation in precision medicine and HLA-related disease and transplantation studies

Directory of Open Access Journals

Frontiers - Publisher Connector

Partner specificity is essential for proper function of the SIX-type homeodomain proteins Sine oculis and Optix during fly eye development

Author: Cai Chuan Qi
Clouser Chris
Decene Gina
Kenyon Kristy L.
Pignoni Francesca
Ranade Swati
Tran Susan
Yang-Zhou Donghui
Publication venue: Elsevier Inc.
Publication date: 01/10/2005
Field of study

AbstractThe development of the Drosophila visual system utilizes two members of the highly conserved Six-Homeobox family of transcription factor, Sine oculis and Optix. Although in vitro studies have detected differences in DNA-binding and interactions with some co-factors, questions remain as to what extent the activity for these two transcriptional regulators is redundant or specific in vivo. In this work, we show that the SoD mutation within the Six domain does not abolish DNA–protein interactions, but alters co-factor binding specificity to resemble that of Optix. A mutation in the same region of Optix alters its activity in vivo. We propose that the dominant mutant phenotype is primarily due to an alteration in binding properties of the Sine oculis protein and that distinct partner interactions is one important mechanism in determining significant functional differences between these highly conserved factors during eye development

Elsevier - Publisher Connector

A high-resolution, nucleosome position map of C. elegans reveals a lack of universal sequence-dictated positioning

Author: Costa Gina
Fire Andrew
Ichikawa Jeffrey
Johnson Steven M.
Malek Joel A.
McKernan Kevin
Peckham Heather
Ranade Swati
Sidow Arend
Stuart Jeremy
Tonthat Thaisan
Valouev Anton
Zeng Kathy
Publication venue: Cold Spring Harbor Laboratory Press
Publication date
Field of study

Using the massively parallel technique of sequencing by oligonucleotide ligation and detection (SOLiD; Applied Biosystems), we have assessed the in vivo positions of more than 44 million putative nucleosome cores in the multicellular genetic model organism Caenorhabditis elegans. These analyses provide a global view of the chromatin architecture of a multicellular animal at extremely high density and resolution. While we observe some degree of reproducible positioning throughout the genome in our mixed stage population of animals, we note that the major chromatin feature in the worm is a diversity of allowed nucleosome positions at the vast majority of individual loci. While absolute positioning of nucleosomes can vary substantially, relative positioning of nucleosomes (in a repeated array structure likely to be maintained at least in part by steric constraints) appears to be a significant property of chromatin structure. The high density of nucleosomal reads enabled a substantial extension of previous analysis describing the usage of individual oligonucleotide sequences along the span of the nucleosome core and linker. We release this data set, via the UCSC Genome Browser, as a resource for the high-resolution analysis of chromatin conformation and DNA accessibility at individual loci within the C. elegans genome

Crossref

PubMed Central

Reconstructing complex regions of genomes using long-read sequencing technology

Author: Alkan Can
Antonacci Francesca
Chaisson Mark
Dennis Megan Y
Eichler Evan E
Graves Tina A
Hon Lawrence
Huddleston John
Korlach Jonas
Malig Maika
Ranade Swati
Sudmant Peter H
Turner Stephen W
Wilson Richard K
Publication venue: eScholarship, University of California
Publication date: 13/01/2014
Field of study

Obtaining high-quality sequence continuity of complex regions of recent segmental duplication remains one of the major challenges of finishing genome assemblies. In the human and mouse genomes, this was achieved by targeting large-insert clones using costly and laborious capillary-based sequencing approaches. Sanger shotgun sequencing of clone inserts, however, has now been largely abandoned, leaving most of these regions unresolved in newer genome assemblies generated primarily by next-generation sequencing hybrid approaches. Here we show that it is possible to resolve regions that are complex in a genome-wide context but simple in isolation for a fraction of the time and cost of traditional methods using long-read single molecule, real-time (SMRT) sequencing and assembly technology from Pacific Biosciences (PacBio). We sequenced and assembled BAC clones corresponding to a 1.3-Mbp complex region of chromosome 17q21.31, demonstrating 99.994% identity to Sanger assemblies of the same clones. We targeted 44 differences using Illumina sequencing and find that PacBio and Sanger assemblies share a comparable number of validated variants, albeit with different sequence context biases. Finally, we targeted a poorly assembled 766-kbp duplicated region of the chimpanzee genome and resolved the structure and organization for a fraction of the cost and time of traditional finishing approaches. Our data suggest a straightforward path for upgrading genomes to a higher quality finished state

Crossref

PubMed Central

eScholarship - University of California

Genome Reference and Sequence Variation in the Large Repetitive Central Exon of Human MUC5AC

Author: Corbin D Jones
David L Corcoran
Frank Boellmann
George Yuan
Hong Dang
Jaclyn R Stonebraker
Judith A Voynow
Max A Seibold
Michael R Knowles
Olivier Fedrigo
Prashamsha Haridass
Rhonda G Pace
Shuo Zheng
Swati S Ranade
Wanda K O'Neal
Xueliang Guo
Publication venue: 'American Thoracic Society'
Publication date: 01/01/2013
Field of study

Crossref

PubMed Central

Carolina Digital Repository

Human immunodeficiency virus/acquired immune deficiency syndrome: A relook into the challenge from an integrated, yogic perspective

Author: Aaron
Audet
Bagde
Baum
Bhargav
Bhargav
Bormann
Byadgi
Byadgi
Creswell
Data
Debnath
Duesberg
Green
Gupta
Hamilton
Harichandra
Hileman
Joore
Kalichman
Kalra
Kavitha
Koar
Makubi
Mills
Murphy
National
Nerad
Pande
Pennap
Ranade
Rochira
Rochira
Saini
Shailendra
Shankalala
Sharma
Shilwant
Siegel
Singh
Swati
Wong
Publication venue: 'Medknow'
Publication date: 01/01/2019
Field of study

Crossref