Search CORE

43 research outputs found

Effort required to finish shotgun-generated genome sequences differs significantly among vertebrates

Author: Barnabas Beatrice B
Blakesley Robert W
Bouffard Gerard G
Brooks Shelise Y
Coleman Holly
Green Eric D
Gupta Jyoti
Haghighi Payam
Hansen Nancy F
Ho Shi-Ling
Maskeri Baishali
McDowell Jennifer C
Schandler Karen
Stantripop Sirintorn
Thomas Pamela J
Vogt Jennifer L
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background The approaches for shotgun-based sequencing of vertebrate genomes are now well-established, and have resulted in the generation of numerous draft whole-genome sequence assemblies. In contrast, the process of refining those assemblies to improve contiguity and increase accuracy (known as 'sequence finishing') remains tedious, labor-intensive, and expensive. As a result, the vast majority of vertebrate genome sequences generated to date remain at a draft stage. Results To date, our genome sequencing efforts have focused on comparative studies of targeted genomic regions, requiring sequence finishing of large blocks of orthologous sequence (average size 0.5-2 Mb) from various subsets of 75 vertebrates. This experience has provided a unique opportunity to compare the relative effort required to finish shotgun-generated genome sequence assemblies from different species, which we report here. Importantly, we found that the sequence assemblies generated for the same orthologous regions from various vertebrates show substantial variation with respect to misassemblies and, in particular, the frequency and characteristics of sequence gaps. As a consequence, the work required to finish different species' sequences varied greatly. Application of the same standardized methods for finishing provided a novel opportunity to "assay" characteristics of genome sequences among many vertebrate species. It is important to note that many of the problems we have encountered during sequence finishing reflect unique architectural features of a particular vertebrate's genome, which in some cases may have important functional and/or evolutionary implications. Finally, based on our analyses, we have been able to improve our procedures to overcome some of these problems and to increase the overall efficiency of the sequence-finishing process, although significant challenges still remain. Conclusion Our findings have important implications for the eventual finishing of the draft whole-genome sequences that have now been generated for a large number of vertebrates.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Recommended from our members

Dependence of Lactose Metabolism upon Mutarotase Encoded in the gal Operon in Escherichia coli

Author: Adhya Sankar L
Bouffard Gerard G
Rudd Kenneth E
Publication venue: Elsevier Ltd
Publication date: 01/01/1994
Field of study

A new gene ( galM) has been identified as the fourth cistron of the gal operon, encoding enzymes for the metabolism of galactose and lactose in Escherichia coli. Induction of the gal operon either from the gal promoters or from a neighboring prophage λ promoter expresses the galM gene as well. The new structure of the gal operon from the promoter end is galE-galT-galK-galM in counter-clockwise orientation on the chromosome. Genetic and biochemical analyses have revealed that the galM gene product has mutarotase activity, which converts α-aldose to the β-anomer. Unlike mutarotase from other bacteria in which the enzyme is primarily processed for export and secretion, the mutarotase from E. coli does not appear to be processed and yet is still found in periplasm (and culture media when overexpressed) in significant amounts. Although the interconversion of the sugar anomers occurs spontaneously in pure water in vitro, the in vivo formation of α- d-galactopyranose (the substrate for phosphorylation) from β- d-galactopyranose (generated by β-galactosidase hydrolysis of lactose) is largely dependent upon the presence of the mutarotase. This shows that efficient lactose metabolism requires mutarotase. These results give credence to the idea that the activity of intracellular water is not high enough to permit a simple extrapolation of observed in vitro reactions to in vivo situations in every case

University of Miami: Scholarship Miami

Hembase: browser and genome portal for hematology and erythroid biology

Author: Bouffard Gerard G.
Goh Sung-Ho
Lee Y. Terry
Miller Jeffery L.
Publication venue: Oxford University Press
Publication date: 01/01/2004
Field of study

Hembase (http://hembase.niddk.nih.gov) is an integrated browser and genome portal designed for web-based examination of the human erythroid transcriptome. To date, Hembase contains 15 752 entries from erythroblast Expressed Sequenced Tags (ESTs) and 380 referenced genes relevant for erythropoiesis. The database is organized to provide a cytogenetic band position, a unique name as well as a concise annotation for each entry. Search queries may be performed by name, keyword or cytogenetic location. Search results are linked to primary sequence data and three major human genome browsers for access to information considered current at the time of each search. Hembase provides interested scientists and clinical hematologists with a genome-based approach toward the study of erythroid biology

CiteSeerX

Crossref

PubMed Central

A diversity profile of the human skin microbiota

Author: Blakesley Robert W.
Bouffard Gerard G.
Grice Elizabeth A.
Kong Heidi H.
Renaud Gabriel
Segre Julia A.
Turner Maria L.
Wolfsberg Tyra G.
Young Alice C.
Publication venue: Cold Spring Harbor Laboratory Press
Publication date
Field of study

The many layers and structures of the skin serve as elaborate hosts to microbes, including a diversity of commensal and pathogenic bacteria that contribute to both human health and disease. To determine the complexity and identity of the microbes inhabiting the skin, we sequenced bacterial 16S small-subunit ribosomal RNA genes isolated from the inner elbow of five healthy human subjects. This analysis revealed 113 operational taxonomic units (OTUs; “phylotypes”) at the level of 97% similarity that belong to six bacterial divisions. To survey all depths of the skin, we sampled using three methods: swab, scrape, and punch biopsy. Proteobacteria dominated the skin microbiota at all depths of sampling. Interpersonal variation is approximately equal to intrapersonal variation when considering bacterial community membership and structure. Finally, we report strong similarities in the complexity and identity of mouse and human skin microbiota. This study of healthy human skin microbiota will serve to direct future research addressing the role of skin microbiota in health and disease, and metagenomic projects addressing the complex physiological interactions between the skin and the microbes that inhabit this environment

Crossref

PubMed Central

Pericentromeric Duplications in the Laboratory Mouse

Author: Beckstrom-Sternberg Stephen M.
Blakesley Robert W.
Bouffard Gerard G.
Green Eric D.
Idol Jacquelyn R.
Lee-Lin Shih-Queen
Maduro Valerie V.B.
McDowell Jennifer C.
Program NISC Comparative Sequencing
Schueler Mary G.
Summers Tyrone J.
Thomas James W.
Thomas Pamela J.
Touchman Jeffrey W.
Publication venue: Cold Spring Harbor Laboratory Press
Publication date: 01/01/2003
Field of study

Duplications have long been postulated to be an important mechanism by which genomes evolve. Interspecies genomic comparisons are one method by which the origin and molecular mechanism of duplications can be inferred. By comparative mapping in human, mouse, and rat, we previously found evidence for a recent chromosome-fission event that occurred in the mouse lineage. Cytogenetic mapping revealed that the genomic segments flanking the fission site appeared to be duplicated, with copies residing near the centromere of multiple mouse chromosomes. Here we report the mapping and sequencing of the regions of mouse chromosomes 5 and 6 involved in this chromosome-fission event as well as the results of comparative sequence analysis with the orthologous human and rat genomic regions. Our data indicate that the duplications associated with mouse chromosomes 5 and 6 are recent and that the resulting duplicated segments share significant sequence similarity with a series of regions near the centromeres of the mouse chromosomes previously identified by cytogenetic mapping. We also identified pericentromeric duplicated segments shared between mouse chromosomes 5 and 1. Finally, novel mouse satellite sequences as well as putative chimeric transcripts were found to be associated with the duplicated segments. Together, these findings demonstrate that pericentromeric duplications are not restricted to primates and may be a common mechanism for genome evolution in mammals. [Supplemental material is available online at www.genome.org.

Crossref

PubMed Central

Identifications and Quantitative Measurements of Narcotics in Solid Mixtures Using Near-IR Raman Spectroscopy and Multivariate Analysis

Author: Akhavan
Alan G. Ryder
Archibald
Bakker-Schut
Bouffard
Carey
Cheng
Cooper
Daniel
Daniel
Everall
Fell
Frank
Freeman
Geladi
Gerard M. O'Connor
Hayward
Hendra
Kuptsov
Lawson
Lewis
Marteau
Sands
Shimoyama
Thomas J. Glynn
Tsuchihashi
Williams
Xue
Publication venue: 'ASTM International'
Publication date
Field of study

Crossref

Generation and Comparative Analysis of ∼3.3 Mb of Mouse Genomic Sequence Orthologous to the Region of Human Chromosome 7q11.23 Implicated in Williams Syndrome

Author: Beckstrom-Sternberg Stephen M.
Blakesley Robert W.
Bouffard Gerard G.
DeSilva Udaya
Dietrich Nicole L.
Doyle Johannah L.
Elnitski Laura
Gan Weiniu
Green Eric D.
Idol Jacquelyn R.
McDowell Jennifer C.
Miller Webb
Schwartz Scott
Thomas James W.
Thomas Pamela J.
Touchman Jeffrey W.
Publication venue: Cold Spring Harbor Laboratory Press
Publication date: 01/01/2002
Field of study

Williams syndrome is a complex developmental disorder that results from the heterozygous deletion of a ∼1.6-Mb segment of human chromosome 7q11.23. These deletions are mediated by large (∼300 kb) duplicated blocks of DNA of near-identical sequence. Previously, we showed that the orthologous region of the mouse genome is devoid of such duplicated segments. Here, we extend our studies to include the generation of ∼3.3 Mb of genomic sequence from the mouse Williams syndrome region, of which just over 1.4 Mb is finished to high accuracy. Comparative analyses of the mouse and human sequences within and immediately flanking the interval commonly deleted in Williams syndrome have facilitated the identification of nine previously unreported genes, provided detailed sequence-based information regarding 30 genes residing in the region, and revealed a number of potentially interesting conserved noncoding sequences. Finally, to facilitate comparative sequence analysis, we implemented several enhancements to the program PipMaker, including the addition of links from annotated features within a generated percent-identity plot to specific records in public databases. Taken together, the results reported here provide an important comparative sequence resource that should catalyze additional studies of Williams syndrome, including those that aim to characterize genes within the commonly deleted interval and to develop mouse models of the disorder. [The sequence data described in this paper have been submitted to GenBank under accession nos. AF267747, AF289666, AF289667, AF289664, AF289665, AC091250, AC079938, AC084109, AC024607, AC074359, AC024608, AC083858, AC083948, AC084162, AC087420, AC083890, AC080158, AC084402, AC083889, AC083857, and AC079872.

Crossref

PubMed Central