Search CORE

179 research outputs found

Rosenberger v. Rector & Visitors of the University of Virginia: The Myth of the Content Neutral Establishment Clause

Author: Salzberg Mark Daniel
Publication venue: FLASH: The Fordham Law Archive of Scholarship and History
Publication date: 01/03/1996
Field of study

bepress Legal Repository

Fordham University School of Law

Hypertension in the Kidney Transplant Recipient

Author: Heather H. Jones
Salzberg Daniel J.
Publication venue: 'IntechOpen'
Publication date: 17/08/2011
Field of study

IntechOpen

Hypertension in the Kidney Transplant Recipient

Author: Heather H. Jones
Salzberg Daniel J.
Publication venue: 'IntechOpen'
Publication date: 17/08/2011
Field of study

IntechOpen

Crossref

Minimus: a fast, lightweight genome assembler

Author: Delcher Arthur L
Pop Mihai
Salzberg Steven L
Sommer Daniel D
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

BACKGROUND: Genome assemblers have grown very large and complex in response to the need for algorithms to handle the challenges of large whole-genome sequencing projects. Many of the most common uses of assemblers, however, are best served by a simpler type of assembler that requires fewer software components, uses less memory, and is far easier to install and run. RESULTS: We have developed the Minimus assembler to address these issues, and tested it on a range of assembly problems. We show that Minimus performs well on several small assembly tasks, including the assembly of viral genomes, individual genes, and BAC clones. In addition, we evaluate Minimus' performance in assembling bacterial genomes in order to assess its suitability as a component of a larger assembly pipeline. We show that, unlike other software currently used for these tasks, Minimus produces significantly fewer assembly errors, at the cost of generating a more fragmented assembly. CONCLUSION: We find that for small genomes and other small assembly tasks, Minimus is faster and far more flexible than existing tools. Due to its small size and modular design Minimus is perfectly suited to be a component of complex assembly pipelines. Minimus is released as an open-source software project and the code is available as part of the AMOS project at Sourceforge

Crossref

Springer - Publisher Connector

PubMed Central

Digital Repository at the University of Maryland

Complete Columbian mammoth mitogenome suggests interbreeding with woolly mammoths

Author: Debruyne Regis
Devault Alison
Enk Jacob
Fisher Daniel
King Christine E
MacPhee Ross
O'Rourke Dennis
Poinar Hendrik
Salzberg Steven L
Treangen Todd
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Late Pleistocene North America hosted at least two divergent and ecologically distinct species of mammoth: the periglacial woolly mammoth (Mammuthus primigenius) and the subglacial Columbian mammoth (Mammuthus columbi). To date, mammoth genetic research has been entirely restricted to woolly mammoths, rendering their genetic evolution difficult to contextualize within broader Pleistocene paleoecology and biogeography. Here, we take an interspecific approach to clarifying mammoth phylogeny by targeting Columbian mammoth remains for mitogenomic sequencing. Results We sequenced the first complete mitochondrial genome of a classic Columbian mammoth, as well as the first complete mitochondrial genome of a North American woolly mammoth. Somewhat contrary to conventional paleontological models, which posit that the two species were highly divergent, the M. columbi mitogenome we obtained falls securely within a subclade of endemic North American M. primigenius. Conclusions Though limited, our data suggest that the two species interbred at some point in their evolutionary histories. One potential explanation is that woolly mammoth haplotypes entered Columbian mammoth populations via introgression at subglacial ecotones, a scenario with compelling parallels in extant elephants and consistent with certain regional paleontological observations. This highlights the need for multi-genomic data to sufficiently characterize mammoth evolutionary history. Our results demonstrate that the use of next-generation sequencing technologies holds promise in obtaining such data, even from non-cave, non-permafrost Pleistocene depositional contexts.http://deepblue.lib.umich.edu/bitstream/2027.42/112426/1/13059_2011_Article_2544.pd

Crossref

Springer - Publisher Connector

KU ScholarWorks

PubMed Central

Digital Repository at the University of Maryland

Deep Blue Documents at the University of Michigan

The Douglas-Fir Genome Sequence Reveals Specialization of the Photosynthetic Apparatus in Pinaceae.

Author: Cardeno Charis
Casola Claudio
Crepeau Marc W
Cronn Richard
Gonzalez-Ibeas Daniel
Holt Carson
Koralewski Tomasz E
Langley Charles H
McGuire Patrick E
Neale David B
Paul Robin
Pertea Geo M
Puiu Daniela
Salzberg Steven L
Sezen U Uzay
Stevens Kristian A
Wegrzyn Jill L
Wheeler Nicholas C
Yandell Mark
Yorke James A
Zaman Sumaira
Zimin Aleksey V
Publication venue: eScholarship, University of California
Publication date: 01/09/2017
Field of study

A reference genome sequence for Pseudotsuga menziesii var. menziesii (Mirb.) Franco (Coastal Douglas-fir) is reported, thus providing a reference sequence for a third genus of the family Pinaceae. The contiguity and quality of the genome assembly far exceeds that of other conifer reference genome sequences (contig N50 = 44,136 bp and scaffold N50 = 340,704 bp). Incremental improvements in sequencing and assembly technologies are in part responsible for the higher quality reference genome, but it may also be due to a slightly lower exact repeat content in Douglas-fir vs. pine and spruce. Comparative genome annotation with angiosperm species reveals gene-family expansion and contraction in Douglas-fir and other conifers which may account for some of the major morphological and physiological differences between the two major plant groups. Notable differences in the size of the NDH-complex gene family and genes underlying the functional basis of shade tolerance/intolerance were observed. This reference genome sequence not only provides an important resource for Douglas-fir breeders and geneticists but also sheds additional light on the evolutionary processes that have led to the divergence of modern angiosperms from the more ancient gymnosperms

Directory of Open Access Journals

eScholarship - University of California

Major data analysis errors invalidate cancer microbiome findings

Author: Brewer Daniel S.
Cooper Colin S.
Ge Yuchen
Gihawi Abraham
Lu Jennifer
Pertea Mihaela
Puiu Daniela
Salzberg Steven L.
Xu Amanda
Publication venue
Publication date: 01/10/2023
Field of study

We re-analyzed the data from a recent large-scale study that reported strong correlations between DNA signatures of microbial organisms and 33 different cancer types and that created machine-learning predictors with near-perfect accuracy at distinguishing among cancers. We found at least two fundamental flaws in the reported data and in the methods: (i) errors in the genome database and the associated computational methods led to millions of false-positive findings of bacterial reads across all samples, largely because most of the sequences identified as bacteria were instead human; and (ii) errors in the transformation of the raw data created an artificial signature, even for microbes with no reads detected, tagging each tumor type with a distinct signal that the machine-learning programs then used to create an apparently accurate classifier. Each of these problems invalidates the results, leading to the conclusion that the microbiome-based classifiers for identifying cancer presented in the study are entirely wrong. These flaws have subsequently affected more than a dozen additional published studies that used the same data and whose results are likely invalid as well

Directory of Open Access Journals

University of East Anglia digital repository

ccTSA: A Coverage-Centric Threaded Sequence Assembler

Author: B Marteb
BG Jackson
Carl Kingsford
D Culler
DC Richter
DR Kelley
ED Berger
FA Stephen
G Marçais
J Butler
JL Hennessy
JR Miller
JT Simpson
Jung Ho Ahn
PA Pevzner
R Li
RM Elaine
RZ Daniel
SL Salzberg
TF Smith
Z Wenyu
Z Zhang
Publication venue: Public Library of Science
Publication date: 19/06/2012
Field of study

De novo sequencing, a process to find the whole genome or the regions of a species without references, requires much higher computational power compared to mapped sequencing with references. The advent and continuous evolution of next-generation sequencing technologies further stress the demands of high-throughput processing of myriads of short DNA fragments. Recently announced sequence assemblers, such as Velvet, SOAPdenovo, and ABySS, all exploit parallelism to meet these computational demands since contemporary computer systems primarily rely on scaling the number of computing cores to improve performance. However, most of them are not tailored to exploit the full potential of these systems, leading to suboptimal performance. In this paper, we present ccTSA, a parallel sequence assembler that utilizes coverage to prune k-mers, find preferred edges, and resolve conflicts in preferred edges between k-mers. We minimize computation dependencies between threads to effectively parallelize k-mer processing. We also judiciously allocate and reuse memory space in order to lower memory usage and further improve sequencing speed. The results of ccTSA are compelling such that it runs several times faster than other assemblers while providing comparable quality values such as N50

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

A New Rhesus Macaque Assembly and Annotation for Next-Generation Sequencing Analyses

Author: Bosinger Steven E.
Cornish Adam S.
Ferguson Betsy
Fox Howard S.
Gibbs Robert M.
Johnson Zachary P.
Marçais Guillaume
Maudhoo Mnirnal D.
Meehan Daniel T.
Norgren Robert B.
Pandey Sanjit
Roberts Michael
Salzberg Steven L.
Tharp Gregory K.
Treangen Todd
Wipfler Kristin
Yorke James A.
Zhang Xiongfei
Zimin Aleksey V.
Publication venue: DigitalCommons@UNMC
Publication date: 01/01/2014
Field of study

BACKGROUND: The rhesus macaque (Macaca mulatta) is a key species for advancing biomedical research. Like all draft mammalian genomes, the draft rhesus assembly (rheMac2) has gaps, sequencing errors and misassemblies that have prevented automated annotation pipelines from functioning correctly. Another rhesus macaque assembly, CR_1.0, is also available but is substantially more fragmented than rheMac2 with smaller contigs and scaffolds. Annotations for these two assemblies are limited in completeness and accuracy. High quality assembly and annotation files are required for a wide range of studies including expression, genetic and evolutionary analyses. RESULTS: We report a new de novo assembly of the rhesus macaque genome (MacaM) that incorporates both the original Sanger sequences used to assemble rheMac2 and new Illumina sequences from the same animal. MacaM has a weighted average (N50) contig size of 64 kilobases, more than twice the size of the rheMac2 assembly and almost five times the size of the CR_1.0 assembly. The MacaM chromosome assembly incorporates information from previously unutilized mapping data and preliminary annotation of scaffolds. Independent assessment of the assemblies using Ion Torrent read alignments indicates that MacaM is more complete and accurate than rheMac2 and CR_1.0. We assembled messenger RNA sequences from several rhesus tissues into transcripts which allowed us to identify a total of 11,712 complete proteins representing 9,524 distinct genes. Using a combination of our assembled rhesus macaque transcripts and human transcripts, we annotated 18,757 transcripts and 16,050 genes with complete coding sequences in the MacaM assembly. Further, we demonstrate that the new annotations provide greatly improved accuracy as compared to the current annotations of rheMac2. Finally, we show that the MacaM genome provides an accurate resource for alignment of reads produced by RNA sequence expression studies. CONCLUSIONS: The MacaM assembly and annotation files provide a substantially more complete and accurate representation of the rhesus macaque genome than rheMac2 or CR_1.0 and will serve as an important resource for investigators conducting next-generation sequencing studies with nonhuman primates. REVIEWERS: This article was reviewed by Dr. Lutz Walter, Dr. Soojin Yi and Dr. Kateryna Makova

Crossref

Springer - Publisher Connector

PubMed Central

Digital Repository at the University of Maryland

University of Nebraska Medical Center Research: DigitalCommons@UNMC