217 research outputs found

    Digital PCR provides sensitive and absolute calibration for high throughput sequencing

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Next-generation DNA sequencing on the 454, Solexa, and SOLiD platforms requires absolute calibration of the number of molecules to be sequenced. This requirement has two unfavorable consequences. First, large amounts of sample-typically micrograms-are needed for library preparation, thereby limiting the scope of samples which can be sequenced. For many applications, including metagenomics and the sequencing of ancient, forensic, and clinical samples, the quantity of input DNA can be critically limiting. Second, each library requires a titration sequencing run, thereby increasing the cost and lowering the throughput of sequencing.</p> <p>Results</p> <p>We demonstrate the use of digital PCR to accurately quantify 454 and Solexa sequencing libraries, enabling the preparation of sequencing libraries from nanogram quantities of input material while eliminating costly and time-consuming titration runs of the sequencer. We successfully sequenced low-nanogram scale bacterial and mammalian DNA samples on the 454 FLX and Solexa DNA sequencing platforms. This study is the first to definitively demonstrate the successful sequencing of picogram quantities of input DNA on the 454 platform, reducing the sample requirement more than 1000-fold without pre-amplification and the associated bias and reduction in library depth.</p> <p>Conclusion</p> <p>The digital PCR assay allows absolute quantification of sequencing libraries, eliminates uncertainties associated with the construction and application of standard curves to PCR-based quantification, and with a coefficient of variation close to 10%, is sufficiently precise to enable direct sequencing without titration runs.</p

    Methods for comparative metagenomics

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Metagenomics is a rapidly growing field of research that aims at studying uncultured organisms to understand the true diversity of microbes, their functions, cooperation and evolution, in environments such as soil, water, ancient remains of animals, or the digestive system of animals and humans. The recent development of ultra-high throughput sequencing technologies, which do not require cloning or PCR amplification, and can produce huge numbers of DNA reads at an affordable cost, has boosted the number and scope of metagenomic sequencing projects. Increasingly, there is a need for new ways of comparing multiple metagenomics datasets, and for fast and user-friendly implementations of such approaches.</p> <p>Results</p> <p>This paper introduces a number of new methods for interactively exploring, analyzing and comparing multiple metagenomic datasets, which will be made freely available in a new, comparative version 2.0 of the stand-alone metagenome analysis tool MEGAN.</p> <p>Conclusion</p> <p>There is a great need for powerful and user-friendly tools for comparative analysis of metagenomic data and MEGAN 2.0 will help to fill this gap.</p

    Short clones or long clones? A simulation study on the use of paired reads in metagenomics

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Metagenomics is the study of environmental samples using sequencing. Rapid advances in sequencing technology are fueling a vast increase in the number and scope of metagenomics projects. Most metagenome sequencing projects so far have been based on Sanger or Roche-454 sequencing, as only these technologies provide long enough reads, while Illumina sequencing has not been considered suitable for metagenomic studies due to a short read length of only 35 bp. However, now that reads of length 75 bp can be sequenced in pairs, Illumina sequencing has become a viable option for metagenome studies.</p> <p>Results</p> <p>This paper addresses the problem of taxonomical analysis of paired reads. We describe a new feature of our metagenome analysis software MEGAN that allows one to process sequencing reads in pairs and makes assignments of such reads based on the combined bit scores of their matches to reference sequences. Using this new software in a simulation study, we investigate the use of Illumina paired-sequencing in taxonomical analysis and compare the performance of single reads, short clones and long clones. In addition, we also compare against simulated Roche-454 sequencing runs.</p> <p>Conclusion</p> <p>This work shows that paired reads perform better than single reads, as expected, but also, perhaps slightly less obviously, that long clones allow more specific assignments than short ones. A new version of the program MEGAN that explicitly takes paired reads into account is available from our website.</p

    Genetic characterization of Erve virus, a European Nairovirus distantly related to Crimean-Congo hemorrhagic fever virus

    Get PDF
    Erve virus (ERVEV) is a European Nairovirus that is suspected to cause severe headache (thunderclap headache) and intracerebral hemorrhage. The mode of transmission to humans (ticks or mosquitoes) is still unknown. Currently, no standardized testing method for ERVEV exists and only a small partial sequence of the polymerase gene is available. Here, we present the first complete genome sequence of ERVEV S, M, and L segments. Phylogenetic comparison of the amino acid sequence of the L-protein (RNA-dependent RNA polymerase) revealed only 48 % homology to available L-protein sequences of other Nairoviruses like Crimean-Congo hemorrhagic fever virus, Nairobi sheep disease virus, Hazara virus, Kupe virus, and Dugbe virus. Among themselves, these Nairoviruses show 62-89 % homology in the L-protein sequences. Therefore, ERVEV seems to be only distantly related to other Nairoviruses. The new sequence data can be used for the development of diagnostic methods and the identification of the natural vector

    The PhyloPythiaS Web Server for Taxonomic Assignment of Metagenome Sequences

    Get PDF
    Metagenome sequencing is becoming common and there is an increasing need for easily accessible tools for data analysis. An essential step is the taxonomic classification of sequence fragments. We describe a web server for the taxonomic assignment of metagenome sequences with PhyloPythiaS. PhyloPythiaS is a fast and accurate sequence composition-based classifier that utilizes the hierarchical relationships between clades. Taxonomic assignments with the web server can be made with a generic model, or with sample-specific models that users can specify and create. Several interactive visualization modes and multiple download formats allow quick and convenient analysis and downstream processing of taxonomic assignments. Here, we demonstrate usage of our web server by taxonomic assignment of metagenome samples from an acidophilic biofilm community of an acid mine and of a microbial community from cow rumen

    Heterogeneity assessment of functional T cell avidity.

    Get PDF
    The potency of cellular immune responses strongly depends on T cell avidity to antigen. Yet, functional avidity measurements are rarely performed in patients, mainly due to the technical challenges of characterizing heterogeneous T cells. The mean functional T cell avidity can be determined by the IFN-γ Elispot assay, with titrated amounts of peptide. Using this assay, we developed a method revealing the heterogeneity of functional avidity, represented by the steepness/hillslope of the peptide titration curve, documented by proof of principle experiments and mathematical modeling. Our data show that not only natural polyclonal CD8 T cell populations from cancer patients, but also monoclonal T cells differ strongly in their heterogeneity of functional avidity. Interestingly, clones and polyclonal cells displayed comparable ranges of heterogeneity. We conclude that besides the mean functional avidity, it is feasible and useful to determine its heterogeneity (hillslope) for characterizing T cell responses in basic research and patient investigation

    RAIphy: Phylogenetic classification of metagenomics samples using iterative refinement of relative abundance index profiles

    Get PDF
    Background: Computational analysis of metagenomes requires the taxonomical assignment of the genome contigs assembled from DNA reads of environmental samples. Because of the diverse nature of microbiomes, the length of the assemblies obtained can vary between a few hundred bp to a few hundred Kbp. Current taxonomic classification algorithms provide accurate classification for long contigs or for short fragments from organisms that have close relatives with annotated genomes. These are significant limitations for metagenome analysis because of the complexity of microbiomes and the paucity of existing annotated genomes. Results: We propose a robust taxonomic classification method, RAIphy, that uses a novel sequence similarity metric with iterative refinement of taxonomic models and functions effectively without these limitations. We have tested RAIphy with synthetic metagenomics data ranging between 100 bp to 50 Kbp. Within a sequence read range of 100 bp-1000 bp, the sensitivity of RAIphy ranges between 38%-81% outperforming the currently popular composition-based methods for reads in this range. Comparison with computationally more intensive sequence similarity methods shows that RAIphy performs competitively while being significantly faster. The sensitivityspecificity characteristics for relatively longer contigs were compared with the PhyloPythia and TACOA algorithms. RAIphy performs better than these algorithms at varying clade-levels. For an acid mine drainage (AMD) metagenome, RAIphy was able to taxonomically bin the sequence read set more accurately than the currently available methods, Phymm and MEGAN, and more accurately in two out of three tests than the much more computationally intensive method, PhymmBL. Conclusions: With the introduction of the relative abundance index metric and an iterative classification method, we propose a taxonomic classification algorithm that performs competitively for a large range of DNA contig lengths assembled from metagenome data. Because of its speed, simplicity, and accuracy RAIphy can be successfully used in the binning process for a broad range of metagenomic data obtained from environmental samples

    Assembly complexity of prokaryotic genomes using short reads

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>De Bruijn graphs are a theoretical framework underlying several modern genome assembly programs, especially those that deal with very short reads. We describe an application of de Bruijn graphs to analyze the global repeat structure of prokaryotic genomes.</p> <p>Results</p> <p>We provide the first survey of the repeat structure of a large number of genomes. The analysis gives an upper-bound on the performance of genome assemblers for <it>de novo </it>reconstruction of genomes across a wide range of read lengths. Further, we demonstrate that the majority of genes in prokaryotic genomes can be reconstructed uniquely using very short reads even if the genomes themselves cannot. The non-reconstructible genes are overwhelmingly related to mobile elements (transposons, IS elements, and prophages).</p> <p>Conclusions</p> <p>Our results improve upon previous studies on the feasibility of assembly with short reads and provide a comprehensive benchmark against which to compare the performance of the short-read assemblers currently being developed.</p

    Differential Development of Human Brain White Matter Tracts

    Get PDF
    Neuroscience is increasingly focusing on developmental factors related to human structural and functional connectivity. Unfortunately, to date, diffusion-based imaging approaches have only contributed modestly to these broad objectives, despite the promise of diffusion-based tractography. Here, we report a novel data-driven approach to detect similarities and differences among white matter tracts with respect to their developmental trajectories, using 64-direction diffusion tensor imaging. Specifically, using a cross-sectional sample comprising 144 healthy individuals (7 to 48 years old), we applied k-means cluster analysis to separate white matter voxels based on their age-related trajectories of fractional anisotropy. Optimal solutions included 5-, 9- and 14-clusters. Our results recapitulate well-established tracts (e.g., internal and external capsule, optic radiations, corpus callosum, cingulum bundle, cerebral peduncles) and subdivisions within tracts (e.g., corpus callosum, internal capsule). For all but one tract identified, age-related trajectories were curvilinear (i.e., inverted ‘U-shape’), with age-related increases during childhood and adolescence followed by decreases in middle adulthood. Identification of peaks in the trajectories suggests that age-related losses in fractional anisotropy occur as early as 23 years of age, with mean onset at 30 years of age. Our findings demonstrate that data-driven analytic techniques may be fruitfully applied to extant diffusion tensor imaging datasets in normative and neuropsychiatric samples
    corecore