236 research outputs found

    Reducing the Effects of PCR Amplification and Sequencing Artifacts on 16S rRNA-Based Studies

    Get PDF
    The advent of next generation sequencing has coincided with a growth in interest in using these approaches to better understand the role of the structure and function of the microbial communities in human, animal, and environmental health. Yet, use of next generation sequencing to perform 16S rRNA gene sequence surveys has resulted in considerable controversy surrounding the effects of sequencing errors on downstream analyses. We analyzed 2.7×10[superscript 6] reads distributed among 90 identical mock community samples, which were collections of genomic DNA from 21 different species with known 16S rRNA gene sequences; we observed an average error rate of 0.0060. To improve this error rate, we evaluated numerous methods of identifying bad sequence reads, identifying regions within reads of poor quality, and correcting base calls and were able to reduce the overall error rate to 0.0002. Implementation of the PyroNoise algorithm provided the best combination of error rate, sequence length, and number of sequences. Perhaps more problematic than sequencing errors was the presence of chimeras generated during PCR. Because we knew the true sequences within the mock community and the chimeras they could form, we identified 8% of the raw sequence reads as chimeric. After quality filtering the raw sequences and using the Uchime chimera detection program, the overall chimera rate decreased to 1%. The chimeras that could not be detected were largely responsible for the identification of spurious operational taxonomic units (OTUs) and genus-level phylotypes. The number of spurious OTUs and phylotypes increased with sequencing effort indicating that comparison of communities should be made using an equal number of sequences. Finally, we applied our improved quality-filtering pipeline to several benchmarking studies and observed that even with our stringent data curation pipeline, biases in the data generation pipeline and batch effects were observed that could potentially confound the interpretation of microbial community data.National Institutes of Health (U.S.) (1R01HG005975-01)National Science Foundation (U.S.) (award #0743432)National Institutes of Health (U.S.) (grant NIHU54HG004969

    Microbial Co-occurrence Relationships in the Human Microbiome

    Get PDF
    The healthy microbiota show remarkable variability within and among individuals. In addition to external exposures, ecological relationships (both oppositional and symbiotic) between microbial inhabitants are important contributors to this variation. It is thus of interest to assess what relationships might exist among microbes and determine their underlying reasons. The initial Human Microbiome Project (HMP) cohort, comprising 239 individuals and 18 different microbial habitats, provides an unprecedented resource to detect, catalog, and analyze such relationships. Here, we applied an ensemble method based on multiple similarity measures in combination with generalized boosted linear models (GBLMs) to taxonomic marker (16S rRNA gene) profiles of this cohort, resulting in a global network of 3,005 significant co-occurrence and co-exclusion relationships between 197 clades occurring throughout the human microbiome. This network revealed strong niche specialization, with most microbial associations occurring within body sites and a number of accompanying inter-body site relationships. Microbial communities within the oropharynx grouped into three distinct habitats, which themselves showed no direct influence on the composition of the gut microbiota. Conversely, niches such as the vagina demonstrated little to no decomposition into region-specific interactions. Diverse mechanisms underlay individual interactions, with some such as the co-exclusion of Porphyromonaceae family members and Streptococcus in the subgingival plaque supported by known biochemical dependencies. These differences varied among broad phylogenetic groups as well, with the Bacilli and Fusobacteria, for example, both enriched for exclusion of taxa from other clades. Comparing phylogenetic versus functional similarities among bacteria, we show that dominant commensal taxa (such as Prevotellaceae and Bacteroides in the gut) often compete, while potential pathogens (e.g. Treponema and Prevotella in the dental plaque) are more likely to co-occur in complementary niches. This approach thus serves to open new opportunities for future targeted mechanistic studies of the microbial ecology of the human microbiome

    Metagenomic biomarker discovery and explanation

    Get PDF
    This study describes and validates a new method for metagenomic biomarker discovery by way of class comparison, tests of biological consistency and effect size estimation. This addresses the challenge of finding organisms, genes, or pathways that consistently explain the differences between two or more microbial communities, which is a central problem to the study of metagenomics. We extensively validate our method on several microbiomes and a convenient online interface for the method is provided at http://huttenhower.sph.harvard.edu/lefse/.National Institute of Dental and Craniofacial Research (U.S.) (grant DE017106)National Institutes of Health (U.S.) (NIH grant AI078942)Burroughs Wellcome FundNational Institutes of Health (U.S.) (NIH 1R01HG005969

    Relating the metatranscriptome and metagenome of the human gut

    Get PDF
    Although the composition of the human microbiome is now wellstudied, the microbiota’s \u3e8 million genes and their regulation remain largely uncharacterized. This knowledge gap is in part because of the difficulty of acquiring large numbers of samples amenable to functional studies of the microbiota. We conducted what is, to our knowledge, one of the first human microbiome studies in a well-phenotyped prospective cohort incorporating taxonomic, metagenomic, and metatranscriptomic profiling at multiple body sites using self-collected samples. Stool and saliva were provided by eight healthy subjects, with the former preserved by three different methods (freezing, ethanol, and RNAlater) to validate self-collection. Within-subject microbial species, gene, and transcript abundances were highly concordant across sampling methods, with only a small fraction of transcripts (\u3c5%) displaying between-method variation. Next, we investigated relationships between the oral and gut microbial communities, identifying a subset of abundant oral microbes that routinely survive transit to the gut, but with minimal transcriptional activity there. Finally, systematic comparison of the gut metagenome and metatranscriptome revealed that a substantial fraction (41%) of microbial transcripts were not differentially regulated relative to their genomic abundances. Of the remainder, consistently underexpressed pathways included sporulation and amino acid biosynthesis, whereas up-regulated pathways included ribosome biogenesis and methanogenesis. Across subjects, metatranscriptional profiles were significantly more individualized than DNA-level functional profiles, but less variable than microbial composition, indicative of subject-specific whole-community regulation. The results thus detail relationships between community genomic potential and gene expression in the gut, and establish the feasibility of metatranscriptomic investigations in subject-collected and shipped samples

    Composition of the Adult Digestive Tract Bacterial Microbiome Based on Seven Mouth Surfaces, Tonsils, Throat and Stool Samples

    Get PDF
    Background: To understand the relationship between our bacterial microbiome and health, it is essential to define the microbiome in the absence of disease. The digestive tract includes diverse habitats and hosts the human body's greatest bacterial density. We describe the bacterial community composition of ten digestive tract sites from more than 200 normal adults enrolled in the Human Microbiome Project, and metagenomically determined metabolic potentials of four representative sites. Results: The microbiota of these diverse habitats formed four groups based on similar community compositions: buccal mucosa, keratinized gingiva, hard palate; saliva, tongue, tonsils, throat; sub- and supra-gingival plaques; and stool. Phyla initially identified from environmental samples were detected throughout this population, primarily TM7, SR1, and Synergistetes. Genera with pathogenic members were well-represented among this disease-free cohort. Tooth-associated communities were distinct, but not entirely dissimilar, from other oral surfaces. The Porphyromonadaceae, Veillonellaceae and Lachnospiraceae families were common to all sites, but the distributions of their genera varied significantly. Most metabolic processes were distributed widely throughout the digestive tract microbiota, with variations in metagenomic abundance between body habitats. These included shifts in sugar transporter types between the supragingival plaque, other oral surfaces, and stool; hydrogen and hydrogen sulfide production were also differentially distributed. Conclusions: The microbiomes of ten digestive tract sites separated into four types based on composition. A core set of metabolic pathways was present across these diverse digestive tract habitats. These data provide a critical baseline for future studies investigating local and systemic diseases affecting human health

    Advancing the Microbiome Research Community

    Get PDF
    The human microbiome has become a recognized factor in promoting and maintaining health. We outline opportunities in interdisciplinary research, analytical rigor, standardization, and policy development for this relatively new and rapidly developing field. Advances in these aspects of the research community may in turn advance our understanding of human microbiome biology. It is now widely recognized that disturbances in our normal microbial populations may be linked to acute infections such as Clostridium difficile and to chronic diseases such as heart disease, cancer, obesity, and autoimmune disorders (Clemente et al., 2012). This has prompted substantial interest in the microbiome from both basic and clinical perspectives. Although our genome is relatively static throughout life, each of our microbial communities changes profoundly from infancy through adulthood, continuing to adapt through ongoing exposures to diet, drugs and environment. Understanding the microbiome and its dynamic nature may be critical for diagnostics and, eventually, interventions based on the microbiome itself. However, several important challenges limit the ability of researchers to enter the microbiome field and/or conduct research most effectively

    The Human Microbiome Project: A Community Resource for the Healthy Human Microbiome

    Get PDF
    The Human Microbiome Project (HMP) [1],[2] is a concept that was long in the making. After the Human Genome Project, interest grew in sequencing the “other genome" of microbes carried in and on the human body [3],[4]. Microbial ecologists, realizing that >99% of environmental microbes could not be easily cultured, developed approaches to study microorganisms in situ [5], primarily by sequencing the 16S ribosomal RNA gene (16S) as a phylogenetic and taxonomic marker to identify members of microbial communities [6]. The need to develop corresponding new methods for culture-independent studies [7],[8] in turn precipitated a sea change in the study of microbes and human health, inspiring the new term “metagenomics" [9] both to describe a technological approach—sequencing and analysis of the genes from whole communities rather than from individual genomes—and to emphasize that microbes function within communities rather than as individual species. This shift from a focus on individual organisms to microbial interactions [10] culminated in a National Academy of Science report [11], which outlined challenges and promises for metagenomics as a way of understanding the foundational role of microbial communities both in the environment and in human health.National Institutes of Health (U.S.) (grant U54HG004969)National Institutes of Health (U.S.) (grant U54HG004973)National Institutes of Health (U.S.) (grant U54AI084844)National Institutes of Health (U.S.) (grant U01HG004866)National Institutes of Health (U.S.) (grant R01HG005969)National Institutes of Health (U.S.) (grant R01HG004872)United States. Army Research Office (grant W911NF-11-1-0473)National Science Foundation (U.S.) (NSF DBI-1053486)Howard Hughes Medical Institute (Early Career Scientist
    corecore