
    High-resolution microbial community reconstruction by integrating short reads from multiple 16S rRNA regions

    The emergence of massively parallel sequencing technology has revolutionized microbial profiling, allowing the unprecedented comparison of microbial diversity across time and space in a wide range of host-associated and environmental ecosystems. Although the high-throughput nature of such methods enables the detection of low-frequency bacteria, these advances come at the cost of sequencing read length, limiting the phylogenetic resolution possible by current methods. Here, we present a generic approach for integrating short reads from large genomic regions, thus enabling phylogenetic resolution far exceeding current methods. The approach is based on a mapping to a statistical model that is later solved as a constrained optimization problem. We demonstrate the utility of this method by analyzing human saliva and Drosophila samples, using Illumina single-end sequencing of a 750 bp amplicon of the 16S rRNA gene. Phylogenetic resolution is significantly extended while reducing the number of falsely detected bacteria, as compared with standard single-region Roche 454 Pyrosequencing. Our approach can be seamlessly applied to simultaneous sequencing of multiple genes, providing a higher resolution view of the composition and activity of complex microbial communities.

    Divergent responses of viral and bacterial communities in the gut microbiome to dietary disturbances in mice

    To improve our understanding of the stability of mammalian intestinal communities, we characterized the responses of both bacterial and viral communities in murine fecal samples to dietary changes between high- and low-fat (LF) diets. Targeted DNA extraction methods for bacteria, virus-like particles and induced prophages were used to generate bacterial and viral metagenomes as well as 16S ribosomal RNA amplicons. Gut microbiome communities from two cohorts of C57BL/6 mice were characterized in a 6-week diet perturbation study in response to high fiber, LF and high-refined sugar, milkfat (MF) diets. The resulting metagenomes from induced bacterial prophages and extracellular viruses showed significant overlap, supporting a largely temperate viral lifestyle within these gut microbiomes. The resistance of baseline communities to dietary disturbances was evaluated, and we observed contrasting responses of baseline LF and MF bacterial and viral communities. In contrast to baseline LF viral communities and bacterial communities in both diet treatments, baseline MF viral communities were sensitive to dietary disturbances as reflected in their non-recovery during the washout period. The contrasting responses of bacterial and viral communities suggest that these communities can respond to perturbations independently of each other and highlight the potentially unique role of viruses in gut health.

    MetaPath: identifying differentially abundant metabolic pathways in metagenomic datasets

    Enabled by rapid advances in sequencing technology, metagenomic studies aim to characterize entire communities of microbes, bypassing the need for culturing individual bacterial members. One major goal of metagenomic studies is to identify specific functional adaptations of microbial communities to their habitats. The functional profile and the abundances for a sample can be estimated by mapping metagenomic sequences to the global metabolic network consisting of thousands of molecular reactions. Here we describe a powerful analytical method (MetaPath) that can identify differentially abundant pathways in metagenomic datasets, relying on a combination of metagenomic sequence data and prior metabolic pathway knowledge. First, we introduce a scoring function for an arbitrary subnetwork and find the max-weight subnetwork in the global network by a greedy search algorithm. Then we compute two p values (p_abund and p_struct) using nonparametric approaches to answer two different statistical questions: (1) Is this subnetwork differentially abundant? (2) What is the probability of finding such good subnetworks by chance given the data and network structure? Finally, significant metabolic subnetworks are discovered based on these two p values. In order to validate our methods, we have designed a simulated metabolic pathways dataset and show that MetaPath outperforms other commonly used approaches. We also demonstrate the power of our methods in analyzing two publicly available metagenomic datasets, and show that the subnetworks identified by MetaPath provide valuable insights into the biological activities of the microbiome. We have introduced a statistical method for finding significant metabolic subnetworks from metagenomic datasets. Compared with previous methods, results from MetaPath are more robust against noise in the data, and have significantly higher sensitivity and specificity (when tested on simulated datasets). When applied to two publicly available metagenomic datasets, the output of MetaPath is consistent with previous observations and also provides several new insights into the metabolic activity of the gut microbiome. The software is freely available at http://metapath.cbcb.umd.edu. https://doi.org/10.1186/1753-6561-5-S2-S
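The greedy search step mentioned in the MetaPath abstract can be illustrated with a minimal Python sketch. It assumes the score of a subnetwork is simply the sum of per-node scores, which is a placeholder and not the scoring function from the paper. The names `greedy_subnetwork`, `adjacency` and `scores` are hypothetical, and the toy network is invented for illustration only.

```python
def greedy_subnetwork(adjacency, scores):
    """Grow a connected subnetwork greedily from the best-scoring node.

    adjacency: dict mapping node -> iterable of neighbouring nodes (assumed input).
    scores:    dict mapping node -> numeric score, e.g. a differential
               abundance statistic (assumed; higher means more interesting).
    Returns (set_of_nodes, total_score), where the total is the sum of node
    scores. This additive score is an illustrative assumption.
    """
    seed = max(scores, key=scores.get)
    chosen = {seed}
    total = scores[seed]
    while True:
        # Candidate nodes: neighbours of the current subnetwork not yet included.
        frontier = {
            nbr
            for node in chosen
            for nbr in adjacency.get(node, ())
            if nbr not in chosen
        }
        if not frontier:
            break
        # Sort first so that ties are broken deterministically.
        best = max(sorted(frontier), key=lambda n: scores[n])
        if scores[best] <= 0:
            break  # no neighbour improves the total score
        chosen.add(best)
        total += scores[best]
    return chosen, total


# Toy network: reactions A-D with made-up scores.
adjacency = {"A": ["B", "C"], "B": ["A", "D"], "C": ["A"], "D": ["B"]}
scores = {"A": 2.0, "B": 1.0, "C": -0.5, "D": 3.0}
nodes, total = greedy_subnetwork(adjacency, scores)
```

A greedy search of this kind is fast but can miss high-scoring nodes that sit behind a negatively scored neighbour. Assessing the found subnetwork against chance would need a separate step, such as the nonparametric p values the abstract describes.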

    Use of dietary indices to control for diet in human gut microbiota studies

    Background: Environmental factors have a large influence on the composition of the human gut microbiota. One of the most influential and well-studied is host diet. To assess and interpret the impact of non-dietary factors on the gut microbiota, we endeavoured to determine the most appropriate method to summarise community variation attributable to dietary effects. Dietary habits are multidimensional with internal correlations. This complexity can be simplified by using dietary indices that quantify dietary variance in a single measure and offer a means of controlling for diet in microbiota studies. However, to date, the applicability of different dietary indices to gut microbiota studies has not been assessed. Here, we use food frequency questionnaire (FFQ) data from members of the TwinsUK cohort to create three different dietary measures applicable in western-diet populations: the Healthy Eating Index (HEI), the Mediterranean Diet Score (MDS) and the Healthy Food Diversity index (HFD-Index). We validate and compare these three indices to determine which best summarises dietary influences on gut microbiota composition. Results: All three indices were independently validated using established measures of health, and all were significantly associated with microbiota measures; the HEI had the highest t values in models of alpha diversity measures, and had the highest number of associations with microbial taxa. Beta diversity analyses showed the HEI explained the greatest variance of microbiota composition. In paired tests between twins discordant for dietary index score, the HEI was associated with the greatest variation of taxa and twin dissimilarity. Conclusions: We find that the HEI explains the most variance in, and has the strongest association with, gut microbiota composition in a western (UK) population, suggesting that it may be the best summary measure to capture gut microbiota variance attributable to habitual diet in comparable populations.

    Accurate Genome Relative Abundance Estimation Based on Shotgun Metagenomic Reads

    Accurate estimation of microbial community composition based on metagenomic sequencing data is fundamental for subsequent metagenomics analysis. Prevalent estimation methods are mainly based on directly summarizing alignment results or variants thereof, and often produce biased and/or unstable estimates. We have developed a unified probabilistic framework (named GRAMMy) by explicitly modeling read assignment ambiguities, genome size biases and read distributions along the genomes. A maximum likelihood method is employed to compute Genome Relative Abundance of microbial communities using the Mixture Model theory (GRAMMy). GRAMMy has been demonstrated to give estimates that are accurate and robust across both simulated and real read benchmark datasets. We applied GRAMMy to a collection of 34 metagenomic read sets from four metagenomics projects and identified 99 frequent species (minimally 0.5% abundant in at least 50% of the datasets) in the human gut samples. Our results show substantial improvements over previous studies, such as adjusting the over-estimated abundance for Bacteroides species for human gut samples, by providing a new reference-based strategy for metagenomic sample comparisons. GRAMMy can be used flexibly with many read assignment tools (mapping, alignment or composition-based) even with low-sensitivity mapping results from huge short-read datasets. It will be increasingly useful as an accurate and robust tool for abundance estimation with the growing size of read sets and the expanding database of reference genomes.
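The idea of maximum likelihood estimation under a mixture model with ambiguous read assignments can be sketched in a few lines of Python using expectation-maximization (EM). This is not the GRAMMy implementation. The input format (`read_probs`), the function names and the length normalization in `relative_abundance` are assumptions made for illustration; the abstract states only that read ambiguity and genome size biases are modeled.

```python
def em_mixing_proportions(read_probs, n_iter=200, tol=1e-10):
    """Estimate mixing proportions of genomes by EM.

    read_probs[i][j] is the (assumed, precomputed) probability of observing
    read i if it came from genome j; ambiguous reads have several non-zero
    entries. Returns the fraction of reads attributed to each genome.
    """
    n_genomes = len(read_probs[0])
    pi = [1.0 / n_genomes] * n_genomes
    for _ in range(n_iter):
        counts = [0.0] * n_genomes
        # E-step: split each read fractionally among candidate genomes.
        for probs in read_probs:
            weighted = [p * w for p, w in zip(probs, pi)]
            total = sum(weighted)
            if total == 0.0:
                continue  # read is incompatible with every genome; ignore it
            for j, w in enumerate(weighted):
                counts[j] += w / total
        assigned = sum(counts)
        if assigned == 0.0:
            raise ValueError("no read could be assigned to any genome")
        # M-step: proportions are the normalized expected read counts.
        new_pi = [c / assigned for c in counts]
        converged = max(abs(a - b) for a, b in zip(pi, new_pi)) < tol
        pi = new_pi
        if converged:
            break
    return pi


def relative_abundance(pi, genome_lengths):
    """Convert read proportions to genome proportions by dividing by length.

    Longer genomes yield more reads per cell, so read share is divided by
    genome length and renormalized (an assumed bias correction).
    """
    per_base = [p / length for p, length in zip(pi, genome_lengths)]
    total = sum(per_base)
    return [x / total for x in per_base]


# Toy data: three reads map only to genome 0, one read only to genome 1.
reads = [[1.0, 0.0], [1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
pi = em_mixing_proportions(reads)
abundance = relative_abundance(pi, genome_lengths=[3.0, 1.0])
```

In this toy case genome 0 receives three times as many reads but is also three times as long, so the two genomes come out equally abundant after length correction.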

    Lifestyle and geographic insights into the distinct gut microbiota in elderly women from two different geographic locations

    BACKGROUND: A large number of microorganisms reside within the gastrointestinal tract, especially in the colon, and play important roles in human health and disease. The composition of the human gut microbiota is determined by intrinsic host factors and environmental factors. While investigating environmental factors to promote human health is of great interest, few studies have focused on their effect on the gut microbiota. This study aimed to investigate differences in gut microbiota composition according to lifestyle and geographical area, even in people with similar genetic background. METHODS: We enrolled ten and nine elderly women in their seventies from island and inland areas, respectively. Fecal samples were obtained from individuals, and bacterial 16S ribosomal RNA genes were analyzed by next-generation sequencing to define the gut microbiota composition. We assessed their diet, which can influence the gut microbial community. We also conducted physical examination and determined the physical activity levels of the subjects. RESULTS: The inland subjects had a significantly higher rectal temperature, systolic blood pressure, and heart rate and a significantly lower physical activity score than the island subjects. Fecal samples from the island group showed a tendency to have greater microbial diversity than those from the inland group. Interestingly, the microbial community composition differed significantly between the two groups. Catenibacterium was enriched in subjects from the island area. Catenibacterium showed a negative correlation with rectal temperature and a positive correlation with the dietary level of animal fat. In contrast, Butyricimonas was enriched in the inland subjects. A positive correlation was found between Butyricimonas and mean arterial pressure. 
    CONCLUSIONS: This study identified differences in the gut microbiota composition between elderly women from different parts of South Korea, and our findings suggest that further studies of the human gut microbiota should evaluate aspects of the living environment.

    Dirichlet Multinomial Mixtures: Generative Models for Microbial Metagenomics

    We introduce Dirichlet multinomial mixtures (DMM) for the probabilistic modelling of microbial metagenomics data. This data can be represented as a frequency matrix giving the number of times each taxon is observed in each sample. The samples have different sizes, and the matrix is sparse, as communities are diverse and skewed to rare taxa. Most methods used previously to classify or cluster samples have ignored these features. We describe each community by a vector of taxa probabilities. These vectors are generated from one of a finite number of Dirichlet mixture components, each with different hyperparameters. Observed samples are generated through multinomial sampling. The mixture components cluster communities into distinct ‘metacommunities’, and, hence, determine envirotypes or enterotypes, groups of communities with a similar composition. The model can also deduce the impact of a treatment and be used for classification. We wrote software for the fitting of DMM models using the ‘evidence framework’ (http://code.google.com/p/microbedmm/). This includes the Laplace approximation of the model evidence. We applied the DMM model to human gut microbe genera frequencies from Obese and Lean twins. From the model evidence four clusters fit this data best. Two clusters were dominated by Bacteroides and were homogeneous; two had a more variable community composition. We could not find a significant impact of body mass on community structure. However, Obese twins were more likely to derive from the high variance clusters. We propose that obesity is not associated with a distinct microbiota but increases the chance that an individual derives from a disturbed enterotype. This is an example of the ‘Anna Karenina principle (AKP)’ applied to microbial communities: disturbed states having many more configurations than undisturbed. We verify this by showing that in a study of inflammatory bowel disease (IBD) phenotypes, ileal Crohn's disease (ICD) is associated with a more variable community.
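The generative model in the DMM abstract (Dirichlet-distributed taxa probabilities followed by multinomial sampling) gives a closed-form likelihood for a sample's counts once the probabilities are integrated out. The Python sketch below computes that standard Dirichlet-multinomial log-likelihood and uses it to assign a sample to mixture components. It is not the authors' software: the function names, the example hyperparameters and the equal mixture weights are assumptions, and fitting the hyperparameters (for example with the evidence framework mentioned in the abstract) is not shown.

```python
import math


def dm_log_likelihood(counts, alpha):
    """Log-probability of taxa counts under a Dirichlet-multinomial.

    counts: observed count per taxon for one sample.
    alpha:  Dirichlet hyperparameters of one mixture component (all > 0).
    """
    n = sum(counts)
    a = sum(alpha)
    # Multinomial coefficient numerator and the Gamma(A) / Gamma(N + A) term.
    ll = math.lgamma(n + 1) + math.lgamma(a) - math.lgamma(n + a)
    for x, al in zip(counts, alpha):
        ll += math.lgamma(x + al) - math.lgamma(al) - math.lgamma(x + 1)
    return ll


def responsibilities(counts, alphas, weights):
    """Posterior probability that a sample came from each mixture component."""
    logs = [
        math.log(w) + dm_log_likelihood(counts, al)
        for w, al in zip(weights, alphas)
    ]
    top = max(logs)  # subtract the maximum for numerical stability
    unnorm = [math.exp(v - top) for v in logs]
    total = sum(unnorm)
    return [u / total for u in unnorm]


# Two hypothetical components: one dominated by taxon 0, one by taxon 1.
alphas = [[10.0, 1.0], [1.0, 10.0]]
weights = [0.5, 0.5]
post = responsibilities([9, 1], alphas, weights)
```

Because the likelihood depends on the total count as well as the proportions, samples of different sizes are handled without rarefying, which matches the motivation given in the abstract.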