2,065 research outputs found
Recommended from our members
Reprograming of gut microbiome energy metabolism by the FUT2 Crohn's disease risk polymorphism.
Fucosyltransferase 2 (FUT2) is an enzyme that is responsible for the synthesis of the H antigen in body fluids and on the intestinal mucosa. The H antigen is an oligosaccharide moiety that acts as both an attachment site and carbon source for intestinal bacteria. Non-secretors, who are homozygous for the loss-of-function alleles of FUT2 gene (sese), have increased susceptibility to Crohn's disease (CD). To characterize the effect of FUT2 polymorphism on the mucosal ecosystem, we profiled the microbiome, meta-proteome and meta-metabolome of 75 endoscopic lavage samples from the cecum and sigmoid of 39 healthy subjects (12 SeSe, 18 Sese and 9 sese). Imputed metagenomic analysis revealed perturbations of energy metabolism in the microbiome of non-secretor and heterozygote individuals, notably the enrichment of carbohydrate and lipid metabolism, cofactor and vitamin metabolism and glycan biosynthesis and metabolism-related pathways, and the depletion of amino-acid biosynthesis and metabolism. Similar changes were observed in mice bearing the FUT2(-/-) genotype. Metabolomic analysis of human specimens revealed concordant as well as novel changes in the levels of several metabolites. Human metaproteomic analysis indicated that these functional changes were accompanied by sub-clinical levels of inflammation in the local intestinal mucosa. Therefore, the colonic microbiota of non-secretors is altered at both the compositional and functional levels, affecting the host mucosal state and potentially explaining the association of FUT2 genotype and CD susceptibility
Mumame: A software tool for quantifying gene-specific point-mutations in shotgun metagenomic data
Metagenomics has emerged as a central technique for studying the structure and function of microbial communities. Often the functional analysis is restricted to classification into broad functional categories. However, important phenotypic differences, such as resistance to antibiotics, are often the result of just one or a few point mutations in otherwise identical sequences. Bioinformatic methods for metagenomic analysis have generally been poor at accounting for this fact, resulting in a somewhat limited picture of important aspects of microbial communities. Here, we address this problem by providing a software tool called Mumame, which can distinguish between wildtype and mutated sequences in shotgun metagenomic data and quantify their relative abundances. We demonstrate the utility of the tool by quantifying antibiotic resistance mutations in several publicly available metagenomic data sets. We also identified that sequencing depth is a key factor to detect rare mutations. Therefore, much larger numbers of sequences may be required for reliable detection of mutations than for most other applications of shotgun metagenomics. Mumame is freely available online (http://microbiology.se/software/mumame)
Recommended from our members
Multi-omics of the gut microbial ecosystem in inflammatory bowel diseases.
Inflammatory bowel diseases, which include Crohn's disease and ulcerative colitis, affect several million individuals worldwide. Crohn's disease and ulcerative colitis are complex diseases that are heterogeneous at the clinical, immunological, molecular, genetic, and microbial levels. Individual contributing factors have been the focus of extensive research. As part of the Integrative Human Microbiome Project (HMP2 or iHMP), we followed 132 subjects for one year each to generate integrated longitudinal molecular profiles of host and microbial activity during disease (up to 24 time points each; in total 2,965 stool, biopsy, and blood specimens). Here we present the results, which provide a comprehensive view of functional dysbiosis in the gut microbiome during inflammatory bowel disease activity. We demonstrate a characteristic increase in facultative anaerobes at the expense of obligate anaerobes, as well as molecular disruptions in microbial transcription (for example, among clostridia), metabolite pools (acylcarnitines, bile acids, and short-chain fatty acids), and levels of antibodies in host serum. Periods of disease activity were also marked by increases in temporal variability, with characteristic taxonomic, functional, and biochemical shifts. Finally, integrative analysis identified microbial, biochemical, and host factors central to this dysregulation. The study's infrastructure resources, results, and data, which are available through the Inflammatory Bowel Disease Multi'omics Database ( http://ibdmdb.org ), provide the most comprehensive description to date of host and microbial activities in inflammatory bowel diseases
Comparison of normalization methods for the analysis of metagenomic gene abundance data
Background: In shotgun metagenomics, microbial communities are studied through direct sequencing of DNA without any prior cultivation. By comparing gene abundances estimated from the generated sequencing reads, functional differences between the communities can be identified. However, gene abundance data is affected by high levels of systematic variability, which can greatly reduce the statistical power and introduce false positives. Normalization, which is the process where systematic variability is identified and removed, is therefore a vital part of the data analysis. A wide range of normalization methods for high-dimensional count data has been proposed but their performance on the analysis of shotgun metagenomic data has not been evaluated. Results: Here, we present a systematic evaluation of nine normalization methods for gene abundance data. The methods were evaluated through resampling of three comprehensive datasets, creating a realistic setting that preserved the unique characteristics of metagenomic data. Performance was measured in terms of the methods ability to identify differentially abundant genes (DAGs), correctly calculate unbiased p-values and control the false discovery rate (FDR). Our results showed that the choice of normalization method has a large impact on the end results. When the DAGs were asymmetrically present between the experimental conditions, many normalization methods had a reduced true positive rate (TPR) and a high false positive rate (FPR). The methods trimmed mean of M-values (TMM) and relative log expression (RLE) had the overall highest performance and are therefore recommended for the analysis of gene abundance data. For larger sample sizes, CSS also showed satisfactory performance. Conclusions: This study emphasizes the importance of selecting a suitable normalization methods in the analysis of data from shotgun metagenomics. Our results also demonstrate that improper methods may result in unacceptably high levels of false positives, which in turn may lead to incorrect or obfuscated biological interpretation
Diverse secondary metabolites are expressed in particle-associated and free-living microorganisms of the permanently anoxic Cariaco Basin
Secondary metabolites play essential roles in ecological interactions and nutrient acquisition, and are of interest for their potential uses in medicine and biotechnology. Genome mining for biosynthetic gene clusters (BGCs) can be used for the discovery of new compounds. Here, we use metagenomics and metatranscriptomics to analyze BGCs in free-living and particle-associated microbial communities through the stratified water column of the Cariaco Basin, Venezuela. We recovered 565 bacterial and archaeal metagenome-assembled genomes (MAGs) and identified 1154 diverse BGCs. We show that differences in water redox potential and microbial lifestyle (particle-associated vs. free-living) are associated with variations in the predicted composition and production of secondary metabolites. Our results indicate that microbes, including understudied clades such as Planctomycetota, potentially produce a wide range of secondary metabolites in these anoxic/euxinic waters
Novel Methods for Metagenomic Analysis
By sampling the genetic content of microbes at the nucleotide level, metagenomics
has rapidly established itself as the standard in characterizing the taxonomic diversity
and functional capacity of microbial populations throughout nature. The decreasing
cost of sequencing technologies and the simultaneous increase of throughput per run
has given scientists the ability to deeply sample highly diverse communities on a
reasonable budget. The Human Microbiome Project is representative of the flood of
sequence data that will arrive in the coming years. Despite these advancements, there
remains the significant challenge of analyzing massive metagenomic datasets to make
appropriate biological conclusions. This dissertation is a collection of novel methods
developed for improved analysis of metagenomic data: (1) We begin with Figaro, a
statistical algorithm that quickly and accurately infers and trims vector sequence from
large Sanger-based read sets without prior knowledge of the vector used in library
construction. (2) Next, we perform a rigorous evaluation of methodologies used to cluster environmental 16S rRNA sequences into species-level operational taxonomic
units, and discover that many published studies utilize highly stringent parameters,
resulting in overestimation of microbial diversity. (3) To assist in comparative
metagenomics studies, we have created Metastats, a robust statistical methodology for
comparing large-scale clinical datasets with up to thousands of subjects. Given a
collection of annotated metagenomic features (e.g. taxa, COGs, or pathways),
Metastats determines which features are differentially abundant between two
populations. (4) Finally, we report on a new methodology that employs the
generalized Lotka-Volterra model to infer microbe-microbe interactions from
longitudinal 16S rRNA data. It is our hope that these methods will enhance standard
metagenomic analysis techniques to provide better insight into the human
microbiome and microbial communities throughout our world. To assist
metagenomics researchers and those developing methods, all software described in
this thesis is open-source and available online
Measuring the human gut microbiome: new tools and non alcoholic fatty liver disease
With the advent of next generation DNA and RNA sequencing, scientists can obtain a more comprehensive snapshot of the bacterial communities on the human body (known as the `human microbiome\u27), leading to information about the bacterial composition, what genes are present, and what proteins are produced. The scientific community is in a phase of developing the experiments and accompanying statistical techniques to investigate the mechanisms by which the human microbiome affects health and disease. In this thesis, I explore alternatives to the standard weighted and unweighted UniFrac distance metric that measure the difference between microbiome samples. These alternative weightings allow for the extraction of subtle differences between samples and identification of outliers not visible with traditional methods. I also apply next generation DNA sequencing and computational analysis techniques to gut microbiome data from a nonalcoholic fatty liver disease cohort to examine the potential role of the microbiota in this condition
Sample preservation and storage significantly impact taxonomic and functional profiles in metaproteomics studies of the human gut microbiome
With the technological advances of the last decade, it is now feasible to analyze microbiome samples, such as human stool specimens, using multi-omic techniques. Given the inherent sample complexity, there exists a need for sample methods which preserve as much information as possible about the biological system at the time of sampling. Here, we analyzed human stool samples preserved and stored using different methods, applying metagenomics as well as metaproteomics. Our results demonstrate that sample preservation and storage have a significant effect on the taxonomic composition of identified proteins. The overall identification rates, as well as the proportion of proteins from were much higher when samples were flash frozen. Preservation in RNAlater overall led to fewer protein identifications and a considerable increase in the share of , as well as . Additionally, a decrease in the share of metabolism-related proteins and an increase of the relative amount of proteins involved in the processing of genetic information was observed for RNAlater-stored samples. This suggests that great care should be taken in choosing methods for the preservation and storage of microbiome samples, as well as in comparing the results of analyses using different sampling and storage methods. Flash freezing and subsequent storage at -80 °C should be chosen wherever possible
Sample Preservation and Storage Significantly Impact Taxonomic and Functional Profiles in Metaproteomics Studies of the Human Gut Microbiome
With the technological advances of the last decade, it is now feasible to analyze microbiome samples, such as human stool specimens, using multi-omic techniques. Given the inherent sample complexity, there exists a need for sample methods which preserve as much information as possible about the biological system at the time of sampling. Here, we analyzed human stool samples preserved and stored using different methods, applying metagenomics as well as metaproteomics. Our results demonstrate that sample preservation and storage have a significant effect on the taxonomic composition of identified proteins. The overall identification rates, as well as the proportion of proteins from Actinobacteria were much higher when samples were flash frozen. Preservation in RNAlater overall led to fewer protein identifications and a considerable increase in the share of Bacteroidetes, as well as Proteobacteria. Additionally, a decrease in the share of metabolism-related proteins and an increase of the relative amount of proteins involved in the processing of genetic information was observed for RNAlater-stored samples. This suggests that great care should be taken in choosing methods for the preservation and storage of microbiome samples, as well as in comparing the results of analyses using different sampling and storage methods. Flash freezing and subsequent storage at −80 °C should be chosen wherever possible
- …