29 research outputs found

    HypoRiPPAtlas as an Atlas of hypothetical natural products for mass spectrometry database search

    Get PDF
    Recent analyses of public microbial genomes have found over a million biosynthetic gene clusters, the natural products of the majority of which remain unknown. Additionally, GNPS harbors billions of mass spectra of natural products without known structures and biosynthetic genes. We bridge the gap between large-scale genome mining and mass spectral datasets for natural product discovery by developing HypoRiPPAtlas, an Atlas of hypothetical natural product structures, which is ready-to-use for in silico database search of tandem mass spectra. HypoRiPPAtlas is constructed by mining genomes using seq2ripp, a machine-learning tool for the prediction of ribosomally synthesized and post-translationally modified peptides (RiPPs). In HypoRiPPAtlas, we identify RiPPs in microbes and plants. HypoRiPPAtlas could be extended to other natural product classes in the future by implementing corresponding biosynthetic logic. This study paves the way for large-scale explorations of biosynthetic pathways and chemical structures of microbial and plant RiPP classes

    Integrating genomics and metabolomics for scalable non-ribosomal peptide discovery.

    Get PDF
    Non-Ribosomal Peptides (NRPs) represent a biomedically important class of natural products that include a multitude of antibiotics and other clinically used drugs. NRPs are not directly encoded in the genome but are instead produced by metabolic pathways encoded by biosynthetic gene clusters (BGCs). Since the existing genome mining tools predict many putative NRPs synthesized by a given BGC, it remains unclear which of these putative NRPs are correct and how to identify post-assembly modifications of amino acids in these NRPs in a blind mode, without knowing which modifications exist in the sample. To address this challenge, here we report NRPminer, a modification-tolerant tool for NRP discovery from large (meta)genomic and mass spectrometry datasets. We show that NRPminer is able to identify many NRPs from different environments, including four previously unreported NRP families from soil-associated microbes and NRPs from human microbiota. Furthermore, in this work we demonstrate the anti-parasitic activities and the structure of two of these NRP families using direct bioactivity screening and nuclear magnetic resonance spectrometry, illustrating the power of NRPminer for discovering bioactive NRPs

    American Gut: An Open Platform For Citizen Science Microbiome Research

    Get PDF
    Copyright © 2018 McDonald et al. Although much work has linked the human microbiome to specific phenotypes and lifestyle variables, data from different projects have been challenging to integrate and the extent of microbial and molecular diversity in human stool remains unknown. Using standardized protocols from the Earth Microbiome Project and sample contributions from over 10,000 citizen-scientists, together with an open research network, we compare human microbiome specimens primarily from the United States, United Kingdom, and Australia to one another and to environmental samples. Our results show an unexpected range of beta-diversity in human stool microbiomes compared to environmental samples; demonstrate the utility of procedures for removing the effects of overgrowth during room-temperature shipping for revealing phenotype correlations; uncover new molecules and kinds of molecular communities in the human stool metabolome; and examine emergent associations among the microbiome, metabolome, and the diversity of plants that are consumed (rather than relying on reductive categorical variables such as veganism, which have little or no explanatory power). We also demonstrate the utility of the living data resource and cross-cohort comparison to confirm existing associations between the microbiome and psychiatric illness and to reveal the extent of microbiome change within one individual during surgery, providing a paradigm for open microbiome research and education. IMPORTANCE We show that a citizen science, self-selected cohort shipping samples through the mail at room temperature recaptures many known microbiome results from clinically collected cohorts and reveals new ones. Of particular interest is integrating n = 1 study data with the population data, showing that the extent of microbiome change after events such as surgery can exceed differences between distinct environmental biomes, and the effect of diverse plants in the diet, which we confirm with untargeted metabolomics on hundreds of samples

    American Gut: an Open Platform for Citizen Science Microbiome Research

    Get PDF
    McDonald D, Hyde E, Debelius JW, et al. American Gut: an Open Platform for Citizen Science Microbiome Research. mSystems. 2018;3(3):e00031-18

    Computational aspects of dna self-assembly systems at temperature 1

    Get PDF
    In this thesis, we investigate the computational power of some variants of Winfree\u27s abstract Tile Assembly Model (aTAM) at Temperature 1 [43]. Although aTAM at temperatures higher than 1 are proved to be Turing Universal, i.e. they can simulate an arbitrary Turing Machine [43], the computational power of aTAM at temperature 1 is still an open question. It is known that some modifications of aTAM are indeed Turing Universal at temperature 1 [11, 30]. In this thesis, we first show that two variants of aTAM, namely the Staged Tile Assembly Model and Step-wise Tile Assembly Model at Temperature 1, are also Turing Universal. Next, we discuss the computational power of the self-assembly with triangular tiles and hexagonal tiles, respectively. We prove that these models can simulate arbitrary systems under aTAM and vice versa, and consequently, they have the same computational power as aTAM

    RNA-seq data and transcriptome assembly results.

    No full text
    <p>Sequences were generated using 75 bp paired end reads.</p><p>RNA-seq data and transcriptome assembly results.</p

    Pathway analysis for liver transcripts from <i>R. catesbeiana</i> (CAT) and <i>X. laevis</i> (LAE).

    No full text
    <p>Top 25 impacted pathways after TH treatment for <i>R. catesbeiana</i> ranked by the highest proportion of overall observed genes. The pathway names are indicated in the center of the figure with the total number of genes known in each IGA pathway indicated. The asterisk indicates those pathways that are found in the top 25 list of <i>X. laevis</i>. The colour coded bar plots illustrate the percentage of the total number of gene transcripts in a pathway that are downregulated (blue), non-responsive (yellow), upregulated (red) or not observed in the experiment (gray) relative to the control condition. Differentially expressed transcripts were determined using a p-value threshold of 5%.</p
    corecore