Search CORE

18 research outputs found

A Graphical Model for Fusing Diverse Microbiome Data

Author: Aktukmak Mehmet
Chevrette Marc G.
Handelsman Jo
Hero Alfred
Magesh Shruthi
Nepper Julia
Zhu Haonan
Publication date: 26/12/2022

This paper develops a Bayesian graphical model for fusing disparate types of count data. The motivating application is the study of bacterial communities from diverse high-dimensional features, in this case transcripts, collected from different treatments. In such datasets, there are no explicit correspondences between the communities and each corresponds to different factors, making data fusion challenging. We introduce a flexible multinomial-Gaussian generative model for jointly modeling such count data. This latent variable model jointly characterizes the observed data through a common multivariate Gaussian latent space that parameterizes the set of multinomial probabilities of the transcriptome counts. The covariance matrix of the latent variables induces a covariance matrix of co-dependencies between all the transcripts, effectively fusing multiple data sources. We present a computationally scalable variational Expectation-Maximization (EM) algorithm for inferring the latent variables and the parameters of the model. The inferred latent variables provide a common dimensionality reduction for visualizing the data and the inferred parameters provide a predictive posterior distribution. In addition to simulation studies that demonstrate the variational EM procedure, we apply our model to a bacterial microbiome dataset.

arXiv.org e-Print Archive

antiSMASH 4.0—improvements in chemistry prediction and gene cluster boundary identification

Author: Blin Kai
Breitling Rainer
Chevrette Marc G.
Dickschat Jeroen S.
Emmanuel de los Santos L. C.
Kautsar Satria A.
Kim Hyun Uk
Lee Sang Yup
Lu Xiaowen
Medema Marnix H.
Mitchell Douglas A.
Nave Mariana
Schwalen Christopher J.
Shelest Ekaterina
Suarez Duran Hernando G.
Takano Eriko
Weber Tilmann
Wolf Thomas
Publication venue: Oxford University Press (OUP)
Publication date: 01/01/2017

Many antibiotics, chemotherapeutics, crop protection agents and food preservatives originate from molecules produced by bacteria, fungi or plants. In recent years, genome mining methodologies have been widely adopted to identify and characterize the biosynthetic gene clusters encoding the production of such compounds. Since 2011, the ‘antibiotics and secondary metabolite analysis shell—antiSMASH’ has assisted researchers in efficiently performing this, both as a web server and a standalone tool. Here, we present the thoroughly updated antiSMASH version 4, which adds several novel features, including prediction of gene cluster boundaries using the ClusterFinder method or the newly integrated CASSIS algorithm, improved substrate specificity prediction for non-ribosomal peptide synthetase adenylation domains based on the new SANDPUMA algorithm, improved predictions for terpene and ribosomally synthesized and post-translationally modified peptides cluster products, reporting of sequence similarity to proteins encoded in experimentally characterized gene clusters on a per-protein basis and a domain-level alignment tool for comparative analysis of trans-AT polyketide synthase assembly line architectures. Additionally, several usability features have been updated and improved. Together, these improvements make antiSMASH up-to-date with the latest developments in natural product research and will further facilitate computational genome mining for the discovery of novel bioactive molecules

Crossref

Wageningen University & Research Publications

Portsmouth University Research Portal (Pure)

Warwick Research Archives Portal Repository

The University of Manchester - Institutional Repository

Online Research Database In Technology

University of Queensland eSpace

MIBiG 3.0 : a community-driven effort to annotate experimentally validated biosynthetic gene clusters

Author: Aguilar César
Al-Salihi Suhad A.A.
Alanjary Mohammad
Aleti Gajender
Augustijn Hannah E.
Avalon Nicole E.
Avelar-Rivas J. Abraham
Avitia-Domínguez Luis A.
Balaya Rex Devasahayam Arokia
Barona-Gómez Francisco
Bernaldo-Agüero Jordan
Bielinski Vincent A.
Biermann Friederike
Blin Kai
Booth Thomas J.
Carrion Bravo Victor J.
Castelo-Branco Raquel
Chagas Fernanda O.
Chevrette Marc G.
Collemare Jérôme
Cruz-Morales Pablo
Du Chao
Duncan Katherine R.
Egbert Susan
Gavriilidou Athina
Gayrard Damien
Gutiérrez-García Karina
Haslinger Kristina
Helfrich Eric J.N.
Jati Afif P.
Kalkreuter Edward
Kalyvas Nikolaos
Kang Kyo B.
Kautsar Satria
Kim Wonyong
Kunjapur Aditya M.
Lee Sanghoon
Li Yong-Xin
Lin Geng-Min
Linington Roger G.
Loureiro Catarina
Louwen Joris J.R.
Louwen Nico L.L.
Lund George
Medema Marnix H.
Meijer David
Navarro-Muñoz Jorge C.
Parra Jonathan
Philmus Benjamin
Pourmohsenin Bita
Pronk Lotte J.U.
Recchia Michael J.J.
Rego Adriana
Reitz Zachary L.
Robinson Serina
Rosas-Becerra L. Rodrigo
Roxborough Eve T.
Schorn Michelle A.
Scobie Darren J.
Selem-Mojica Nelly
Singh Kumar Saurabh
Sokolova Nika
Tang Xiaoyu
Terlouw Barbara R.
Tørring Thomas
Udwary Daniel
van der Hooft Justin J.J.
van Santen Jeffrey A.
Vigneshwari Aruna
Vind Kristiina
Vromans Sophie P.J.M.
Waschulin Valentin
Weber Tilmann
Williams Sam E.
Winter Jaclyn M.
Witte Thomas E.
Xie Huali
Yang Dong
Yu Jingwei
Zaroubi Liana
Zdouc Mitja
Zhong Zheng
Publication date: 18/11/2022

With an ever-increasing amount of (meta)genomic data being deposited in sequence databases, (meta)genome mining for natural product biosynthetic pathways occupies a critical role in the discovery of novel pharmaceutical drugs, crop protection agents and biomaterials. The genes that encode these pathways are often organised into biosynthetic gene clusters (BGCs). In 2015, we defined the Minimum Information about a Biosynthetic Gene cluster (MIBiG): a standardised data format that describes the minimally required information to uniquely characterise a BGC. We simultaneously constructed an accompanying online database of BGCs, which has since been widely used by the community as a reference dataset for BGCs and was expanded to 2021 entries in 2019 (MIBiG 2.0). Here, we describe MIBiG 3.0, a database update comprising large-scale validation and re-annotation of existing entries and 661 new entries. Particular attention was paid to the annotation of compound structures and biological activities, as well as protein domain selectivities. Together, these new features keep the database up-to-date, and will provide new opportunities for the scientific community to use its freely available data, e.g. for the training of new machine learning models to predict sequence-structure-function relationships for diverse natural products. MIBiG 3.0 is accessible online at https://mibig.secondarymetabolites.org/

University of Strathclyde Institutional Repository

ZENODO

eScholarship - University of California

Warwick Research Archives Portal Repository

Online Research Database In Technology

Explore Bristol Research

Taxonomic and Metabolic Incongruence in the Ancient Genus Streptomyces.

Author: Chevrette Marc G
Publication date: 07/06/2021

Ezid

On the evolution of natural product biosynthesis

Author: Barona-Gómez Francisco
Chevrette Marc G
Hoskisson Paul A
Publication venue: Elsevier BV
Publication date: 19/04/2023

Natural products are the raw material for drug discovery programmes. Bioactive natural products are used extensively in medicine and agriculture and have found utility as antibiotics, immunosuppressives, anti-cancer drugs and anthelminthics. Remarkably, the natural role of these molecules and the mechanisms that drive their evolution are relatively poorly understood. The exponential increase in genome and chemical data in recent years, coupled with technical advances in bioinformatics and genetics, has enabled progress in understanding the evolution of biosynthetic gene clusters and the products of their enzymatic machinery. Here we discuss the diversity of natural products, incorporating the mechanisms that govern evolution of metabolic pathways and how this can be applied to biosynthetic gene clusters. We build on the nomenclature of natural products in terms of primary, integrated, secondary and specialised metabolism and place this within an ecology-evolutionary-developmental biology framework. We believe this eco-evo-devo framework will help to clarify the nature and use of the term specialised metabolites in the future.

University of Strathclyde Institutional Repository

SANDPUMA: ensemble predictions of nonribosomal peptide chemistry reveal biosynthetic diversity across Actinobacteria

Author: Aicheler Fabian
Chevrette Marc G.
Currie Cameron R.
Kohlbacher Oliver
Medema M.H.
Publication date: 01/01/2017

Nonribosomally synthesized peptides (NRPs) are natural products with widespread applications in medicine and biotechnology. Many algorithms have been developed to predict the substrate specificities of nonribosomal peptide synthetase adenylation (A) domains from DNA sequences, which enables prioritization and dereplication, and integration with other data types in discovery efforts. However, insufficient training data and a lack of clarity regarding prediction quality have impeded optimal use. Here, we introduce prediCAT, a new phylogenetics-inspired algorithm, which quantitatively estimates the degree of predictability of each A-domain. We then systematically benchmarked all algorithms on a newly gathered, independent test set of 434 A-domain sequences, showing that active-site-motif-based algorithms outperform whole-domain-based methods. Subsequently, we developed SANDPUMA, a powerful ensemble algorithm, based on newly trained versions of all high-performing algorithms, which significantly outperforms individual methods. Finally, we deployed SANDPUMA in a systematic investigation of 7635 Actinobacteria genomes, suggesting that NRP chemical diversity is much higher than previously estimated. SANDPUMA has been integrated into the widely used antiSMASH biosynthetic gene cluster analysis pipeline and is also available as an open-source, standalone tool

Wageningen University & Research Publications

MPG.PuRe


Taxonomic and Metabolic Incongruence in the Ancient Genus Streptomyces.

Author: Bowen Benjamin P
Carlos-Shanley Camila
Chevrette Marc G
Currie Cameron R
Louie Katherine B
Northen Trent R
Publication venue: eScholarship, University of California
Publication date: 01/01/2019

The advent of culture independent approaches has greatly facilitated insights into the vast diversity of bacteria and the ecological importance they hold in nature and human health. Recently, metagenomic surveys and other culture-independent methods have begun to describe the distribution and diversity of microbial metabolism across environmental conditions, often using 16S rRNA gene as a marker to group bacteria into taxonomic units. However, the extent to which similarity at the conserved ribosomal 16S gene correlates with different measures of phylogeny, metabolic diversity, and ecologically relevant gene content remains contentious. Here, we examine the relationship between 16S identity, core genome divergence, and metabolic gene content across the ancient and ecologically important genus Streptomyces. We assessed and quantified the high variability of average nucleotide identity (ANI) and ortholog presence/absence within Streptomyces, even in strains identical by 16S. Furthermore, we identified key differences in shared ecologically important characters, such as antibiotic resistance, carbohydrate metabolism, biosynthetic gene clusters (BGCs), and other metabolic hallmarks, within 16S identities commonly treated as the same operational taxonomic units (OTUs). Differences between common phylogenetic measures and metabolite-gene annotations confirmed this incongruence. Our results highlight the metabolic diversity and variability within OTUs and add to the growing body of work suggesting 16S-based studies of Streptomyces fail to resolve important ecological and metabolic characteristics

eScholarship - University of California

Biosynthesis and function of 7-deazaguanine derivatives in bacteria and phages

Author: Bruner Steven
Cediel-Becerra José DD
Chevrette Marc G
de Crécy-Lagard Valérie
Hutinet Geoffrey
Jaroch Marshall
Quaiyum Samia
Ratnayake RM Madhushi N
Yuan Yifeng
Zallot Rémi
Publication venue: American Society for Microbiology
Publication date: 29/02/2024

Deazaguanine modifications play multifaceted roles in the molecular biology of DNA and tRNA, shaping diverse yet essential biological processes, including the nuanced fine-tuning of translation efficiency and the intricate modulation of codon-anticodon interactions. Beyond their roles in translation, deazaguanine modifications contribute to cellular stress resistance, self-nonself discrimination mechanisms, and host evasion defenses, directly modulating the adaptability of living organisms. Deazaguanine moieties extend beyond nucleic acid modifications, manifesting in the structural diversity of biologically active natural products. Their roles in fundamental cellular processes and their presence in biologically active natural products underscore their versatility and pivotal contributions to the intricate web of molecular interactions within living organisms. Here, we discuss the current understanding of the biosynthesis and multifaceted functions of deazaguanines, shedding light on their diverse and dynamic roles in the molecular landscape of life

E-space: Manchester Metropolitan University's Research Repository

Evolution of combinatorial diversity in trans-acyltransferase polyketide synthase assembly lines across bacteria

Author: Burch Adrien Y.
Chevrette Marc G.
Helfrich Eric J.N.
Hemmerling Franziska
Leopold-Messer Stefan
Lindow Steven E.
Lu Xiaowen
Medema Marnix H.
Minas Hannah A.
Piel Jörn
Ueoka Reiko
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2021

Trans-acyltransferase polyketide synthases (trans-AT PKSs) are bacterial multimodular enzymes that biosynthesize diverse pharmaceutically and ecologically important polyketides. A notable feature of this natural product class is the existence of chemical hybrids that combine core moieties from different polyketide structures. To understand the prevalence, biosynthetic basis, and evolutionary patterns of this phenomenon, we developed transPACT, a phylogenomic algorithm to automate global classification of trans-AT PKS modules across bacteria and applied it to 1782 trans-AT PKS gene clusters. These analyses reveal widespread exchange patterns suggesting recombination of extended PKS module series as an important mechanism for metabolic diversification in this natural product class. For three plant-associated bacteria, i.e., the root colonizer Gynuella sunshinyii and the pathogens Xanthomonas cannabis and Pseudomonas syringae, we demonstrate the utility of this computational approach for uncovering cryptic relationships between polyketides, accelerating polyketide mining from fragmented genome sequences, and discovering polyketide variants with conserved moieties of interest. As natural combinatorial hybrids are rare among the more commonly studied cis-AT PKSs, this study paves the way towards evolutionarily informed, rational PKS engineering to produce chimeric trans-AT PKS-derived polyketides. ISSN:2041-172

Repository for Publications and Research Data

Bacillimidazoles A−F, Imidazolium-Containing Compounds Isolated from a Marine Bacillus

Author: Cameron R. Currie
Doug R. Braun
Eric J. N. Helfrich
Gene E. Ananiev
Heino Heyman
Jia-Xuan Yan
Jon Clardy
Marc G. Chevrette
Qihao Wu
Scott R. Rajski
Tim S. Bugni
Publication venue: MDPI AG
Publication date: 01/01/2022

Chemical investigations of a marine sponge-associated Bacillus revealed six new imidazolium-containing compounds, bacillimidazoles A–F (1–6). Previous reports of related imidazolium-containing natural products are rare. Initially unveiled by timsTOF (trapped ion mobility spectrometry) MS data, extensive HRMS and 1D and 2D NMR analyses enabled the structural elucidation of 1–6. In addition, a plausible biosynthetic pathway to bacillimidazoles is proposed based on isotopic labeling experiments and invokes the highly reactive glycolytic adduct 2,3-butanedione. Combined, the results of structure elucidation efforts, isotopic labeling studies and bioinformatics suggest that 1–6 result from a fascinating intersection of primary and secondary metabolic pathways in Bacillus sp. WMMC1349. Antimicrobial assays revealed that, of 1–6, only compound six displayed discernible antibacterial activity, despite the close structural similarities shared by all six natural products

Multidisciplinary Digital Publishing Institute

Directory of Open Access Journals

PubMed Central