54 research outputs found

    Ms2lda.org: web-based topic modelling for substructure discovery in mass spectrometry

    Get PDF
    Motivation: We recently published MS2LDA, a method for the decomposition of sets of molecular fragment data derived from large metabolomics experiments. To make the method more widely available to the community, here we present ms2lda.org, a web application that allows users to upload their data, run MS2LDA analyses and explore the results through interactive visualisations. Results: Ms2lda.org takes tandem mass spectrometry data in many standard formats and allows the user to infer the sets of fragment and neutral loss features that co-occur together (Mass2Motifs). As an alternative workflow, the user can also decompose a dataset onto predefined Mass2Motifs. This is accomplished through the web interface or programmatically from our web service

    In silico optimization of mass spectrometry fragmentation strategies in metabolomics

    Get PDF
    Liquid chromatography (LC) coupled to tandem mass spectrometry (MS/MS) is widely used in identifying small molecules in untargeted metabolomics. Various strategies exist to acquire MS/MS fragmentation spectra; however, the development of new acquisition strategies is hampered by the lack of simulators that let researchers prototype, compare, and optimize strategies before validations on real machines. We introduce Virtual Metabolomics Mass Spectrometer (ViMMS), a metabolomics LC-MS/MS simulator framework that allows for scan-level control of the MS2 acquisition process in silico. ViMMS can generate new LC-MS/MS data based on empirical data or virtually re-run a previous LC-MS/MS analysis using pre-existing data to allow the testing of different fragmentation strategies. To demonstrate its utility, we show how ViMMS can be used to optimize N for Top-N data-dependent acquisition (DDA) acquisition, giving results comparable to modifying N on the mass spectrometer. We expect that ViMMS will save method development time by allowing for offline evaluation of novel fragmentation strategies and optimization of the fragmentation strategy for a particular experiment

    Deciphering complex metabolite mixtures by unsupervised and supervised substructure discovery and semi-automated annotation from MS/MS spectra

    Get PDF
    Complex metabolite mixtures are challenging to unravel. Mass spectrometry (MS) is a widely used and sensitive technique to obtain structural information on complex mixtures. However, just knowing the molecular masses of the mixture’s constituents is almost always insufficient for confident assignment of the associated chemical structures. Structural information can be augmented through MS fragmentation experiments whereby detected metabolites are fragmented giving rise to MS/MS spectra. However, how can we maximize the structural information we gain from fragmentation spectra? We recently proposed a substructure-based strategy to enhance metabolite annotation for complex mixtures by considering metabolites as the sum of (bio)chemically relevant moieties that we can detect through mass spectrometry fragmentation approaches. Our MS2LDA tool allows us to discover - unsupervised - groups of mass fragments and/or neutral losses termed Mass2Motifs that often correspond to substructures. After manual annotation, these Mass2Motifs can be used in subsequent MS2LDA analyses of new datasets, thereby providing structural annotations for many molecules that are not present in spectral databases. Here, we describe how additional strategies, taking advantage of i) combinatorial in-silico matching of experimental mass features to substructures of candidate molecules, and ii) automated machine learning classification of molecules, can facilitate semi-automated annotation of substructures. We show how our approach accelerates the Mass2Motif annotation process and therefore broadens the chemical space spanned by characterized motifs. Our machine learning model used to classify fragmentation spectra learns the relationships between fragment spectra and chemical features. Classification prediction on these features can be aggregated for all molecules that contribute to a particular Mass2Motif and guide Mass2Motif annotations. To make annotated Mass2Motifs available to the community, we also present motifDB: an open database of Mass2Motifs that can be browsed and accessed programmatically through an Application Programming Interface (API). MotifDB is integrated within ms2lda.org, allowing users to efficiently search for characterized motifs in their own experiments. We expect that with an increasing number of Mass2Motif annotations available through a growing database we can more quickly gain insight in the constituents of complex mixtures. That will allow prioritization towards novel or unexpected chemistries and faster recognition of known biochemical building blocks

    Mass spectral molecular networking to profile the metabolome of biostimulant bacillus strains

    Get PDF
    Beneficial soil microbes like plant growth-promoting rhizobacteria (PGPR) significantly contribute to plant growth and development through various mechanisms activated by plant-PGPR interactions. However, a complete understanding of the biochemistry of the PGPR and microbial intraspecific interactions within the consortia is still enigmatic. Such complexities constrain the design and use of PGPR formulations for sustainable agriculture. Therefore, we report the application of mass spectrometry (MS)-based untargeted metabolomics and molecular networking (MN) to interrogate and profile the intracellular chemical space of PGPR Bacillus strains: B. laterosporus, B. amyloliquefaciens, B. licheniformis 1001, and B. licheniformis M017 and their consortium. The results revealed differential and diverse chemistries in the four Bacillus strains when grown separately, and also differing from when grown as a consortium. MolNetEnhancer networks revealed 11 differential molecular families that are comprised of lipids and lipid-like molecules, benzenoids, nucleotide-like molecules, and organic acids and derivatives. Consortium and B. amyloliquefaciens metabolite profiles were characterized by the high abundance of surfactins, whereas B. licheniformis strains were characterized by the unique presence of lichenysins. Thus, this work, applying metabolome mining tools, maps the microbial chemical space of isolates and their consortium, thus providing valuable insights into molecular information of microbial systems. Such fundamental knowledge is essential for the innovative design and use of PGPR-based biostimulants

    Automatic metabolite annotation in complex LC-MS(n ≥ 2) data using MAGMa

    Get PDF
    Poster presented at the Analytical Tools for Cutting-edge Metabolomics meeting in London, 30 April 201

    Comparative metabologenomics analysis of polar actinomycetes.

    Get PDF
    Biosynthetic and chemical datasets are the two major pillars for microbial drug discovery in the omics era. Despite the advancement of analysis tools and platforms for multi-strain metabolomics and genomics, linking these information sources remains a considerable bottleneck in strain prioritisation and natural product discovery. In this study, molecular networking of the 100 metabolite extracts derived from applying the OSMAC approach to 25 Polar bacterial strains, showed growth media specificity and potential chemical novelty was suggested. Moreover, the metabolite extracts were screened for antibacterial activity and promising selective bioactivity against drug-persistent pathogens such as Klebsiella pneumoniae and Acinetobacter baumannii was observed. Genome sequencing data were combined with metabolomics experiments in the recently developed computational approach, NPLinker, which was used to link BGC and molecular features to prioritise strains for further investigation based on biosynthetic and chemical information. Herein, we putatively identified the known metabolites ectoine and chrloramphenicol which, through NPLinker, were linked to their associated BGCs. The metabologenomics approach followed in this study can potentially be applied to any large microbial datasets for accelerating the discovery of new (bioactive) specialised metabolites

    Rapid development of improved data-dependent acquisition strategies

    Get PDF
    Tandem mass spectrometry (LC-MS/MS) is widely used to identify unknown ions in untargeted metabolomics. Data-dependent acquisition (DDA) chooses which ions to fragment based upon intensities observed in MS1 survey scans and typically only fragments a small subset of the ions present. Despite this inefficiency, relatively little work has addressed the development of new DDA methods, partly due to the high overhead associated with running the many extracts necessary to optimize approaches in busy MS facilities. In this work, we first provide theoretical results that show how much improvement is possible over current DDA strategies. We then describe an in silico framework for fast and cost-efficient development of new DDA strategies using a previously developed virtual metabolomics mass spectrometer (ViMMS). Additional functionality is added to ViMMS to allow methods to be used both in simulation and on real samples via an Instrument Application Programming Interface (IAPI). We demonstrate this framework through the development and optimization of two new DDA methods that introduce new advanced ion prioritization strategies. Upon application of these developed methods to two complex metabolite mixtures, our results show that they are able to fragment more unique ions than standard DDA strategies

    Advances in decomposing complex metabolite mixtures using substructure- and network-based computational metabolomics approaches

    Get PDF
    Covering: up to the end of 2020 Recently introduced computational metabolome mining tools have started to positively impact the chemical and biological interpretation of untargeted metabolomics analyses. We believe that these current advances make it possible to start decomposing complex metabolite mixtures into substructure and chemical class information, thereby supporting pivotal tasks in metabolomics analysis including metabolite annotation, the comparison of metabolic profiles, and network analyses. In this review, we highlight and explain key tools and emerging strategies covering 2015 up to the end of 2020. The majority of these tools aim at processing and analyzing liquid chromatography coupled to mass spectrometry fragmentation data. We start with defining what substructures are, how they relate to molecular fingerprints, and how recognizing them helps to decompose complex mixtures. We continue with chemical classes that are based on the presence or absence of particular molecular scaffolds and/or functional groups and are thus intrinsically related to substructures. We discuss novel tools to mine substructures, annotate chemical compound classes, and create mass spectral networks from metabolomics data and demonstrate them using two case studies. We also review and speculate about the opportunities that NMR spectroscopy-based metabolome mining of complex metabolite mixtures offers to discover substructures and chemical classes. Finally, we will describe the main benefits and limitations of the current tools and strategies that rely on them, and our vision on how this exciting field can develop toward repository-scale-sized metabolomics analyses. Complementary sources of structural information from genomics analyses and well-curated taxonomic records are also discussed. Many research fields such as natural products discovery, pharmacokinetic and drug metabolism studies, and environmental metabolomics increasingly rely on untargeted metabolomics to gain biochemical and biological insights. The here described technical advances will benefit all those metabolomics disciplines by transforming spectral data into knowledge that can answer biological questions

    Comprehensive mass spectrometry-guided phenotyping of plant specialized metabolites reveals metabolic diversity in the cosmopolitan plant family Rhamnaceae

    Get PDF
    Plants produce a myriad of specialized metabolites to overcome their sessile habit and combat biotic as well as abiotic stresses. Evolution has shaped the diversity of specialized metabolites, which then drives many other aspects of plant biodiversity. However, until recently, large-scale studies investigating the diversity of specialized metabolites in an evolutionary context have been limited by the impossibility of identifying chemical structures of hundreds to thousands of compounds in a time-feasible manner. Here we introduce a workflow for large-scale, semi-automated annotation of specialized metabolites and apply it to over 1000 metabolites of the cosmopolitan plant family Rhamnaceae. We enhance the putative annotation coverage dramatically, from 2.5% based on spectral library matches alone to 42.6% of total MS/MS molecular features, extending annotations from well-known plant compound classes into dark plant metabolomics. To gain insights into substructural diversity within this plant family, we also extract patterns of co-occurring fragments and neutral losses, so-called Mass2Motifs, from the dataset; for example, only the Ziziphoid clade developed the triterpenoid biosynthetic pathway, whereas the Rhamnoid clade predominantly developed diversity in flavonoid glycosides, including 7-O-methyltransferase activity. Our workflow provides the foundations for the automated, high-throughput chemical identification of massive metabolite spaces, and we expect it to revolutionize our understanding of plant chemoevolutionary mechanisms.</p

    Assessing specialized metabolite diversity in the cosmopolitan plant genus Euphorbia l.

    Get PDF
    Coevolutionary theory suggests that an arms race between plants and herbivores yields increased plant specialized metabolite diversity and the geographic mosaic theory of coevolution predicts that coevolutionary interactions vary across geographic scales. Consequently, plant specialized metabolite diversity is expected to be highest in coevolutionary hotspots, geographic regions, which exhibit strong reciprocal selection on the interacting species. Despite being well-established theoretical frameworks, technical limitations have precluded rigorous hypothesis testing. Here we aim at understanding how geographic separation over evolutionary time may have impacted chemical differentiation in the cosmopolitan plant genus Euphorbia. We use a combination of state-of-the-art computational mass spectral metabolomics tools together with cell-based high-throughput immunomodulatory testing. Our results show significant differences in specialized metabolite diversity across geographically separated phylogenetic clades. Chemical structural diversity of the highly toxic Euphorbia diterpenoids is significantly reduced in species native to the Americas, compared to Afro-Eurasia. The localization of these compounds to young stems and roots suggest a possible ecological relevance in herbivory defense. This is further supported by reduced immunomodulatory activity in the American subclade as well as herbivore distribution patterns. We conclude that computational mass spectrometric metabolomics coupled with relevant ecological data provide a strong tool for exploring plant specialized metabolite diversity in a chemo-evolutionary framework
    • …
    corecore