561 research outputs found

    Multivariate Analysis in Metabolomics

    Get PDF
    Metabolomics aims to provide a global snapshot of all small-molecule metabolites in cells and biological fluids, free of observational biases inherent to more focused studies of metabolism. However, the staggeringly high information content of such global analyses introduces a challenge of its own; efficiently forming biologically relevant conclusions from any given metabolomics dataset indeed requires specialized forms of data analysis. One approach to finding meaning in metabolomics datasets involves multivariate analysis (MVA) methods such as principal component analysis (PCA) and partial least squares projection to latent structures (PLS), where spectral features contributing most to variation or separation are identified for further analysis. However, as with any mathematical treatment, these methods are not a panacea; this review discusses the use of multivariate analysis for metabolomics, as well as common pitfalls and misconceptions

    Updates in metabolomics tools and resources: 2014-2015

    Get PDF
    Data processing and interpretation represent the most challenging and time-consuming steps in high-throughput metabolomic experiments, regardless of the analytical platforms (MS or NMR spectroscopy based) used for data acquisition. Improved machinery in metabolomics generates increasingly complex datasets that create the need for more and better processing and analysis software and in silico approaches to understand the resulting data. However, a comprehensive source of information describing the utility of the most recently developed and released metabolomics resources—in the form of tools, software, and databases—is currently lacking. Thus, here we provide an overview of freely-available, and open-source, tools, algorithms, and frameworks to make both upcoming and established metabolomics researchers aware of the recent developments in an attempt to advance and facilitate data processing workflows in their metabolomics research. The major topics include tools and researches for data processing, data annotation, and data visualization in MS and NMR-based metabolomics. Most in this review described tools are dedicated to untargeted metabolomics workflows; however, some more specialist tools are described as well. All tools and resources described including their analytical and computational platform dependencies are summarized in an overview Table

    Multivariate Analysis in Metabolomics

    Get PDF
    Metabolomics aims to provide a global snapshot of all small-molecule metabolites in cells and biological fluids, free of observational biases inherent to more focused studies of metabolism. However, the staggeringly high information content of such global analyses introduces a challenge of its own; efficiently forming biologically relevant conclusions from any given metabolomics dataset indeed requires specialized forms of data analysis. One approach to finding meaning in metabolomics datasets involves multivariate analysis (MVA) methods such as principal component analysis (PCA) and partial least squares projection to latent structures (PLS), where spectral features contributing most to variation or separation are identified for further analysis. However, as with any mathematical treatment, these methods are not a panacea; this review discusses the use of multivariate analysis for metabolomics, as well as common pitfalls and misconceptions

    Chronic Kidney Disease Cohort Studies: A Guide to Metabolome Analyses

    Get PDF
    Kidney diseases still pose one of the biggest challenges for global health, and their heterogeneity and often high comorbidity load seriously hinders the unraveling of their underlying pathomechanisms and the delivery of optimal patient care. Metabolomics, the quantitative study of small organic compounds, called metabolites, in a biological specimen, is gaining more and more importance in nephrology research. Conducting a metabolomics study in human kidney disease cohorts, however, requires thorough knowledge about the key workflow steps: study planning, sample collection, metabolomics data acquisition and preprocessing, statistical/bioinformatics data analysis, and results interpretation within a biomedical context. This review provides a guide for future metabolomics studies in human kidney disease cohorts. We will offer an overview of important a priori considerations for metabolomics cohort studies, available analytical as well as statistical/bioinformatics data analysis techniques, and subsequent interpretation of metabolic findings. We will further point out potential research questions for metabolomics studies in the context of kidney diseases and summarize the main results and data availability of important studies already conducted in this field

    Characterisation of the main drivers of intra- and inter- breed variability in the plasma metabolome of dogs

    Get PDF
    INTRODUCTION: Dog breeds are a consequence of artificial selection for specific attributes. These closed genetic populations have metabolic and physiological characteristics that may be revealed by metabolomic analysis. OBJECTIVES: To identify and characterise the drivers of metabolic differences in the fasted plasma metabolome and then determine metabolites differentiating breeds. METHODS: Fasted plasma samples were collected from dogs maintained under two environmental conditions (controlled and client-owned at home). The former (n = 33) consisted of three breeds (Labrador Retriever, Cocker Spaniel and Miniature Schnauzer) fed a single diet batch, the latter (n = 96), client-owned dogs consisted of 9 breeds (Beagle, Chihuahua, Cocker Spaniel, Dachshund, Golden Retriever, Greyhound, German Shepherd, Labrador Retriever and Maltese) consuming various diets under differing feeding regimens. Triplicate samples were taken from Beagle (n = 10) and Labrador Retriever (n = 9) over 3 months. Non-targeted metabolite fingerprinting was performed using flow infusion electrospray-ionization mass spectrometry which was coupled with multivariate data analysis. Metadata factors including age, gender, sexual status, weight, diet and breed were investigated. RESULTS: Breed differences were identified in the plasma metabolome of dogs housed in a controlled environment. Triplicate samples from two breeds identified intra-individual variability, yet breed separation was still observed. The main drivers of variance in dogs maintained in the home environment were associated with breed and gender. Furthermore, metabolite signals were identified that discriminated between Labrador Retriever and Cocker Spaniels in both environments. CONCLUSION: Metabolite fingerprinting of plasma samples can be used to investigate breed differences in client-owned dogs, despite added variance of diet, sexual status and environment. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s11306-016-0997-6) contains supplementary material, which is available to authorized users

    The metaRbolomics Toolbox in Bioconductor and beyond

    Get PDF
    Metabolomics aims to measure and characterise the complex composition of metabolites in a biological system. Metabolomics studies involve sophisticated analytical techniques such as mass spectrometry and nuclear magnetic resonance spectroscopy, and generate large amounts of high-dimensional and complex experimental data. Open source processing and analysis tools are of major interest in light of innovative, open and reproducible science. The scientific community has developed a wide range of open source software, providing freely available advanced processing and analysis approaches. The programming and statistics environment R has emerged as one of the most popular environments to process and analyse Metabolomics datasets. A major benefit of such an environment is the possibility of connecting different tools into more complex workflows. Combining reusable data processing R scripts with the experimental data thus allows for open, reproducible research. This review provides an extensive overview of existing packages in R for different steps in a typical computational metabolomics workflow, including data processing, biostatistics, metabolite annotation and identification, and biochemical network and pathway analysis. Multifunctional workflows, possible user interfaces and integration into workflow management systems are also reviewed. In total, this review summarises more than two hundred metabolomics specific packages primarily available on CRAN, Bioconductor and GitHub

    Bayesian methods for small molecule identification

    Get PDF
    Confident identification of small molecules remains a major challenge in untargeted metabolomics, natural product research and related fields. Liquid chromatography-tandem mass spectrometry is a predominant technique for the high-throughput analysis of small molecules and can detect thousands of different compounds in a biological sample. The automated interpretation of the resulting tandem mass spectra is highly non-trivial and many studies are limited to re-discovering known compounds by searching mass spectra in spectral reference libraries. But these libraries are vastly incomplete and a large portion of measured compounds remains unidentified. This constitutes a major bottleneck in the comprehensive, high-throughput analysis of metabolomics data. In this thesis, we present two computational methods that address different steps in the identification process of small molecules from tandem mass spectra. ZODIAC is a novel method for de novo that is, database-independent molecular formula annotation in complete datasets. It exploits similarities of compounds co-occurring in a sample to find the most likely molecular formula for each individual compound. ZODIAC improves on the currently best-performing method SIRIUS; on one dataset by 16.5 fold. We show that de novo molecular formula annotation is not just a theoretical advantage: We discover multiple novel molecular formulas absent from PubChem, one of the biggest structure databases. Furthermore, we introduce a novel scoring for CSI:FingerID, a state-of-the-art method for searching tandem mass spectra in a structure database. This scoring models dependencies between different molecular properties in a predicted molecular fingerprint via Bayesian networks. This problem has the unusual property, that the marginal probabilities differ for each predicted query fingerprint. Thus, we need to apply Bayesian networks in a novel, non-standard fashion. Modeling dependencies improves on the currently best scoring

    Unravelling the metabolic impact of SBS-associated microbial dysbiosis: Insights from the piglet short bowel syndrome model

    Get PDF
    peer-reviewedLiver disease is a major source of morbidity and mortality in children with short bowel syndrome (SBS). SBS-associated microbial dysbiosis has recently been implicated in the development of SBS-associated liver disease (SBS-ALD), however the pathological implications of this association have not been explored. In this study high-throughput sequencing of colonic content from the well-validated piglet SBS-ALD model was examined to determine alterations in microbial communities, and concurrent metabolic alterations identified in urine samples via targeted mass spectrometry approaches (GC-MS, LC-MS, FIA-MS) further uncovered impacts of microbial disturbance on metabolic outcomes in SBS-ALD. Multi-variate analyses were performed to elucidate contributing SBS-ALD microbe and metabolite panels and to identify microbe-metabolite interactions. A unique SBS-ALD microbe panel was clearest at the genus level, with discriminating bacteria predominantly from the Firmicutes and Bacteroidetes phyla. The SBS-ALD metabolome included important alterations in the microbial metabolism of amino acids and the mitochondrial metabolism of branched chain amino acids. Correlation analysis defined microbe-metabolite clustering patterns unique to SBS-ALD and identified a metabolite panel that correlates with dysbiosis of the gut microbiome in SBS

    The metaRbolomics Toolbox in Bioconductor and beyond

    Get PDF
    Metabolomics aims to measure and characterise the complex composition of metabolites in a biological system. Metabolomics studies involve sophisticated analytical techniques such as mass spectrometry and nuclear magnetic resonance spectroscopy, and generate large amounts of high-dimensional and complex experimental data. Open source processing and analysis tools are of major interest in light of innovative, open and reproducible science. The scientific community has developed a wide range of open source software, providing freely available advanced processing and analysis approaches. The programming and statistics environment R has emerged as one of the most popular environments to process and analyse Metabolomics datasets. A major benefit of such an environment is the possibility of connecting different tools into more complex workflows. Combining reusable data processing R scripts with the experimental data thus allows for open, reproducible research. This review provides an extensive overview of existing packages in R for different steps in a typical computational metabolomics workflow, including data processing, biostatistics, metabolite annotation and identification, and biochemical network and pathway analysis. Multifunctional workflows, possible user interfaces and integration into workflow management systems are also reviewed. In total, this review summarises more than two hundred metabolomics specific packages primarily available on CRAN, Bioconductor and GitHub
    • …
    corecore