117 research outputs found

    MetAssimulo:Simulation of Realistic NMR Metabolic Profiles

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Probing the complex fusion of genetic and environmental interactions, metabolic profiling (or metabolomics/metabonomics), the study of small molecules involved in metabolic reactions, is a rapidly expanding 'omics' field. A major technique for capturing metabolite data is <sup>1</sup>H-NMR spectroscopy and this yields highly complex profiles that require sophisticated statistical analysis methods. However, experimental data is difficult to control and expensive to obtain. Thus data simulation is a productive route to aid algorithm development.</p> <p>Results</p> <p>MetAssimulo is a MATLAB-based package that has been developed to simulate <sup>1</sup>H-NMR spectra of complex mixtures such as metabolic profiles. Drawing data from a metabolite standard spectral database in conjunction with concentration information input by the user or constructed automatically from the Human Metabolome Database, MetAssimulo is able to create realistic metabolic profiles containing large numbers of metabolites with a range of user-defined properties. Current features include the simulation of two groups ('case' and 'control') specified by means and standard deviations of concentrations for each metabolite. The software enables addition of spectral noise with a realistic autocorrelation structure at user controllable levels. A crucial feature of the algorithm is its ability to simulate both intra- and inter-metabolite correlations, the analysis of which is fundamental to many techniques in the field. Further, MetAssimulo is able to simulate shifts in NMR peak positions that result from matrix effects such as pH differences which are often observed in metabolic NMR spectra and pose serious challenges for statistical algorithms.</p> <p>Conclusions</p> <p>No other software is currently able to simulate NMR metabolic profiles with such complexity and flexibility. This paper describes the algorithm behind MetAssimulo and demonstrates how it can be used to simulate realistic NMR metabolic profiles with which to develop and test new data analysis techniques. MetAssimulo is freely available for academic use at <url>http://cisbic.bioinformatics.ic.ac.uk/metassimulo/</url>.</p

    Extraction and Integration of Genetic Networks from Short-Profile Omic Data Sets.

    Get PDF
    Mass spectrometry technologies are widely used in the fields of ionomics and metabolomics to simultaneously profile the intracellular concentrations of, e.g., amino acids or elements in genome-wide mutant libraries. These molecular or sub-molecular features are generally non-Gaussian and their covariance reveals patterns of correlations that reflect the system nature of the cell biochemistry and biology. Here, we introduce two similarity measures, the Mahalanobis cosine and the hybrid Mahalanobis cosine, that enforce information from the empirical covariance matrix of omics data from high-throughput screening and that can be used to quantify similarities between the profiled features of different mutants. We evaluate the performance of these similarity measures in the task of inferring and integrating genetic networks from short-profile ionomics/metabolomics data through an analysis of experimental data sets related to the ionome and the metabolome of the model organism S. cerevisiae. The study of the resulting ionome-metabolome Saccharomyces cerevisiae multilayer genetic network, which encodes multiple omic-specific levels of correlations between genes, shows that the proposed measures can provide an alternative description of relations between biological processes when compared to the commonly used Pearson's correlation coefficient and have the potential to guide the construction of novel hypotheses on the function of uncharacterised genes

    Multiple-testing correction in metabolome-wide association studies.

    Get PDF
    BACKGROUND: The search for statistically significant relationships between molecular markers and outcomes is challenging when dealing with high-dimensional, noisy and collinear multivariate omics data, such as metabolomic profiles. Permutation procedures allow for the estimation of adjusted significance levels without assuming independence among metabolomic variables. Nevertheless, the complex non-normal structure of metabolic profiles and outcomes may bias the permutation results leading to overly conservative threshold estimates i.e. lower than those from a Bonferroni or Sidak correction. METHODS: Within a univariate permutation procedure we employ parametric simulation methods based on the multivariate (log-)Normal distribution to obtain adjusted significance levels which are consistent across different outcomes while effectively controlling the type I error rate. Next, we derive an alternative closed-form expression for the estimation of the number of non-redundant metabolic variates based on the spectral decomposition of their correlation matrix. The performance of the method is tested for different model parametrizations and across a wide range of correlation levels of the variates using synthetic and real data sets. RESULTS: Both the permutation-based formulation and the more practical closed form expression are found to give an effective indication of the number of independent metabolic effects exhibited by the system, while guaranteeing that the derived adjusted threshold is stable across outcome measures with diverse properties

    Identification of prediagnostic metabolites associated with prostate cancer risk by untargeted mass spectrometry-based metabolomics: A case-control study nested in the Northern Sweden Health and Disease Study

    Get PDF
    Prostate cancer (PCa) is the most common cancer form in males in many European and American countries, but there are still open questions regarding its etiology. Untargeted metabolomics can produce an unbiased global metabolic profile, with the opportunity for uncovering new plasma metabolites prospectively associated with risk of PCa, providing insights into disease etiology. We conducted a prospective untargeted liquid chromatography-mass spectrometry (LC-MS) metabolomics analysis using prediagnostic fasting plasma samples from 752 PCa case-control pairs nested within the Northern Sweden Health and Disease Study (NSHDS). The pairs were matched by age, BMI, and sample storage time. Discriminating features were identified by a combination of orthogonal projection to latent structures-effect projections (OPLS-EP) and Wilcoxon signed-rank tests. Their prospective associations with PCa risk were investigated by conditional logistic regression. Subgroup analyses based on stratification by disease aggressiveness and baseline age were also conducted. Various free fatty acids and phospholipids were positively associated with overall risk of PCa and in various stratification subgroups. Aromatic amino acids were positively associated with overall risk of PCa. Uric acid was positively, and glucose negatively, associated with risk of PCa in the older subgroup. This is the largest untargeted LC-MS based metabolomics study to date on plasma metabolites prospectively associated with risk of developing PCa. Different subgroups of disease aggressiveness and baseline age showed different associations with metabolites. The findings suggest that shifts in plasma concentrations of metabolites in lipid, aromatic amino acid, and glucose metabolism are associated with risk of developing PCa during the following two decades

    BATMAN-an R package for the automated quantification of me- tabolites from NMR spectra using a Bayesian Model

    Get PDF
    ABSTRACT Motivation: NMR spectra are widely used in metabolomics to obtain metabolite profiles in complex biological mixtures. Common methods used to assign and estimate concentrations of metabolites involve either an expert manual peak fitting or extra pre-processing steps, such as peak alignment and binning. Peak fitting is very time consuming and is subject to human error. Conversely, alignment and binning can introduce artefacts and limit immediate biological interpretation of models. Results: We present the Bayesian AuTomated Metabolite Analyser for NMR spectra (BATMAN), an R package which deconvolutes peaks from 1-dimensional NMR spectra, automatically assigns them to specific metabolites from a target list and obtains concentration estimates. The Bayesian model incorporates information on characteristic peak patterns of metabolites and is able to account for shifts in the position of peaks commonly seen in NMR spectra of biological samples. It applies a Markov Chain Monte Carlo (MCMC) algorithm to sample from a joint posterior distribution of the model parameters and obtains concentration estimates with reduced error compared with conventional numerical integration and comparable to manual deconvolution by experienced spectroscopists

    Correlation Network Analysis reveals a sequential reorganization of metabolic and transcriptional states during germination and gene-metabolite relationships in developing seedlings of Arabidopsis

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Holistic profiling and systems biology studies of nutrient availability are providing more and more insight into the mechanisms by which gene expression responds to diverse nutrients and metabolites. Less is known about the mechanisms by which gene expression is affected by endogenous metabolites, which can change dramatically during development. Multivariate statistics and correlation network analysis approaches were applied to non-targeted profiling data to investigate transcriptional and metabolic states and to identify metabolites potentially influencing gene expression during the heterotrophic to autotrophic transition of seedling establishment.</p> <p>Results</p> <p>Microarray-based transcript profiles were obtained from extracts of Arabidopsis seeds or seedlings harvested from imbibition to eight days-old. <sup>1</sup>H-NMR metabolite profiles were obtained for corresponding samples. Analysis of transcript data revealed high differential gene expression through seedling emergence followed by a period of less change. Differential gene expression increased gradually to day 8, and showed two days, 5 and 7, with a very high proportion of up-regulated genes, including transcription factor/signaling genes. Network cartography using spring embedding revealed two primary clusters of highly correlated metabolites, which appear to reflect temporally distinct metabolic states. Principle Component Analyses of both sets of profiling data produced a chronological spread of time points, which would be expected of a developmental series. The network cartography of the transcript data produced two distinct clusters comprising days 0 to 2 and days 3 to 8, whereas the corresponding analysis of metabolite data revealed a shift of day 2 into the day 3 to 8 group. A metabolite and transcript pair-wise correlation analysis encompassing all time points gave a set of 237 highly significant correlations. Of 129 genes correlated to sucrose, 44 of them were known to be sucrose responsive including a number of transcription factors.</p> <p>Conclusions</p> <p>Microarray analysis during germination and establishment revealed major transitions in transcriptional activity at time points potentially associated with developmental transitions. Network cartography using spring-embedding indicate that a shift in the state of nutritionally important metabolites precedes a major shift in the transcriptional state going from germination to seedling emergence. Pair-wise linear correlations of transcript and metabolite levels identified many genes known to be influenced by metabolites, and provided other targets to investigate metabolite regulation of gene expression during seedling establishment.</p

    B stars as a diagnostic of star-formation at low and high redshift

    Get PDF
    We have extended the evolutionary synthesis models by Leitherer et al. (1999b) by including a new library of B stars generated from the IUE high-dispersion spectra archive. We present the library and show how the stellar spectral properties vary according to luminosity classes and spectral types. We have generated synthetic UV spectra for prototypical young stellar populations varying the IMF and the star formation law. Clear signs of age effects are seen in all models. The contribution of B stars in the UV line spectrum is clearly detected, in particular for greater ages when O stars have evolved. With the addition of the new library we are able to investigate the fraction of stellar and interstellar contributions and the variation in the spectral shapes of intense lines. We have used our models to date the spectrum of the local super star cluster NGC1705-1. Photospheric lines of CIII1247, SiIII1417, and SV1502 were used as diagnostics to date the burst of NGC 1705-1 at 10 Myr. We have selected the star-forming galaxy 1512-cB58 as a first application of the new models to high-z galaxies. This galaxy is at z=2.723, it is gravitationally lensed, and its high signal-to-noise Keck spectrum show features typical of local starburst galaxies, such as NGC 1705-1. Models with continuous star formation were found to be more adequate for 1512-cB58 since there are spectral features typical of a composite stellar population of O and B stars. A model with Z =0.4Z_solar and an IMF with alpha=2.8 reproduces the stellar features of the 1512-cB58 spectrum.Comment: 23 pages with figures, see http://sol.stsci.edu/~demello/welcomeb.htm

    Synergistic and Antagonistic Mutation Responses of Human MCL-5 Cells to Mixtures of Benzo[a]pyrene and 2-Amino-1-Methyl-6-Phenylimidazo[4,5-b]pyridine: Dose-Related Variation in the Joint Effects of Common Dietary Carcinogens.

    No full text
    BACKGROUND: Chemical carcinogens such as benzo[a]pyrene (BaP) and 2-amino-1-methyl-6-phenylimidazo[4,5-b]pyridine (PhIP) may contribute to the etiology of human diet-associated cancer. Individually, these compounds are genotoxic, but the consequences of exposure to mixtures of these chemicals have not been systematically examined. OBJECTIVES: We determined the mutagenic response to mixtures of BaP and PhIP at concentrations relevant to human exposure (micromolar to subnanomolar). METHODS: Human MCL-5 cells (metabolically competent) were exposed to BaP or PhIP individually or in mixtures. Mutagenicity was assessed at the thymidine kinase (TK) locus, CYP1A activity was determined by ethoxyresorufin-O-deethylase (EROD) activity and qRT-PCR, and cell cycle was measured by flow cytometry. RESULTS: Mixtures of BaP and PhIP produced dose responses different from those of the individual chemicals; we observed remarkably increased mutant frequency (MF) at lower concentrations of the mixtures (not mutagenic individually), and decreased MF at higher concentrations of the mixtures, than the calculated predicted additive MF of the individual chemicals. EROD activity and CYP1A1 mRNA levels were correlated with TK MF, supporting involvement of the CYP1A family in mutation. Moreover, a cell cycle G2/M phase block was observed at high-dose combinations, consistent with DNA damage sensing and repair. CONCLUSIONS: Mixtures of these genotoxic chemicals produced mutation responses that differed from those expected for the additive effects of the individual chemicals. The increase in MF for certain combinations of chemicals at low concentrations that were not genotoxic for the individual chemicals, as well as the nonmonotonic dose response, may be important for understanding the mutagenic potential of food and the etiology of diet-associated cancers. CITATION: David R, Ebbels T, Gooderham N. 2016. Synergistic and antagonistic mutation responses of human MCL-5 cells to mixtures of benzo[a]pyrene and 2-amino-1-methyl-6-phenylimidazo[4,5-b]pyridine: dose-related variation in the joint effects of common dietary carcinogens. Environ Health Perspect 124:88–96; http://dx.doi.org/10.1289/ehp.140955
    corecore