
    Chaperone-assisted translocation of a polymer through a nanopore

    Using Langevin dynamics simulations, we investigate the dynamics of chaperone-assisted translocation of a flexible polymer through a nanopore. We find that increasing the binding energy ϵ between the chaperone and the chain, and the chaperone concentration N_c, can greatly improve the translocation probability. In particular, as the chaperone concentration increases, a maximum translocation probability is observed for weak binding. For a fixed chaperone concentration, the histogram of the translocation time τ undergoes a transition from a long-tailed distribution to a Gaussian distribution with increasing ϵ. For short chains, τ rapidly decreases and then almost saturates with increasing binding energy; for longer chains at lower chaperone concentration, however, it has a minimum. We also show that τ has a minimum as a function of the chaperone concentration. For different ϵ, a nonuniversal dependence of τ on the chain length N is also observed. These results can be interpreted through characteristic entropic effects for flexible polymers induced either by crowding at high chaperone concentration or by intersegmental binding at high binding energy. Comment: 10 pages, to appear in J. Am. Chem. So
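The overdamped Langevin scheme underlying such simulations can be sketched in a few lines. This is a generic 1D integrator for illustration only, not the paper's polymer model (which couples many beads through bonded and chaperone-binding forces); `gamma`, `kT` and `force` are illustrative parameter names.

```python
import numpy as np

def langevin_trajectory(n_steps, dt=0.01, gamma=1.0, kT=1.0, force=0.0, seed=0):
    """Overdamped Langevin dynamics for a single 1D coordinate:
    x(t+dt) = x(t) + (F/gamma)*dt + sqrt(2*kT*dt/gamma) * xi,
    with xi a standard normal deviate (fluctuation-dissipation relation)."""
    rng = np.random.default_rng(seed)
    x = np.zeros(n_steps + 1)
    noise_amp = np.sqrt(2.0 * kT * dt / gamma)
    for i in range(n_steps):
        x[i + 1] = x[i] + (force / gamma) * dt + noise_amp * rng.standard_normal()
    return x

traj = langevin_trajectory(10_000)  # free diffusion when force = 0
```

A full translocation simulation would apply this update to every monomer and add binding/unbinding moves for the chaperones.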

    Integrating quantitative proteomics and metabolomics with a genome-scale metabolic network model

    Motivation: The availability of modern sequencing techniques has led to a rapid increase in the number of reconstructed metabolic networks. Using these models as a platform for the analysis of high-throughput transcriptomic, proteomic and metabolomic data can provide valuable insight into conditional changes in the metabolic activity of an organism. While transcriptomics and proteomics provide important insights into the hierarchical regulation of metabolic flux, metabolomics sheds light on the actual enzyme activity through metabolic regulation and mass-action effects. Here we introduce a new method, termed integrative omics-metabolic analysis (IOMA), that quantitatively integrates proteomic and metabolomic data with genome-scale metabolic models to more accurately predict metabolic flux distributions. The method is formulated as a quadratic programming (QP) problem that seeks a steady-state flux distribution in which the flux through reactions with measured proteomic and metabolomic data is as consistent as possible with kinetically derived flux estimations.
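The QP at the heart of this kind of method can be illustrated with a toy equality-constrained least-squares problem: find fluxes as close as possible to kinetically derived estimates while respecting steady state. This is a minimal sketch solved via the KKT linear system, not the IOMA implementation (which also weights reactions by measurement confidence); `S` and `v_hat` below are invented toy data.

```python
import numpy as np

def fit_fluxes(S, v_hat):
    """min ||v - v_hat||^2 subject to S @ v = 0, solved via the KKT system
    [[I, S^T], [S, 0]] @ [v; lam] = [v_hat; 0]."""
    m, n = S.shape
    K = np.block([[np.eye(n), S.T], [S, np.zeros((m, m))]])
    rhs = np.concatenate([v_hat, np.zeros(m)])
    return np.linalg.solve(K, rhs)[:n]

# Toy network: R1 produces A, R2 converts A -> B, R3 consumes B.
S = np.array([[1.0, -1.0, 0.0],
              [0.0, 1.0, -1.0]])
v_hat = np.array([1.0, 0.8, 1.1])   # hypothetical kinetic flux estimates
v = fit_fluxes(S, v_hat)            # steady state forces v1 = v2 = v3
```

Here the constraint collapses all three fluxes to a single value, so the fitted flux is simply the mean of the kinetic estimates.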

    Evaluation of rate law approximations in bottom-up kinetic models of metabolism.

    Background: The mechanistic description of enzyme kinetics in a dynamic model of metabolism requires specifying the numerical values of a large number of kinetic parameters. The parameterization challenge is often addressed through the use of simplifying approximations to form reaction rate laws with reduced numbers of parameters. Whether such simplified models can reproduce dynamic characteristics of the full system is an important question. Results: In this work, we compared the local transient response properties of dynamic models constructed using rate laws with varying levels of approximation. These approximate rate laws were: 1) a Michaelis-Menten rate law with measured enzyme parameters, 2) a Michaelis-Menten rate law with approximated parameters, using the convenience kinetics convention, 3) a thermodynamic rate law resulting from a metabolite saturation assumption, and 4) a pure chemical reaction mass-action rate law that removes the role of the enzyme from the reaction kinetics. We utilized in vivo data for the human red blood cell to compare the effect of rate law choices against the backdrop of physiological flux and concentration differences. We found that the Michaelis-Menten rate law with measured enzyme parameters yields an excellent approximation of the full system dynamics, while the other assumptions cause greater discrepancies in system dynamic behavior. However, iteratively replacing mechanistic rate laws with approximations resulted in a model that retains a high correlation with the true model behavior. Investigating this consistency, we determined that the order-of-magnitude differences among fluxes and concentrations in the network greatly influence the network dynamics.
We further identified reaction features, such as thermodynamic reversibility, high substrate concentration, and lack of allosteric regulation, which make certain reactions more suitable for rate law approximations. Conclusions: Overall, our work generally supports the use of approximate rate laws when building large-scale kinetic models, due to the key role that physiologically meaningful flux and concentration ranges play in determining network dynamics. However, we also showed that detailed mechanistic models offer a clear benefit in prediction accuracy when data are available. The work here should help provide guidance to future kinetic modeling efforts on the choice of rate law and parameterization approaches.
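The contrast between the first and fourth rate laws in the list above can be made concrete. A minimal sketch with invented parameter values: at low substrate concentration the two laws coincide (with k ≈ vmax/km), while only the Michaelis-Menten law saturates at high substrate.

```python
def michaelis_menten(s, vmax, km):
    """Irreversible Michaelis-Menten rate law: v = vmax * s / (km + s)."""
    return vmax * s / (km + s)

def mass_action(s, k):
    """Mass-action rate law: rate simply proportional to substrate."""
    return k * s

# at low substrate the two agree (taking k = vmax/km);
# at high substrate Michaelis-Menten saturates near vmax
low = michaelis_menten(0.001, vmax=1.0, km=0.5)    # ~ mass_action(0.001, k=2.0)
high = michaelis_menten(100.0, vmax=1.0, km=0.5)   # ~ 1.0 (saturated)
```

This saturation is exactly the behavior a mass-action approximation cannot reproduce, which is one reason the choice matters most for enzymes operating near saturation.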

    The Escherichia coli transcriptome mostly consists of independently regulated modules

    Underlying cellular responses is a transcriptional regulatory network (TRN) that modulates gene expression. A useful description of the TRN would decompose the transcriptome into targeted effects of individual transcriptional regulators. Here, we apply unsupervised machine learning to a diverse compendium of over 250 high-quality Escherichia coli RNA-seq datasets to identify 92 statistically independent signals that modulate the expression of specific gene sets. We show that 61 of these transcriptomic signals represent the effects of currently characterized transcriptional regulators. Condition-specific activation of signals is validated by exposure of E. coli to new environmental conditions. The resulting decomposition of the transcriptome provides: a mechanistic, systems-level, network-based explanation of responses to environmental and genetic perturbations; a guide to gene and regulator function discovery; and a basis for characterizing transcriptomic differences in multiple strains. Taken together, our results show that signal summation describes the composition of a model prokaryotic transcriptome.
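The "signal summation" picture can be illustrated with a toy decomposition: an expression matrix written as loadings times activities, so each condition's profile is a weighted sum of a few independent signals. The data below are synthetic and for illustration only, not the paper's ICA pipeline or E. coli compendium.

```python
import numpy as np

rng = np.random.default_rng(1)
n_genes, n_signals, n_conditions = 200, 5, 30

M = rng.normal(size=(n_genes, n_signals))       # gene loadings per signal
A = rng.normal(size=(n_signals, n_conditions))  # signal activities per condition
X = M @ A                                       # expression = summation of signals

# equivalently, the expression matrix is a sum of rank-1 signal contributions
recon = sum(np.outer(M[:, k], A[k, :]) for k in range(n_signals))
```

In the paper's setting, unsupervised decomposition runs in the other direction: it recovers M and A from a measured X, and each recovered signal is then matched to a known regulator where possible.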

    Factor analysis for gene regulatory networks and transcription factor activity profiles

    BACKGROUND: Most existing algorithms for the inference of the structure of gene regulatory networks from gene expression data assume that the activity levels of transcription factors (TFs) are proportional to their mRNA levels. This assumption is invalid for most biological systems. However, one might be able to reconstruct unobserved activity profiles of TFs from the expression profiles of target genes. A simple model is a two-layer network with unobserved TF variables in the first layer and observed gene expression variables in the second layer. TFs are connected to regulated genes by weighted edges. The weights, known as factor loadings, indicate the strength and direction of regulation. Of particular interest are methods that produce sparse networks, networks with few edges, since it is known that most genes are regulated by only a small number of TFs, and most TFs regulate only a small number of genes. RESULTS: In this paper, we explore the performance of five factor analysis algorithms, Bayesian as well as classical, on problems with biological context using both simulated and real data. Factor analysis (FA) models are used in order to describe a larger number of observed variables by a smaller number of unobserved variables, the factors, whereby all correlation between observed variables is explained by common factors. Bayesian FA methods allow one to infer sparse networks by enforcing sparsity through priors. In contrast, in the classical FA, matrix rotation methods are used to enforce sparsity and thus to increase the interpretability of the inferred factor loadings matrix. However, we also show that Bayesian FA models that do not impose sparsity through the priors can still be used for the reconstruction of a gene regulatory network if applied in conjunction with matrix rotation methods. Finally, we show the added advantage of merging the information derived from all algorithms in order to obtain a combined result. 
CONCLUSION: Most of the algorithms tested are successful in reconstructing the connectivity structure as well as the TF profiles. Moreover, we demonstrate that if the underlying network is sparse, it is still possible to reconstruct hidden activity profiles of TFs to some degree without prior connectivity information.
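The two-layer generative model described in the Background can be written down directly: observed expression X = Λ·F + noise, so the model covariance is ΛΛᵀ + Ψ, with all correlation between genes explained by the common factors. The sketch below checks that identity on synthetic data with a sparse, hand-made loadings matrix; every dimension and value is invented for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
n_genes, n_tfs, n_samples = 50, 3, 20000

L = np.zeros((n_genes, n_tfs))                   # sparse factor loadings
L[:20, 0], L[15:35, 1], L[30:, 2] = 1.5, -1.0, 0.8

F = rng.normal(size=(n_tfs, n_samples))          # hidden TF activity profiles
noise = 0.1 * rng.normal(size=(n_genes, n_samples))
X = L @ F + noise                                # observed gene expression

emp_cov = np.cov(X)                              # empirical gene covariance
model_cov = L @ L.T + 0.01 * np.eye(n_genes)     # Lambda Lambda^T + Psi
```

Factor analysis inverts this construction, estimating L (the regulatory weights) and the hidden F from X alone, which is why sparsity priors or rotations are needed to make the recovered loadings interpretable.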

    Exploring matrix factorization techniques for significant genes identification of Alzheimer’s disease microarray gene expression data

    Background: The wide use of high-throughput DNA microarray technology provides an increasingly detailed view of the human transcriptome, from hundreds to thousands of genes. Although biomedical researchers typically design microarray experiments to explore specific biological contexts, the relationships between genes are hard to identify because the data are complex, noisy and high-dimensional, and analyses are often hindered by low statistical power. The main challenge now is to extract valuable biological information from the colossal amount of data to gain insight into biological processes and the mechanisms of human disease. Overcoming this challenge requires mathematical and computational methods that are versatile enough to capture the underlying biological features and simple enough to be applied efficiently to large datasets. Methods: Unsupervised machine learning approaches provide new and efficient analyses of gene expression profiles. In our study, two unsupervised knowledge-based matrix factorization methods, independent component analysis (ICA) and nonnegative matrix factorization (NMF), are integrated to identify significant genes and related pathways in a microarray gene expression dataset of Alzheimer's disease. The advantage of these two approaches is that they can be performed as biclustering methods, by which genes and conditions can be clustered simultaneously. Furthermore, they can group genes into different categories for identifying related diagnostic pathways and regulatory networks. The difference between the two methods is that ICA assumes statistical independence of the expression modes, while NMF requires positivity constraints to generate localized gene expression profiles. Results: In our work, we applied the FastICA and non-smooth NMF methods to DNA microarray gene expression data of Alzheimer's disease, respectively.
The simulation results show that both methods can clearly separate severe AD samples from control samples, and the biological analysis of the identified significant genes and their related pathways demonstrates that these genes play a prominent role in AD and relate the activation patterns to AD phenotypes. The combination of the two methods is validated as effective. Conclusions: Unsupervised matrix factorization methods provide efficient tools for analyzing high-throughput microarray datasets. Given that different unsupervised approaches explore correlations in the high-dimensional data space and identify relevant subspaces based on different hypotheses, integrating these methods to explore the underlying biological information in a microarray dataset is an efficient approach. By combining the significant genes identified by both ICA and NMF, the biological analysis proves effective for elucidating the molecular taxonomy of Alzheimer's disease and enables better experimental design to further identify potential pathways and therapeutic targets of AD.
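NMF itself can be sketched with the classic Lee-Seung multiplicative updates, which keep both factors nonnegative, the positivity constraint the abstract refers to. This is a generic textbook implementation run on random data, not the non-smooth NMF variant the study used.

```python
import numpy as np

def nmf(X, k, n_iter=1000, seed=0, eps=1e-9):
    """Basic NMF via Lee-Seung multiplicative updates: X ~ W @ H, W, H >= 0."""
    rng = np.random.default_rng(seed)
    n, m = X.shape
    W = rng.random((n, k)) + eps
    H = rng.random((k, m)) + eps
    for _ in range(n_iter):
        H *= (W.T @ X) / (W.T @ W @ H + eps)   # update activities
        W *= (X @ H.T) / (W @ H @ H.T + eps)   # update basis vectors
    return W, H

rng = np.random.default_rng(2)
X = rng.random((30, 4)) @ rng.random((4, 25))  # exactly rank-4 nonnegative data
W, H = nmf(X, k=4)
```

In the expression setting, rows of X are genes and columns are samples, so the columns of W act as localized "metagenes" whose gene memberships can be read off directly because the entries are nonnegative.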

    Propagating semantic information in biochemical network models

    Background: To enable automatic searches, alignments, and model combination, the elements of systems biology models need to be compared and matched across models. Elements can be identified by machine-readable biological annotations, but assigning such annotations and matching non-annotated elements is tedious work and calls for automation. Results: A new method called "semantic propagation" allows the comparison of model elements based not only on their own annotations, but also on annotations of surrounding elements in the network. One may either propagate feature vectors, describing the annotations of individual elements, or quantitative similarities between elements from different models. Based on semantic propagation, we align partially annotated models and find annotations for non-annotated model elements. Conclusions: Semantic propagation and model alignment are included in the open-source library semanticSBML, available on sourceforge. Online services for model alignment and for annotation prediction can be used at http://www.semanticsbml.org.
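The feature-vector variant of semantic propagation can be sketched as repeatedly blending each element's annotation vector with the average of its network neighbors'. The update rule and the `alpha` mixing weight below are illustrative assumptions, not the semanticSBML implementation.

```python
import numpy as np

def propagate(features, adjacency, alpha=0.5, n_rounds=2):
    """Blend each element's feature vector with the mean of its neighbors',
    so an element's vector also reflects annotations of surrounding elements."""
    F = features.astype(float).copy()
    deg = adjacency.sum(axis=1, keepdims=True)
    deg[deg == 0] = 1.0                     # isolated nodes keep their vector
    for _ in range(n_rounds):
        F = (1 - alpha) * F + alpha * (adjacency @ F) / deg
    return F

# 3-node chain a - b - c; only node a carries annotation feature 0,
# only node b carries feature 1, node c is non-annotated
A = np.array([[0, 1, 0],
              [1, 0, 1],
              [0, 1, 0]], dtype=float)
F0 = np.array([[1.0, 0.0],
               [0.0, 1.0],
               [0.0, 0.0]])
F = propagate(F0, A)
```

After two rounds, the non-annotated node c has picked up a diluted share of node a's annotation via b, which is exactly what lets the method suggest annotations for unlabeled elements.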

    Ranked retrieval of Computational Biology models

    Background: The study of biological systems demands computational support. When targeting a biological problem, the reuse of existing computational models can save time and effort. Deciding on potentially suitable models, however, becomes more challenging with the increasing number of computational models available, and even more so when considering the models' growing complexity. Firstly, among a set of potential model candidates it is difficult to decide on the model that best suits one's needs. Secondly, it is hard to grasp the nature of an unknown model listed in a search result set, and to judge how well it fits the particular problem one has in mind. Results: Here we present an improved search approach for computational models of biological processes. It is based on existing retrieval and ranking methods from Information Retrieval. The approach incorporates annotations suggested by MIRIAM, and additional meta-information. It is now part of the search engine of BioModels Database, a standard repository for computational models. Conclusions: The introduced concept and implementation are, to our knowledge, the first application of Information Retrieval techniques to model search in Computational Systems Biology. Using the example of BioModels Database, it was shown that the approach is feasible and extends the current possibilities to search for relevant models. The advantages of our system over existing solutions are that we incorporate a rich set of meta-information, and that we provide the user with a relevance ranking of the models found for a query. Better search capabilities in model databases are expected to have a positive effect on the reuse of existing models.
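The retrieval-and-ranking core of such a system can be sketched with TF-IDF weighting and cosine similarity, the standard Information Retrieval baseline. This pure-Python toy indexes model descriptions as bags of words; it illustrates the general technique, not the BioModels Database search engine or its use of MIRIAM annotations.

```python
import math
from collections import Counter

def tfidf_rank(query, docs):
    """Rank documents by cosine similarity of TF-IDF vectors."""
    n = len(docs)
    tokenized = [d.lower().split() for d in docs]
    df = Counter(t for doc in tokenized for t in set(doc))
    idf = {t: math.log(n / df[t]) + 1.0 for t in df}

    def vec(tokens):
        tf = Counter(tokens)
        return {t: tf[t] * idf[t] for t in tf if t in idf}

    def cosine(a, b):
        dot = sum(a[t] * b.get(t, 0.0) for t in a)
        na = math.sqrt(sum(v * v for v in a.values()))
        nb = math.sqrt(sum(v * v for v in b.values()))
        return dot / (na * nb) if na and nb else 0.0

    q = vec(query.lower().split())
    scores = [(cosine(q, vec(doc)), i) for i, doc in enumerate(tokenized)]
    return sorted(scores, reverse=True)   # best-matching document first

docs = ["glycolysis model of yeast metabolism",
        "signal transduction cascade model",
        "yeast cell cycle model"]
ranking = tfidf_rank("yeast metabolism", docs)
```

A production system would extend the document text with the model's annotations and meta-information before indexing, which is the enrichment the abstract describes.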

    A review of foreign exchange risk premium theory

    The foreign exchange risk premium is central to studying exchange-rate movements from an asset-pricing perspective, but no consensus has yet been reached. At present, time-series modeling of the foreign exchange risk premium remains unsatisfactory: neither latent-variable models nor affine models can capture its time-series characteristics. Research on the risk factors behind the foreign exchange risk premium also lacks a unified framework; consumption, market-microstructure factors and monetary policy each explain only part of its variation. Models based on the stochastic discount factor are still relatively scattered, but this framework is the focus of subsequent research. An urgent research topic is to treat the exchange rate both as the price of an investable asset and as the relative price of two currencies, to study the intrinsic links between the foreign exchange risk premium and the economic fluctuations of the two countries and the correlation between their economies, and to clarify theoretically the factors that influence the foreign exchange risk premium.
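The stochastic-discount-factor framework the review points to can be summarized in one standard textbook relation (under joint log-normality; this is a generic statement, not the review's own model). With m the log SDF, s the log exchange rate (domestic price of foreign currency), and i, i* the domestic and foreign interest rates, the log currency risk premium satisfies

```latex
p_t \;=\; i_t^{*} - i_t + \mathbb{E}_t\!\left[\Delta s_{t+1}\right]
      + \tfrac{1}{2}\,\mathrm{Var}_t\!\left(\Delta s_{t+1}\right)
   \;=\; -\,\mathrm{Cov}_t\!\left(m_{t+1},\, \Delta s_{t+1}\right)
```

so explaining the premium amounts to explaining how the SDF co-moves with exchange-rate changes, which is where the consumption, microstructure and monetary-policy factors discussed above enter.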

    Hydrophobicity and Charge Shape Cellular Metabolite Concentrations

    What governs the concentrations of metabolites within living cells? Beyond specific metabolic and enzymatic considerations, are there global trends that affect their values? We hypothesize that the physico-chemical properties of metabolites considerably affect their in-vivo concentrations. The recently achieved experimental capability to measure the concentrations of many metabolites simultaneously has made the testing of this hypothesis possible. Here, we analyze such recently available data sets of metabolite concentrations within E. coli, S. cerevisiae, B. subtilis and human. Overall, these data sets encompass more than twenty conditions, each containing dozens (28-108) of simultaneously measured metabolites. We test for correlations with various physico-chemical properties and find that the number of charged atoms, non-polar surface area, lipophilicity and solubility consistently correlate with concentration. In most data sets, a change in one of these properties elicits a ∼100-fold increase in metabolite concentrations. We find that the non-polar surface area and number of charged atoms account for almost half of the variation in concentrations in the most reliable and comprehensive data set. Analyzing specific groups of metabolites, such as amino acids or phosphorylated nucleotides, reveals an even higher dependence of concentration on hydrophobicity. We suggest that these findings can be explained by evolutionary constraints imposed on metabolite concentrations and discuss possible selective pressures that can account for them. These include the reduction of solute leakage through the lipid membrane, avoidance of deleterious aggregates and reduction of non-specific hydrophobic binding. By highlighting the global constraints imposed on metabolic pathways, future research could shed light onto aspects of biochemical evolution and the chemical constraints that bound metabolic engineering efforts.
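The kind of correlation analysis described above can be sketched on synthetic data: correlate a physico-chemical property against log10 concentration, where a shift of 2 log units corresponds to the ∼100-fold effect mentioned. The data below are simulated under an assumed negative relationship, not the paper's measurements.

```python
import numpy as np

rng = np.random.default_rng(3)
n = 60
hydrophobicity = rng.normal(size=n)        # stand-in for e.g. logP values
# assumed trend: more hydrophobic metabolites kept at lower concentrations
log10_conc = -1.0 * hydrophobicity + rng.normal(scale=0.5, size=n)

r = np.corrcoef(hydrophobicity, log10_conc)[0, 1]  # Pearson r on the log scale
```

Working on the log10 scale is the natural choice here because metabolite concentrations span orders of magnitude, and a 2-unit shift on that scale is exactly a 100-fold concentration change.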