17 research outputs found

    A reference map of potential determinants for the human serum metabolome

    Get PDF
    The serum metabolome contains a plethora of biomarkers and causative agents of various diseases, some of which are endogenously produced and some that have been taken up from the environment(1). The origins of specific compounds are known, including metabolites that are highly heritable(2,3), or those that are influenced by the gut microbiome(4), by lifestyle choices such as smoking(5), or by diet(6). However, the key determinants of most metabolites are still poorly understood. Here we measured the levels of 1,251 metabolites in serum samples from a unique and deeply phenotyped healthy human cohort of 491 individuals. We applied machine-learning algorithms to predict metabolite levels in held-out individuals on the basis of host genetics, gut microbiome, clinical parameters, diet, lifestyle and anthropometric measurements, and obtained statistically significant predictions for more than 76% of the profiled metabolites. Diet and microbiome had the strongest predictive power, and each explained hundreds of metabolites-in some cases, explaining more than 50% of the observed variance. We further validated microbiome-related predictions by showing a high replication rate in two geographically independent cohorts(7,8) that were not available to us when we trained the algorithms. We used feature attribution analysis(9) to reveal specific dietary and bacterial interactions. We further demonstrate that some of these interactions might be causal, as some metabolites that we predicted to be positively associated with bread were found to increase after a randomized clinical trial of bread intervention. Overall, our results reveal potential determinants of more than 800 metabolites, paving the way towards a mechanistic understanding of alterations in metabolites under different conditions and to designing interventions for manipulating the levels of circulating metabolites.The levels of 1,251 metabolites are measured in 475 phenotyped individuals, and machine-learning algorithms reveal that diet and the microbiome are the determinants with the strongest predictive power for the levels of these metabolites

    Hierarchy of non-glucose sugars in Escherichia coli.

    Get PDF
    BACKGROUND Understanding how cells make decisions, and why they make the decisions they make, is of fundamental interest in systems biology. To address this, we study the decisions made by E. coli on which genes to express when presented with two different sugars. It is well-known that glucose, E. coli's preferred carbon source, represses the uptake of other sugars by means of global and gene-specific mechanisms. However, less is known about the utilization of glucose-free sugar mixtures which are found in the natural environment of E. coli and in biotechnology. RESULTS Here, we combine experiment and theory to map the choices of E. coli among 6 different non-glucose carbon sources. We used robotic assays and fluorescence reporter strains to make precise measurements of promoter activity and growth rate in all pairs of these sugars. We find that the sugars can be ranked in a hierarchy: in a mixture of a higher and a lower sugar, the lower sugar system shows reduced promoter activity. The hierarchy corresponds to the growth rate supported by each sugar- the faster the growth rate, the higher the sugar on the hierarchy. The hierarchy is 'soft' in the sense that the lower sugar promoters are not completely repressed. Measurement of the activity of the master regulator CRP-cAMP shows that the hierarchy can be quantitatively explained based on differential activation of the promoters by CRP-cAMP. Comparing sugar system activation as a function of time in sugar pair mixtures at sub-saturating concentrations, we find cases of sequential activation, and also cases of simultaneous expression of both systems. Such simultaneous expression is not predicted by simple models of growth rate optimization, which predict only sequential activation. We extend these models by suggesting multi-objective optimization for both growing rapidly now and preparing the cell for future growth on the poorer sugar. CONCLUSION We find a defined hierarchy of sugar utilization, which can be quantitatively explained by differential activation by the master regulator cAMP-CRP. The present approach can be used to understand cell decisions when presented with mixtures of conditions

    Linear Superposition and Prediction of Bacterial Promoter Activity Dynamics in Complex Conditions

    No full text
    <div><p>Bacteria often face complex environments. We asked how gene expression in complex conditions relates to expression in simpler conditions. To address this, we obtained accurate promoter activity dynamical measurements on 94 genes in <i>E. coli</i> in environments made up of all possible combinations of four nutrients and stresses. We find that the dynamics across conditions is well described by two principal component curves specific to each promoter. As a result, the promoter activity dynamics in a combination of conditions is a weighted average of the dynamics in each condition alone. The weights tend to sum up to approximately one. This weighted-average property, called linear superposition, allows predicting the promoter activity dynamics in a combination of conditions based on measurements of pairs of conditions. If these findings apply more generally, they can vastly reduce the number of experiments needed to understand how <i>E. coli</i> responds to the combinatorially huge space of possible environments.</p></div

    An example of deviation from linear combination is found in the <i>LacZ</i> promoter in a diauxic shift experiment.

    No full text
    <p>Promoter activity dynamics in a mixture of 0.04% glucose and 0.4% lactose (Blue line) is far from the best fit linear combination of dynamics of glucose or lactose alone (Black line). Error bars are standard error between three independent experiments on different days.</p

    Promoter activities of genes can be well-described by one or two principal components.

    No full text
    <p>(a) <i>fliY</i> promoter activity dynamics in 15 different measured conditions (15 combinations of conditions A,B,C,D). (b) First two principal components dynamics of <i>fliY</i>, according to principal component analysis of <i>fliY</i> dynamics in all conditions and combinations. (c) Fraction of variance explained by the first two principal components for all 94 promoters in 15 environments. Red arrows: <i>fliY</i> and rrnB with 91% and 99% explained variance. (d) <i>rrnB</i> promoter activity dynamics in 15 different measured conditions (e) The first PC of <i>rrnB</i> according to principal component analysis of <i>rrnB</i> expression dynamics in all conditions and combinations. (f) Fraction of variance explained by only the first principal components for all 94 promoters in 15 environments. Red arrows <i>fliY</i> and rrnB with 53% and 98% explained variance.</p

    Dynamics in triplets and quadruplet is well-predicted by a formula that employs dynamics in pairs.

    No full text
    <p>Right column - prediction of triplet A+B+C (combination of casamino acids, ethanol and H<sub>2</sub>O<sub>2</sub>) – in orange line – follows the measured shape of the dynamics – blue curve. Shown are six representative promoters. The black curve is the best fit linear combination. Left column - same for the quadruplet A+B+C+D (combination of casamino acids, ethanol, H<sub>2</sub>O<sub>2</sub> and NaCl). Error bars are standard error between 4 independent experiments on different days.</p

    Schematic overview of workflow for measuring promoter activity dynamics and the analysis testing linear superposition of dynamics in different conditions.

    No full text
    <p>(a) <i>E. coli</i> reporter strains were grown in defined media conditions in 96-well plates and promoter activity – the rate of GFP accumulation per cell – was found as a function of time. (b) Promoter activity dynamics was measured in four conditions and all of their possible pair, triplet and quadruplet combinations. (c) We tested whether the dynamics in a combined condition is a linear combination (weighted average) of the dynamics in each individual condition. We further asked whether the weights sum up to one, signifying a linear superposition. (d) Finally, we asked whether dynamics in triplet and quadruplet conditions can be predicted based on dynamics in pairs of conditions.</p

    Promoter activity dynamics in combined conditions is diverse and well-described as a linear superposition of dynamics in individual conditions.

    No full text
    <p>Six representative promoters are shown (each row belongs to one promoter. The Promoter name is indicated on the left). First column shows individual and pair conditions A+B, second column shows a triplet condition (A+B+C), and the third column a quadruplet (A+B+C+D). A = 0.05% casamino acids, B = 3% Ethanol, C = 10 µM H<sub>2</sub>O<sub>2</sub>, D = 300 mM NaCl, all added to M9+0.2% glucose defined medium. Dynamics in the combined condition (blue curve) are well-described by the best fit linear superposition of individual condition dynamics (black curve). Error bars are standard error between 4 independent experiments on different days.</p

    The prediction of dynamics in complex conditions based on pairs of conditions has an error that is smaller than the average expected error.

    No full text
    <p>(a) Cumulative histogram of prediction errors of all 4 triplet combinations (in blue) and of the average errors of all other measured conditions (in red) (b) same for the quadruplet.</p
    corecore