71 research outputs found
Evaluating pathway enumeration algorithms in metabolic engineering case studies
The design of cell factories for the production of compounds involves the search for suitable heterologous pathways. Different strategies have been proposed to infer such pathways, but most are optimization approaches with specific objective functions, not suited to enumerating multiple pathways. In this work, we analyze two pathway enumeration algorithms based on graph representations: the Solution Structure Generation and the Find Path algorithms. Both are capable of exhaustively enumerating multiple pathways using network topology. We study their capabilities and limitations when designing novel heterologous pathways, by applying these methods to two case studies of synthetic metabolic engineering related to the production of butanol and vanillin.
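To make "enumerating multiple pathways using network topology" concrete, here is a minimal sketch. It uses a plain depth-first search over a metabolite graph, which is not the Solution Structure Generation or Find Path algorithm from the paper. The network, the metabolite names `A` to `D`, and the function `enumerate_pathways` are hypothetical.

```python
from typing import Dict, List


def enumerate_pathways(
    graph: Dict[str, List[str]], source: str, target: str, max_steps: int
) -> List[List[str]]:
    """Return every simple path from source to target using at most max_steps reactions."""
    found: List[List[str]] = []

    def extend(path: List[str]) -> None:
        node = path[-1]
        if node == target:
            found.append(list(path))
            return
        if len(path) - 1 >= max_steps:
            return  # step budget exhausted before reaching the target
        for nxt in graph.get(node, []):
            if nxt not in path:  # never revisit a metabolite, so paths stay simple
                path.append(nxt)
                extend(path)
                path.pop()

    extend([source])
    return found


# Hypothetical toy network: metabolite -> metabolites reachable in one reaction.
toy_network = {
    "A": ["B", "C"],
    "B": ["D"],
    "C": ["B", "D"],
    "D": [],
}

pathways = enumerate_pathways(toy_network, "A", "D", max_steps=3)
for p in pathways:
    print(" -> ".join(p))
```

Unlike an optimization approach, which returns one best pathway for a given objective, this kind of search returns every route within the step limit. The output also grows quickly with network size, which is one reason such methods need careful evaluation.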
Computationally efficient flux variability analysis
Background:
Flux variability analysis is often used to determine the robustness of metabolic models under various simulation conditions. However, its use has been somewhat limited by its long computation time compared to other constraint-based modeling methods.
Results:
We present an open source implementation of flux variability analysis called fastFVA. This efficient implementation makes large-scale flux variability analysis feasible and tractable, allowing more complex biological questions regarding network flexibility and robustness to be addressed.
Conclusions:
Networks involving thousands of biochemical reactions can be analyzed within seconds, greatly expanding the utility of flux variability analysis in systems biology.
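For orientation, flux variability analysis is commonly stated as a pair of linear programs per reaction. The formulation below is the standard textbook form and is not quoted from the abstract. The symbols are assumptions introduced here: S is the stoichiometric matrix, v the flux vector, l and u the flux bounds, c the objective vector, Z_0 the optimal objective value from flux balance analysis, and γ in [0, 1] the fraction of that optimum that must be retained.

```latex
\begin{aligned}
v_i^{\max} \;/\; v_i^{\min} \;=\; \max_{v} \;/\; \min_{v} \quad & v_i \\
\text{subject to} \quad & S v = 0, \\
& l \le v \le u, \\
& c^{\mathsf{T}} v \ge \gamma Z_0 .
\end{aligned}
```

A network with n reactions therefore needs 2n linear programs that share the same constraints, which is why computation time becomes the limiting factor for large models.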
OptForce: An Optimization Procedure for Identifying All Genetic Manipulations Leading to Targeted Overproductions
Computational procedures for predicting metabolic interventions leading to the overproduction of biochemicals in microbial strains are widely in use. However, these methods rely on surrogate biological objectives (e.g., maximize growth rate or minimize metabolic adjustments) and do not make use of flux measurements often available for the wild-type strain. In this work, we introduce the OptForce procedure that identifies all possible engineering interventions by classifying reactions in the metabolic model depending upon whether their flux values must increase, decrease or become equal to zero to meet a pre-specified overproduction target. We hierarchically apply this classification rule for pairs, triples, quadruples, etc. of reactions. This leads to the identification of a sufficient and non-redundant set of fluxes that must change (i.e., MUST set) to meet a pre-specified overproduction target. Starting with this set we subsequently extract a minimal set of fluxes that must actively be forced through genetic manipulations (i.e., FORCE set) to ensure that all fluxes in the network are consistent with the overproduction objective. We demonstrate our OptForce framework for succinate production in Escherichia coli using the most recent in silico E. coli model, iAF1260. The method not only recapitulates existing engineering strategies but also reveals non-intuitive ones that boost succinate production by performing coordinated changes on pathways distant from the last steps of succinate synthesis.
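The single-reaction classification described above can be sketched as a comparison of flux ranges. This is a simplified illustration, not the OptForce implementation: it assumes the flux ranges for the wild-type and overproducing networks have already been computed, and it omits the pair, triple and higher-order steps and the FORCE set extraction. The reaction names, flux values, labels and the function `classify_must` are hypothetical.

```python
from typing import Dict, Tuple

Range = Tuple[float, float]  # (minimum flux, maximum flux)


def classify_must(
    wild_type: Dict[str, Range],
    overproducing: Dict[str, Range],
    tol: float = 1e-9,
) -> Dict[str, str]:
    """Label each reaction by how its flux range must shift to reach the target."""
    labels: Dict[str, str] = {}
    for rxn, (wt_min, wt_max) in wild_type.items():
        op_min, op_max = overproducing[rxn]
        pinned_to_zero = abs(op_min) <= tol and abs(op_max) <= tol
        wt_excludes_zero = wt_min > tol or wt_max < -tol
        if pinned_to_zero and wt_excludes_zero:
            labels[rxn] = "must_be_zero"
        elif op_min > wt_max + tol:
            labels[rxn] = "must_increase"  # whole target range lies above wild type
        elif op_max < wt_min - tol:
            labels[rxn] = "must_decrease"  # whole target range lies below wild type
        else:
            labels[rxn] = "unclassified"  # ranges overlap, no forced change
    return labels


# Hypothetical flux ranges (arbitrary units) for four reactions.
wild_type = {"R1": (1.0, 2.0), "R2": (5.0, 8.0), "R3": (0.5, 1.5), "R4": (0.0, 3.0)}
overproducing = {"R1": (3.0, 4.0), "R2": (1.0, 4.0), "R3": (0.0, 0.0), "R4": (1.0, 2.0)}

labels = classify_must(wild_type, overproducing)
print(labels)
```

Reactions whose ranges overlap, such as `R4` here, carry no forced change on their own. That gap is what motivates repeating the comparison for combinations of reactions.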
Large-Scale Bi-Level Strain Design Approaches and Mixed-Integer Programming Solution Techniques
The use of computational models in metabolic engineering has been increasing as more genome-scale metabolic models and computational approaches become available. Various computational approaches have been developed to predict how genetic perturbations affect metabolic behavior at a systems level, and have been successfully used to engineer microbial strains with improved primary or secondary metabolite production. However, identification of metabolic engineering strategies involving a large number of perturbations is currently limited by computational resources due to the size of genome-scale models and the combinatorial nature of the problem. In this study, we present (i) two new bi-level strain design approaches using mixed-integer programming (MIP), and (ii) general solution techniques that improve the performance of MIP-based bi-level approaches. The first approach (SimOptStrain) simultaneously considers gene deletion and non-native reaction addition, while the second approach (BiMOMA) uses minimization of metabolic adjustment to predict knockout behavior in a MIP-based bi-level problem for the first time. Our general MIP solution techniques significantly reduced the CPU times needed to find optimal strategies when applied to an existing strain design approach (OptORF) (e.g., from ∼10 days to ∼5 minutes for metabolic engineering strategies with 4 gene deletions), and identified strategies for producing compounds where previous studies could not (e.g., malate and serine). Additionally, we found novel strategies using SimOptStrain with higher predicted production levels (for succinate and glycerol) than could have been found using an existing approach that considers network additions and deletions in sequential steps rather than simultaneously. Finally, using BiMOMA we found novel strategies involving large numbers of modifications (for pyruvate and glutamate), which sequential search and genetic algorithms were unable to find. 
The approaches and solution techniques developed here will facilitate the strain design process and extend the scope of its application to metabolic engineering.
Computational Design of Auxotrophy-Dependent Microbial Biosensors for Combinatorial Metabolic Engineering Experiments
Combinatorial approaches in metabolic engineering work by generating genetic diversity in a microbial population followed by screening for strains with improved phenotypes. One of the most common goals in this field is the generation of a strain that produces a chemical at a high rate. A major hurdle with this approach is that many chemicals do not have easy-to-recognize attributes, making their screening expensive and time-consuming. To address this problem, it was previously suggested to use microbial biosensors to facilitate the detection and quantification of chemicals of interest. Here, we present novel computational methods to: (i) rationally design microbial biosensors for chemicals of interest based on substrate auxotrophy that would enable their high-throughput screening; (ii) predict engineering strategies for coupling the synthesis of a chemical of interest with the production of a proxy metabolite for which high-throughput screening is possible via a designed biosensor. The biosensor design method is validated based on known genetic modifications in an array of E. coli strains auxotrophic to various amino acids. Predicted chemical production rates achievable via the biosensor-based approach are shown to potentially improve upon those predicted by current rational strain design approaches. (A Matlab implementation of the biosensor design method is available via http://www.cs.technion.ac.il/~tomersh/tools)
Identification of Functional Differences in Metabolic Networks Using Comparative Genomics and Constraint-Based Models
Genome-scale network reconstructions are useful tools for understanding cellular metabolism, and comparisons of such reconstructions can provide insight into metabolic differences between organisms. Recent efforts toward comparing genome-scale models have focused primarily on aligning metabolic networks at the reaction level and then looking at differences and similarities in reaction and gene content. However, these reaction comparison approaches are time-consuming and do not identify the effect network differences have on the functional states of the network. We have developed a bilevel mixed-integer programming approach, CONGA, to identify functional differences between metabolic networks by comparing network reconstructions aligned at the gene level. We first identify orthologous genes across two reconstructions and then use CONGA to identify conditions under which differences in gene content give rise to differences in metabolic capabilities. By seeking genes whose deletion in one or both models disproportionately changes flux through a selected reaction (e.g., growth or by-product secretion) in one model over another, we are able to identify structural metabolic network differences enabling unique metabolic capabilities. Using CONGA, we explore functional differences between two metabolic reconstructions of Escherichia coli and identify a set of reactions responsible for chemical production differences between the two models. We also use this approach to aid in the development of a genome-scale model of Synechococcus sp. PCC 7002. Finally, we propose potential antimicrobial targets in Mycobacterium tuberculosis and Staphylococcus aureus based on differences in their metabolic capabilities. Through these examples, we demonstrate that a gene-centric approach to comparing metabolic networks allows for a rapid comparison of metabolic models at a functional level. 
Using CONGA, we can identify differences in reaction and gene content which give rise to different functional predictions. Because CONGA provides a general framework, it can be applied to find functional differences across models and biological systems beyond those presented here.
Multi-objective optimization of genome-scale metabolic models: the case of ethanol production
Ethanol is among the largest fermentation products used worldwide, accounting for more than 90% of all biofuel produced in the last decade. However, current ethanol production methods are unable to meet increasing global demand because of low yields on glucose sources. In this work, we present an in silico multi-objective optimization and analysis of eight genome-scale metabolic networks for the overproduction of ethanol within the engineered cell. We introduce the MOME (multi-objective metabolic engineering) algorithm, which models both gene knockouts and enzyme up- and down-regulation using the Redirector framework. In a multi-step approach, MOME tackles the multi-objective optimization of biomass and ethanol production in the engineered strain, and performs genetic design and clustering analyses on the optimization results. We find in silico E. coli Pareto optimal strains with a knockout cost of 14, characterized by ethanol production of up to 19.74 mmol gDW⁻¹ h⁻¹ (+832.88% with respect to wild-type) and biomass production of 0.02 h⁻¹ (−98.06%). The analyses on E. coli highlighted a single knockout strategy producing 16.49 mmol gDW⁻¹ h⁻¹ (+679.29%) ethanol, with biomass equal to 0.23 h⁻¹ (−77.45%). We also discuss results obtained by applying MOME to metabolic models of: (i) S. aureus; (ii) S. enterica; (iii) Y. pestis; (iv) S. cerevisiae; (v) C. reinhardtii; (vi) Y. lipolytica. We finally present a set of simulations in which constraints over essential genes and minimum allowable biomass were included. A bound over the maximum allowable biomass was also added, along with other settings representing rich media compositions. Under the same conditions the maximum improvement in ethanol production is +195.24%.
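The trade-off between biomass and ethanol can be illustrated with a small Pareto filter. This sketch shows only the dominance test applied to finished designs, not the MOME search itself. Two of the points use the (biomass, ethanol) values quoted in the abstract (0.02, 19.74 and 0.23, 16.49); the other two are made-up points added so the filter has something to discard.

```python
from typing import List, Tuple

Point = Tuple[float, float]  # (biomass in 1/h, ethanol in mmol/gDW/h), both maximized


def dominates(a: Point, b: Point) -> bool:
    """True if a is at least as good as b in both objectives and strictly better in one."""
    return a[0] >= b[0] and a[1] >= b[1] and (a[0] > b[0] or a[1] > b[1])


def pareto_front(points: List[Point]) -> List[Point]:
    """Keep the points that no other point dominates, preserving input order."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


candidates: List[Point] = [
    (0.02, 19.74),  # values quoted in the abstract
    (0.23, 16.49),  # values quoted in the abstract
    (0.10, 15.00),  # hypothetical design, dominated by (0.23, 16.49)
    (0.01, 19.00),  # hypothetical design, dominated by (0.02, 19.74)
]

front = pareto_front(candidates)
print(front)
```

Neither of the two quoted designs dominates the other: one yields more ethanol, the other retains more growth. Choosing between them is a design decision rather than something the optimization settles.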
Flux-sum analysis: a metabolite-centric approach for understanding the metabolic network
Background:
Constraint-based flux analysis of a metabolic network model quantifies the reaction flux distribution to characterize the state of cellular metabolism. However, metabolites are key players in the metabolic network, and the current reaction-centric approach may not account for the effect of metabolite perturbation on cellular physiology because of an inherent limitation in the model formulation. Thus, it would be practical to incorporate metabolite states into the model for the analysis of the network.
Results:
Presented herein is a metabolite-centric approach to analyzing the metabolic network that includes the turnover rate of a metabolite, known as flux-sum, as a key descriptive variable within the model formulation. By doing so, the effect of varying metabolite flux-sum on physiological change can be simulated by resorting to mixed integer linear programming. From the results, we could classify various metabolite types based on the flux-sum profile. Using the iAF1260 in silico metabolic model of Escherichia coli, we demonstrated that this novel concept complements the conventional reaction-centric analysis.
Conclusions:
Metabolite flux-sum analysis elucidates the roles of metabolites in the network. In addition, this metabolite perturbation analysis identifies the key metabolites, implying practical applications, achievable through metabolite flux-sum manipulation, in biotechnology and biomedical research.
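As a rough illustration of the flux-sum quantity, the sketch below uses the commonly cited definition: half the sum of the absolute flux contributions of every reaction that produces or consumes a metabolite. The abstract does not state this formula, so treat it as an assumption. The two-metabolite network, the flux values and the function `flux_sums` are hypothetical.

```python
from typing import List


def flux_sums(S: List[List[float]], v: List[float]) -> List[float]:
    """Flux-sum per metabolite: 0.5 * sum_j |S[i][j] * v[j]|."""
    return [
        0.5 * sum(abs(s_ij * v_j) for s_ij, v_j in zip(row, v))
        for row in S
    ]


# Hypothetical network with three reactions:
#   R1: (uptake) -> A      R2: A -> 2 B      R3: B -> (export)
# Rows are metabolites A and B, columns are reactions R1..R3.
S = [
    [1.0, -1.0, 0.0],  # A: produced by R1, consumed by R2
    [0.0, 2.0, -1.0],  # B: produced by R2 (two units), consumed by R3
]
v = [1.0, 1.0, 2.0]  # a steady-state flux distribution, so S @ v == 0

# Net production of each metabolite, zero at steady state.
net = [sum(s_ij * v_j for s_ij, v_j in zip(row, v)) for row in S]
turnover = flux_sums(S, v)
print(net, turnover)
```

At steady state the net production of each metabolite is zero, so the flux-sum is the one quantity here that distinguishes a heavily used metabolite from an idle one. That is the metabolite-centric view the abstract contrasts with reaction-centric analysis.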
A Genome-Scale Metabolic Reconstruction of Mycoplasma genitalium, iPS189
With a genome size of ∼580 kb and approximately 480 protein-coding regions, Mycoplasma genitalium is one of the smallest known self-replicating organisms and, additionally, has extremely fastidious nutrient requirements. The reduced genomic content of M. genitalium has led researchers to suggest that the molecular assembly contained in this organism may be a close approximation to the minimal set of genes required for bacterial growth. Here, we introduce a systematic approach for the construction and curation of a genome-scale in silico metabolic model for M. genitalium. Key challenges included estimation of biomass composition, handling of enzymes with broad specificities, and the lack of a defined medium. Computational tools were subsequently employed to identify and resolve connectivity gaps in the model as well as growth prediction inconsistencies with gene essentiality experimental data. The curated model, M. genitalium iPS189 (262 reactions, 274 metabolites), is 87% accurate in recapitulating in vivo gene essentiality results for M. genitalium. Approaches and tools described herein provide a roadmap for the automated construction of in silico metabolic models of other organisms.
Inferring Carbon Sources from Gene Expression Profiles Using Metabolic Flux Models
Background:
Bacteria have evolved the ability to efficiently and resourcefully adapt to changing environments. A key means by which they optimize their use of available nutrients is through adjustments in gene expression with consequent changes in enzyme activity. We report a new method for drawing environmental inferences from gene expression data. Our method prioritizes a list of candidate carbon sources for their compatibility with a gene expression profile using the framework of flux balance analysis to model the organism’s metabolic network.
Principal Findings:
For each of six gene expression profiles for Escherichia coli grown under differing nutrient conditions, we applied our method to prioritize a set of eighteen different candidate carbon sources. Our method ranked the correct carbon source as one of the top three candidates for five of the six expression sets when used with a genome-scale model. The correct candidate ranked fifth in the remaining case. Additional analyses show that these rankings are robust with respect to biological and measurement variation, and depend on specific gene expression, rather than general expression level. The gene expression profiles are highly adaptive: simulated production of biomass averaged 94.84% of maximum when the in silico carbon source matched the in vitro source of the expression profile, and 65.97% when it did not.
Conclusions:
Inferences about a microorganism’s nutrient environment can be made by integrating gene expression data into a metabolic framework. This work demonstrates that reaction flux limits for a model can be computed which are realistic in the sense that they affect in silico growth in a manner analogous to that in which a microorganism’s alteration of gene expression is adaptive to its nutrient environment.
National Institute of Allergy and Infectious Diseases (U.S.) (grant HHSN 2722008000059C); National Institute of Allergy and Infectious Diseases (U.S.) (grant HHSN 26620040000IC); Bill & Melinda Gates Foundation (grant 18651010-37352-A