6 research outputs found
Network analysis of metabolic enzyme evolution in Escherichia coli
BACKGROUND: The two most common models for the evolution of metabolism are the patchwork evolution model, where enzymes are thought to diverge from broad to narrow substrate specificity, and the retrograde evolution model, according to which enzymes evolve in response to substrate depletion. Analysis of the distribution of homologous enzyme pairs in the metabolic network can shed light on the respective importance of the two models. We here investigate the evolution of the metabolism in E. coli viewed as a single network using EcoCyc. RESULTS: Sequence comparison between all enzyme pairs was performed and the minimal path length (MPL) between all enzyme pairs was determined. We find a strong over-representation of homologous enzymes at MPL 1. We show that the functionally similar and functionally undetermined enzyme pairs are responsible for most of the over-representation of homologous enzyme pairs at MPL 1. CONCLUSIONS: The retrograde evolution model predicts that homologous enzymes pairs are at short metabolic distances from each other. In general agreement with previous studies we find that homologous enzymes occur close to each other in the network more often than expected by chance, which lends some support to the retrograde evolution model. However, we show that the homologous enzyme pairs which may have evolved through retrograde evolution, namely the pairs that are functionally dissimilar, show a weaker over-representation at MPL 1 than the functionally similar enzyme pairs. Our study indicates that, while the retrograde evolution model may have played a small part, the patchwork evolution model is the predominant process of metabolic enzyme evolution
Horizontal and vertical growth of S. cerevisiae metabolic network
<p>Abstract</p> <p>Background</p> <p>The growth and development of a biological organism is reflected by its metabolic network, the evolution of which relies on the essential gene duplication mechanism. There are two current views about the evolution of metabolic networks. The retrograde model hypothesizes that a pathway evolves by recruiting novel enzymes in a direction opposite to the metabolic flow. The patchwork model is instead based on the assumption that the evolution is based on the exploitation of broad-specificity enzymes capable of catalysing a variety of metabolic reactions.</p> <p>Results</p> <p>We analysed a well-studied unicellular eukaryotic organism, <it>S. cerevisiae</it>, and studied the effect of the removal of paralogous gene products on its metabolic network. Our results, obtained using different paralog and network definitions, show that, after an initial period when gene duplication was indeed instrumental in expanding the metabolic space, the latter reached an equilibrium and subsequent gene duplications were used as a source of more specialized enzymes rather than as a source of novel reactions. We also show that the switch between the two evolutionary strategies in <it>S. cerevisiae </it>can be dated to about 350 million years ago.</p> <p>Conclusions</p> <p>Our data, obtained through a novel analysis methodology, strongly supports the hypothesis that the patchwork model better explains the more recent evolution of the <it>S. cerevisiae </it>metabolic network. Interestingly, the effects of a patchwork strategy acting before the Euascomycete-Hemiascomycete divergence are still detectable today.</p
Evolutionarily Conserved Substrate Substructures for Automated Annotation of Enzyme Superfamilies
The evolution of enzymes affects how well a species can adapt to new environmental conditions. During enzyme evolution, certain aspects of molecular function are conserved while other aspects can vary. Aspects of function that are more difficult to change or that need to be reused in multiple contexts are often conserved, while those that vary may indicate functions that are more easily changed or that are no longer required. In analogy to the study of conservation patterns in enzyme sequences and structures, we have examined the patterns of conservation and variation in enzyme function by analyzing graph isomorphisms among enzyme substrates of a large number of enzyme superfamilies. This systematic analysis of substrate substructures establishes the conservation patterns that typify individual superfamilies. Specifically, we determined the chemical substructures that are conserved among all known substrates of a superfamily and the substructures that are reacting in these substrates and then examined the relationship between the two. Across the 42 superfamilies that were analyzed, substantial variation was found in how much of the conserved substructure is reacting, suggesting that superfamilies may not be easily grouped into discrete and separable categories. Instead, our results suggest that many superfamilies may need to be treated individually for analyses of evolution, function prediction, and guiding enzyme engineering strategies. Annotating superfamilies with these conserved and reacting substructure patterns provides information that is orthogonal to information provided by studies of conservation in superfamily sequences and structures, thereby improving the precision with which we can predict the functions of enzymes of unknown function and direct studies in enzyme engineering. Because the method is automated, it is suitable for large-scale characterization and comparison of fundamental functional capabilities of both characterized and uncharacterized enzyme superfamilies
MI-NODES multiscale models of metabolic reactions, brain connectome, ecological, epidemic, world trade, and legal-social networks
[Abstract] Complex systems and networks appear in almost all areas of reality. We find then from proteins residue networks to Protein Interaction Networks (PINs). Chemical reactions form Metabolic Reactions Networks (MRNs) in living beings or Atmospheric reaction networks in planets and moons. Network of neurons appear in the worm C. elegans, in Human brain connectome, or in Artificial Neural Networks (ANNs). Infection spreading networks exist for contagious outbreaks networks in humans and in malware epidemiology for infection with viral software in internet or wireless networks. Social-legal networks with different rules evolved from swarm intelligence, to hunter-gathered societies, or citation networks of U.S. Supreme Court. In all these cases, we can see the same question. Can we predict the links based on structural information? We propose to solve the problem using Quantitative Structure-Property Relationship (QSPR) techniques commonly used in chemo-informatics. In so doing, we need software able to transform all types of networks/graphs like drug structure, drug-target interactions, protein structure, protein interactions, metabolic reactions, brain connectome, or social networks into numerical parameters. Consequently, we need to process in alignment-free mode multitarget, multiscale, and multiplexing, information. Later, we have to seek the QSPR model with Machine Learning techniques. MI-NODES is this type of software. Here we review the evolution of the software from chemoinformatics to bioinformatics and systems biology. This is an effort to develop a universal tool to study structure-property relationships in complex systems
Genome-scale metabolic modeling of cyanbacteria: network structure, interactions, reconstruction and dynamics
2016 Fall.Includes bibliographical references.Metabolic network modeling, a field of systems biology and bioengineering, enhances the quantitative predictive understanding of cellular metabolism and thereby assists in the development of model-guided metabolic engineering strategies. Metabolic models use genome-scale network reconstructions, and combine it with mathematical methods for quantitative prediction. Metabolic system reconstructions, contain information on genes, enzymes, reactions, and metabolites, and are converted into two types of networks: (i) gene-enzyme-reaction, and (ii) reaction-metabolite. The former details the links between the genes that are known to code for metabolic enzymes, and the reaction pathways that the enzymes participate in. The latter details the chemical transformation of metabolites, step by step, into biomass and energy. The latter network is transformed into a system of equations and simulated using different methods. Prominent among these are constraint-based methods, especially Flux Balance Analysis, which utilizes linear programming tools to predict intracellular fluxes of single cells. Over the past 25 years, metabolic network modeling has had a range of applications in the fields of model-driven discovery, prediction of cellular phenotypes, analysis of biological network properties, multi-species interactions, engineering of microbes for product synthesis, and studying evolutionary processes. This thesis is concerned with the development and application of metabolic network modeling to cyanobacteria as well as E. coli. Chapter 1 is a brief survey of the past, present, and future of constraint-based modeling using flux balance analysis in systems biology. It includes discussion of (i) formulation, (ii) assumption, (iii) variety, (iv) availability, and (v) future directions in the field of constraint based modeling. Chapter 2, explores the enzyme-reaction networks of metabolic reconstructions belonging to various organisms; and finds that the distribution of the number of reactions an enzyme participates in, i.e. the enzyme-reaction distribution, is surprisingly similar. The role of this distribution in the robustness of the organism is also explored. Chapter 3, applies flux balance analysis on models of E. coli, Synechocystis sp. PCC6803, and C. reinhardtii to understand epistatic interactions between metabolic genes and pathways. We show that epistatic interactions are dependent on the environmental conditions, i.e. carbon source, carbon/oxygen ratio in E. coli, and light intensity in Synechocystis sp. PCC6803 and C. reinhardtii. Cyanobacteria are photosynthetic organisms and have great potential for metabolic engineering to produce commercially important chemicals such as biofuels, pharmaceuticals, and nutraceuticals. Chapter 4 presents our new genome scale reconstruction of the model cyanobacterium, Synechocystis sp. PCC6803, called iCJ816. This reconstruction was analyzed and compared to experimental studies, and used for predicting the capacity of the organism for (i) carbon dioxide remediation, and (ii) production of intracellular chemical species. Chapter 5 uses our new model iCJ816 for dynamic analysis under diurnal growth simulations. We discuss predictions of different optimization schemes, and present a scheme that qualitatively matches observations
Herramientas informáticas y de inteligencia artificial para el meta-análisis en la frontera entre la bioinformática y las ciencias jurídicas
[Resumen] Los modelos computacionales, conocidos por su acrónimo en idioma
Inglés como QSPR (Quantitative Structure-Property Relationships) pueden
usarse para predecir propiedades de sistemas complejos. Estas predicciones
representan una aplicación importante de las Tecnologías de la Información
y la Comunicación (TICs). La mayor relevancia es debido a la reducción de
costes de medición experimental en términos de tiempo, recursos humanos,
recursos materiales, y/o el uso de animales de laboratorio en ciencias biomoleculares,
técnicas, sociales y/o jurídicas.
Las Redes Neuronales Artificiales (ANNs) son una de las herramientas
informáticas más poderosas para buscar modelos QSPR. Para ello, las
ANNs pueden usar como variables de entrada (input) parámetros
numéricos que cuantifiquen información sobre la estructura del sistema.
Los parámetros conocidos como Índices Topológicos (TIs) se encuentran
entre los más versátiles.
Los TIs se calculan en Teoría de Grafos a partir de la representación de
cualquier sistema como una red de nodos interconectados; desde moléculas
a redes biológicas, tecnológicas, y sociales. Esta tesis tiene como primer
objetivo realizar una revisión y/o introducir nuevos TIs y software de
cálculo de TIs útiles como inputs de ANNs para el desarrollo de modelos
QSPR de redes bio-moleculares, biológicas, tecnológico-económicas y
socio-jurídicas. En ellas, por una parte, los nodos representan biomoléculas,
organismos, poblaciones, leyes tributarias o concausas de
delitos. Por otra parte, en la interacción TICs-Ciencias Biomoleculares-
Derecho se hace necesario un marco de seguridad jurídica que permita el
adecuado desarrollo de las TICs y sus aplicaciones en Ciencias Biomoleculares.
Por eso, el segundo objetivo de esta tesis es revisar el marco
jurídico-legal de protección de los modelos QSAR/QSPR de sistemas
moleculares.
El presente trabajo de investigación pretende demostrar la utilidad de
estos modelos para predecir características y propiedades de estos sistemas
complejos.[Resumo] Os modelos de ordenador coñecidos pola súas iniciais en inglés QSPR
(Quantitative Structure-Property Relationships) poden prever as
propiedades de sistemas complexos e reducir os custos experimentais en
termos de tempo, recursos humanos, materiais e/ou o uso de animais de
laboratorio nas ciencias biomoleculares, técnicas, e sociais.
As Redes Neurais Artificiais (ANNs) son unha das ferramentas máis
poderosas para buscar modelos QSPR. Para iso, as ANNs poden facer uso,
coma variables de entrada (input), dos parámetros numéricos da estrutura
do sistema chamados Índices Topolóxicos (TIs).
Os TI calcúlanse na teoría dos grafos a partir da representación do sistema
coma unha rede de nós conectados, incluíndo tanto moléculas coma redes
sociais e tecnolóxicas. Esta tese ten como obxectivo principal revisar e/ou
desenvolver novos TIs, programas de cálculo de TIs, e/ou modelos QSPR
facendo uso de ANNs para predicir redes bio-moleculares, biolóxicas,
económicas, e sociais ou xurídicas onde os nós representan moléculas
biolóxicas, organismos, poboacións, ou as leis fiscais ou as concausas dun
delito. Ademais, a interacción das TIC con as ciencias biolóxicas e
xurídicas necesita dun marco de seguridade xurídica que permita o bo
desenvolvemento das TIC e as súas aplicacións en Ciencias
Biomoleculares. Polo tanto, o segundo obxectivo desta tese é analizar o
marco xurídico e legal de protección dos modelos QSPR.
O presente traballo de investigación pretende demostrar a utilidade destes
modelos para predicir características e propiedades destes sistemas
complexos.[Abstract] QSPR (Quantitative Structure-Property Relationships) computer models
can predict properties of complex systems reducing experimental costs in
terms of time, human resources, material resources, and/or the use of
laboratory animals in bio-molecular, technical, and/or social sciences.
Artificial Neural Networks (ANNs) are one of the most powerful tools to
search QSPR models. For this, the ANNs may use as input variables
numerical parameters of the system structure called Topological Indices
(TIs).
The TIs are calculated in Graph Theory from a representation of any
system as a network of interconnected nodes, including molecules or social
and technological networks. The first aim of this thesis is to review and/or
develop new TIs, TIs calculation software, and QSPR models using ANNs
to predict bio-molecular, biological, commercial, social, and legal networks
where nodes represent bio-molecules, organisms, populations, products, tax
laws, or criminal causes. Moreover, the interaction of ICTs with
Biomolecular and law Sciences needs a legal security framework that
allows the proper development of ICTs and their applications in Biomolecular
Sciences. Therefore, the second objective of this thesis is to
review the legal framework and legal protection of QSPR techniques.
The present work of investigation tries to demonstrate the usefulness of
these models to predict characteristics and properties of these complex
systems