40 research outputs found

    Quantitative Model Refinement as a Solution to the Combinatorial Size Explosion of Biomodels

    Get PDF
    AbstractBuilding a large system through a systematic, step-by-step refinement of an initial abstract specification is a well established technique in software engineering, not yet much explored in systems biology. In the case of systems biology, one starts from an abstract, high-level model of a biological system and aims to add more and more details about its reactants and/or reactions, through a number of consecutive refinement steps. The refinement should be done in a quantitatively correct way, so that (some of) the numerical properties of the model (such as the experimental fit and validation) are preserved. In this study, we focus on the data-refinement mechanism where the aim is to increase the level of details of some of the reactants of a given model. That is, we analyse the case when a model is refined by substituting a given species by several types of subspecies. We show in this paper how the refined model can be systematically obtained from the original one. As a case study for this methodology we choose a recently introduced model for the eukaryotic heat shock response, [I. Petre, A. Mizera, C. L. Hyder, A. Meinander, A. Mikhailov, R.I. Morimoto, L. Sistonen, J. E. Eriksson, R.-J. Back, A simple mass-action model for the eukaryotic heat shock response and its mathematical validation, Natural Computing, 10(1), 595–612, 2011.]. We refine this model by including details about the acetylation of the heat shock factors and its influence on the heat shock response. The refined model has a significantly higher number of kinetic parameters and variables. However, we show that our methodology allows us to preserve the experimental fit/validation of the model with minimal computational effort

    A study on systems modeling frameworks and their interoperability

    Get PDF
    'What is in a model?' is a question that any systems modeler has asked at least once when working with a biological system. To answer this question we must understand how every species and reaction in a model are biologically related to each other, which in the case of cellular systems requires an understanding of local protein and gene interactions at the domain level. Typical modeling approaches like reaction-networks do not explicitly encode this information, which can obfuscate the assumptions made by a model and limit their further analysis, comparison and re-use. Additionally, the multi-state nature of the elements in a model can lead to a combinatorial explosion of the interactions in the model, pushing reaction-networks to their limits. Rule-based modeling is a modeling paradigm that addresses these issues through the use of a graph-based representation of each species's interaction domains and their local interactions. In this paradigm, reaction \emph{rules} describe a class of reactions and are represented as graph rewriting operations that encode biological processes like bond formation and post-translational modification. This allows for a concise, explicit encoding of how species interact, modify and are related to each other while at the same time managing the enormous number of potential molecular interactions in a multi-state system. In this work I present two different approaches that aim to bring the benefits of rule-based modeling's graph representation to other modeling paradigms: \emph{Atomizer} and \emph{MCell-R}. Atomizer is an algorithm for the extraction of structural and process information from reaction-network models, which I use to encode the model in a rule-based format. Once we have an explicit understanding of what is in a model and how entities in a model interact, we can then perform meta-modeling operations like model analysis, alignment, comparison and visualization, which I demonstrate through the application of Atomizer to a large dataset of reaction network models. MCell-R is a framework for the efficient modeling and simulation of multi-state, multi-component spatial systems. The framework consists of an integration of the NFsim rule-based simulation engine together with the MCell spatial modeling system, which highlights the utility of bringing rule-based paradigms into reaction based platforms

    Scalable reaction network modeling with automatic validation of consistency in Event-B

    Get PDF
    Constructing a large biological model is a difficult, error-prone process. Small errors in writing a part of the model cascade to the system level and their sources are difficult to trace back. In this paper we extend a recent approach based on Event-B, a state-based formal method with refinement as its central ingredient, allowing us to validate for model consistency step-by-step in an automated way. We demonstrate this approach on a model of the heat shock response in eukaryotes and its scalability on a model of the ErbB signaling pathway. All consistency properties of the model were proved automatically with computer support.</p

    Graph matching, theory and SAT implementation

    Get PDF
    International audienceIn systems biology, large biochemical reaction networks can either be represented by bipartitegraphs or by systems of ordinary differential equations. Modelers want to determine the existenceof reductions between those reaction networks. Because, it is not possible to decide this existencewith equation systems, a previous thesis [1] focussed on graph structures. A subgraph epimorphism(SEPI) framework was developed and gave results close to biologists’ expectations.Three main difficulties of the SEPI framework have been identified. First, establishing whethertwo models are linked through a SEPI is complex and computationally expensive. Second, thenumber of SEPIs found can be huge, making the analysis of SEPI sets between two given graphsvery difficult for biologists. Finally, some existing SEPIs do not have a biological interpretation.This diploma thesis led to three combined ways to improve the framework. One way consisted toredefine the decision problem into an optimisation problem to restrict the set of solutions. Asecond way was to determine, together with biologists, restrictions on one of the framework’soperations in order to filter irrelevant reductions. Lastly, a preprocessing step has been introduced,consisting of rewriting graphs according to subgraph isomorphism relations. The impact ofthese three combined implementations has been evaluated on models of the BioModels database.Results demonstrated that it contributed to make the SEPI framework more relevant, efficientand functional

    Toward Accessible Multilevel Modeling in Systems Biology: A Rule-based Language Concept

    Get PDF
    Promoted by advanced experimental techniques for obtaining high-quality data and the steadily accumulating knowledge about the complexity of life, modeling biological systems at multiple interrelated levels of organization attracts more and more attention recently. Current approaches for modeling multilevel systems typically lack an accessible formal modeling language or have major limitations with respect to expressiveness. The aim of this thesis is to provide a comprehensive discussion on associated problems and needs and to propose a concrete solution addressing them

    Toward Multiscale Models of Cyanobacterial Growth

    Get PDF
    Oxygenic photosynthesis dominates global primary productivity ever since its evolution more than three billion years ago. While many aspects of phototrophic growth are well understood, it remains a considerable challenge to elucidate the manifold dependencies and interconnections between the diverse cellular processes that together facilitate the synthesis of new cells. Phototrophic growth involves the coordinated action of several layers of cellular functioning, ranging from the photosynthetic light reactions and the electron transport chain, to carbon-concentrating mechanisms and the assimilation of inorganic carbon. It requires the synthesis of new building blocks by cellular metabolism, protection against excessive light, as well as diurnal regulation by a circadian clock and the orchestration of gene expression and cell division. Computational modeling allows us to quantitatively describe these cellular functions and processes relevant for phototrophic growth. As yet, however, computational models are mostly confined to the inner workings of individual cellular processes, rather than describing the manifold interactions between them in the context of a living cell. Using cyanobacteria as model organisms, this contribution seeks to summarize existing computational models that are relevant to describe phototrophic growth and seeks to outline their interactions and dependencies. Our ultimate aim is to understand cellular functioning and growth as the outcome of a coordinated operation of diverse yet interconnected cellular processes.Peer Reviewe

    Infobiotics : computer-aided synthetic systems biology

    Get PDF
    Until very recently Systems Biology has, despite its stated goals, been too reductive in terms of the models being constructed and the methods used have been, on the one hand, unsuited for large scale adoption or integration of knowledge across scales, and on the other hand, too fragmented. The thesis of this dissertation is that better computational languages and seamlessly integrated tools are required by systems and synthetic biologists to enable them to meet the significant challenges involved in understanding life as it is, and by designing, modelling and manufacturing novel organisms, to understand life as it could be. We call this goal, where everything necessary to conduct model-driven investigations of cellular circuitry and emergent effects in populations of cells is available without significant context-switching, “one-pot” in silico synthetic systems biology in analogy to “one-pot” chemistry and “one-pot” biology. Our strategy is to increase the understandability and reusability of models and experiments, thereby avoiding unnecessary duplication of effort, with practical gains in the efficiency of delivering usable prototype models and systems. Key to this endeavour are graphical interfaces that assists novice users by hiding complexity of the underlying tools and limiting choices to only what is appropriate and useful, thus ensuring that the results of in silico experiments are consistent, comparable and reproducible. This dissertation describes the conception, software engineering and use of two novel software platforms for systems and synthetic biology: the Infobiotics Workbench for modelling, in silico experimentation and analysis of multi-cellular biological systems; and DNA Library Designer with the DNALD language for the compact programmatic specification of combinatorial DNA libraries, as the first stage of a DNA synthesis pipeline, enabling methodical exploration biological problem spaces. Infobiotics models are formalised as Lattice Population P systems, a novel framework for the specification of spatially-discrete and multi-compartmental rule-based models, imbued with a stochastic execution semantics. This framework was developed to meet the needs of real systems biology problems: hormone transport and signalling in the root of Arabidopsis thaliana, and quorum sensing in the pathogenic bacterium Pseudomonas aeruginosa. Our tools have also been used to prototype a novel synthetic biological system for pattern formation, that has been successfully implemented in vitro. Taken together these novel software platforms provide a complete toolchain, from design to wet-lab implementation, of synthetic biological circuits, enabling a step change in the scale of biological investigations that is orders of magnitude greater than could previously be performed in one in silico “pot”

    Infobiotics : computer-aided synthetic systems biology

    Get PDF
    Until very recently Systems Biology has, despite its stated goals, been too reductive in terms of the models being constructed and the methods used have been, on the one hand, unsuited for large scale adoption or integration of knowledge across scales, and on the other hand, too fragmented. The thesis of this dissertation is that better computational languages and seamlessly integrated tools are required by systems and synthetic biologists to enable them to meet the significant challenges involved in understanding life as it is, and by designing, modelling and manufacturing novel organisms, to understand life as it could be. We call this goal, where everything necessary to conduct model-driven investigations of cellular circuitry and emergent effects in populations of cells is available without significant context-switching, “one-pot” in silico synthetic systems biology in analogy to “one-pot” chemistry and “one-pot” biology. Our strategy is to increase the understandability and reusability of models and experiments, thereby avoiding unnecessary duplication of effort, with practical gains in the efficiency of delivering usable prototype models and systems. Key to this endeavour are graphical interfaces that assists novice users by hiding complexity of the underlying tools and limiting choices to only what is appropriate and useful, thus ensuring that the results of in silico experiments are consistent, comparable and reproducible. This dissertation describes the conception, software engineering and use of two novel software platforms for systems and synthetic biology: the Infobiotics Workbench for modelling, in silico experimentation and analysis of multi-cellular biological systems; and DNA Library Designer with the DNALD language for the compact programmatic specification of combinatorial DNA libraries, as the first stage of a DNA synthesis pipeline, enabling methodical exploration biological problem spaces. Infobiotics models are formalised as Lattice Population P systems, a novel framework for the specification of spatially-discrete and multi-compartmental rule-based models, imbued with a stochastic execution semantics. This framework was developed to meet the needs of real systems biology problems: hormone transport and signalling in the root of Arabidopsis thaliana, and quorum sensing in the pathogenic bacterium Pseudomonas aeruginosa. Our tools have also been used to prototype a novel synthetic biological system for pattern formation, that has been successfully implemented in vitro. Taken together these novel software platforms provide a complete toolchain, from design to wet-lab implementation, of synthetic biological circuits, enabling a step change in the scale of biological investigations that is orders of magnitude greater than could previously be performed in one in silico “pot”

    Novel modeling formalisms and simulation tools in computational biosystems

    Get PDF
    Tese de doutoramento em BioengenhariaThe goal of Systems Biology is to understand the complex behavior that emerges from the interaction among the cellular components. Industrial biotechnology is one of the areas of application, where new approaches for metabolic engineering are developed, through the creation of new models and tools for simulation and optimization of the microbial metabolism. Although whole-cell modeling is one of the goals of Systems Biology, so far most models address only one kind of biological network independently. This work explores the integration of di erent kinds of biological networks with a focus on the improvement of simulation of cellular metabolism. The bacterium Escherichia coli is the most well characterized model organism and is used as our case-study. An extensive review of modeling formalisms that have been used in Systems Biology is presented in this work. It includes several formalisms, including Boolean networks, Bayesian networks, Petri nets, process algebras, constraint-based models, di erential equations, rule-based models, interacting state machines, cellular automata and agent-based models. We compare the features provided by these formalisms and classify the most suitable ones for the creation of a common framework for modeling, analysis and simulation of integrated biological networks. Currently, there is a separation between dynamic and constraint-based modeling of metabolism. Dynamic models are based on detailed kinetic reconstructions of central metabolic pathways, whereas constraint-based models are based on genome-scale stoichiometric reconstructions. Here, we explore the gap between both formulations and evaluate how dynamic models can be used to reduce the solution space of constraint-based models in order to eliminate kinetically infeasible solutions. The limitations of both kinds of models are leading to new approaches to build kinetic models at the genome-scale. The generation of kinetic models from stoichiometric reconstructions can be performed within the same framework as a transformation from discrete to continuous Petri nets. However, the size of these networks results in models with a large number of parameters. In this scope, we develop and implement structural reduction methods that adjust the level of detail of metabolic networks without loss of information, which can be applied prior to the kinetic inference to build dynamic models with a smaller number of parameters. In order to account for enzymatic regulation, which is not present in constraint-based models, we propose the utilization of Extended Petri nets. This results in a better sca old for the kinetic inference process. We evaluate the impact of accounting for enzymatic regulation in the simulation of the steady-state phenotype of mutant strains by performing knockouts and adjustment of enzyme expression levels. It can be observed that in some cases the impact is signi cant and may reveal new targets for rational strain design. In summary, we have created a solid framework with a common formalism and methods for metabolic modeling. This will facilitate the integration with gene regulatory networks, as we have already addressed many issues also associated with these networks, such as the trade-o between size and detail, and the representation of regulatory interactions.O objectivo da Biologia de Sistemas é compreender os comportamentos que resultam das complexas interacções entre todos os componentes celulares. A biotecnologia industrial é uma das áreas de aplicação, onde novas abordagens para a engenharia metabólica são desenvolvidas através da criação de novos modelos e ferramentas de simulação e optimização do metabolismo microbiano. Apesar de um dos principais objectivos da Biologia de Sistemas ser a criação de um modelo completo de uma célula, até ao momento a maioria dos modelos desenvolvidos incorpora de forma separada cada tipo de rede biológica. Este trabalho explora a integração de diferentes tipos de redes biológicas, focando melhorar a simulação do metabolismo celular. A bactéria Escherichia coli é o organismo modelo que estáa melhor caracterizado e é usado como caso de estudo. Neste trabalho é elaborada uma extensa revisão dos formalismos de modela ção que têm sido utilizados em Biologia de Sistemas. São considerados vários formalismos tais como, redes Booleanas, redes Bayesianas, redes de Petri, álgebras de processos, modelos baseados em restrições, equações diferenciais, modelos baseados em regras, máquinas de interacção de estados, autómatos celulares e modelos baseados em agentes. As funcionalidades inerentes a estes formalismos são analisadas de forma a classificar os mesmos pelo seu potencial em servir de base à criação de uma plataforma para modela ção, análise e simulação de redes biológicas integradas. Actualmente, existe uma separação entre modelação dinâmica e modelação baseada em restrições para o metabolismo celular. Os modelos dinâmicos consistem em reconstruções cinéticas detalhadas de vias centrais do metabolismo, enquanto que os modelos baseados em restrições são construídos à escala genómica com base apenas na estequiometria das reacçõoes. Neste trabalho explora-se a separação entre os dois tipos de formulação e é avaliada a forma como os modelos dinâmicos podem ser utilizados para reduzir o espaço de soluções de modelos baseados em restrições de forma a eliminar soluções inalcançáveis. As limitações impostas por ambos os tipos de modelos estão a conduzir à criação de novas abordagens para a construção de modelos cinéticos à escala genómica. A geração de modelos cinéticos a partir de reconstruções estequiométricas pode ser feita dentro de um mesmo formalismo através da transformação de redes de Petri discretas em redes de Petri contínuas. No entanto, devido ao tamanho destas redes, os modelos resultantes incluem um número extremamente grande de parâmetros. Neste trabalho são implementados métodos para a redução estrutural de redes metabólicas sem perda de informação, que permitem ajustar o nível de detalhe das redes. Estes métodos podem ser aplicados à inferência de cinéticas, de forma a gerar modelos dinâmicos com um menor número de parâmetros. De forma a considerar efeitos de regulação enzimática, os quais não são representados em modelos baseados em restrições, propõe-se a utilização de redes de Petri complementadas com arcos regulatórios. Este formalismo é utilizado como base para o processo de inferência cinética. A influência da regulação enzimática na determinação do estado estacionário de estirpes mutantes é avaliada através da análise da remoção de reacções e da variação dos níveis de expressão enzimática. Observa-se que em alguns casos esta influência é significativa e pode ser utilizada para obter novas estratégias de manipulação de estirpes. Em suma, neste trabalho foi criada uma plataforma sólida para modelação do metabolismo baseada num formalismo comum. Esta plataforma facilitará a integração com redes de regulação genética, pois foram abordados vários problemas que também se colocam nestas redes, tais como o ajuste entre o tamanho da rede e o seu nível de detalhe, bem como a representação de interacções regulatórias entre componentes da rede
    corecore