252 research outputs found
Minimum Information About a Simulation Experiment (MIASE)
Reproducibility of experiments is a basic requirement for science. Minimum Information (MI) guidelines have proved a helpful means of enabling reuse of existing work in modern biology. The Minimum Information Required in the Annotation of Models (MIRIAM) guidelines promote the exchange and reuse of biochemical computational models. However, information about a model alone is not sufficient to enable its efficient reuse in a computational setting. Advanced numerical algorithms and complex modeling workflows used in modern computational biology make reproduction of simulations difficult. It is therefore essential to define the core information necessary to perform simulations of those models. The Minimum Information About a Simulation Experiment (MIASE, Glossary in Box 1) describes the minimal set of information that must be provided to make the description of a simulation experiment available to others. It includes the list of models to use and their modifications, all the simulation procedures to apply and in which order, the processing of the raw numerical results, and the description of the final output. MIASE allows for the reproduction of any simulation experiment. The provision of this information, along with a set of required models, guarantees that the simulation experiment represents the intention of the original authors. Following MIASE guidelines will thus improve the quality of scientific reporting, and will also allow collaborative, more distributed efforts in computational modeling and simulation of biological processes
Ontologies for use in Systems Biology: SBO, KiSAO and TEDDY
The use of computational modelling in the description and analysis of biological systems is at the heart of Systems Biology. Besides the information stored in a core model, there is increasingly a need to provide additional semantic information: to identify model components, to assist in biological interpretation of models, to define simulation conditions and to describe simulation results. This information deficit can be addressed through the use of ontologies. We describe here three ontologies created specifically to address the needs of the Systems Biology community in each sub-division, and illustrate their practical use with the 'Repressilator' model (Elowitz and Leibler, 2000)
Annotation-based feature extraction from sets of SBML models
Background: Model repositories such as BioModels Database provide computational models of biological systems for the scientific community. These models contain rich semantic annotations that link model entities to concepts in well-established bio-ontologies such as Gene Ontology. Consequently, thematically similar models are likely to share similar annotations. Based on this assumption, we argue that semantic annotations are a suitable tool to characterize sets of models. These characteristics improve model classification, allow to identify additional features for model retrieval tasks, and enable the comparison of sets of models. Results: In this paper we discuss four methods for annotation-based feature extraction from model sets. We tested all methods on sets of models in SBML format which were composed from BioModels Database. To characterize each of these sets, we analyzed and extracted concepts from three frequently used ontologies, namely Gene Ontology, ChEBI and SBO. We find that three out of the methods are suitable to determine characteristic features for arbitrary sets of models: The selected features vary depending on the underlying model set, and they are also specific to the chosen model set. We show that the identified features map on concepts that are higher up in the hierarchy of the ontologies than the concepts used for model annotations. Our analysis also reveals that the information content of concepts in ontologies and their usage for model annotation do not correlate. Conclusions: Annotation-based feature extraction enables the comparison of model sets, as opposed to existing methods for model-to-keyword comparison, or model-to-model comparison
Annotation-based storage and retrieval of models and simulation descriptions in computational biology
This work aimed at enhancing reuse of computational biology models by identifying and formalizing relevant meta-information. One type of meta-information investigated in this thesis is experiment-related meta-information attached to a model, which is necessary to accurately recreate simulations. The main results are: a detailed concept for model annotation, a proposed format for the encoding of simulation experiment setups, a storage solution for standardized model representations and the development of a retrieval concept.Die vorliegende Arbeit widmete sich der besseren Wiederverwendung biologischer Simulationsmodelle. Ziele waren die Identifikation und Formalisierung relevanter Modell-Meta-Informationen, sowie die Entwicklung geeigneter Modellspeicherungs- und Modellretrieval-Konzepte.
Wichtigste Ergebnisse der Arbeit sind ein detailliertes Modellannotationskonzept, ein Formatvorschlag für standardisierte Kodierung von Simulationsexperimenten in XML, eine Speicherlösung für Modellrepräsentationen sowie ein Retrieval-Konzept
STON: exploring biological pathways using the SBGN standard and graph databases
BACKGROUND: When modeling in Systems Biology and Systems Medicine, the data is often extensive, complex and heterogeneous. Graphs are a natural way of representing biological networks. Graph databases enable efficient storage and processing of the encoded biological relationships. They furthermore support queries on the structure of biological networks. RESULTS: We present the Java-based framework STON (SBGN TO Neo4j). STON imports and translates metabolic, signalling and gene regulatory pathways represented in the Systems Biology Graphical Notation into a graph-oriented format compatible with the Neo4j graph database. CONCLUSION: STON exploits the power of graph databases to store and query complex biological pathways. This advances the possibility of: i) identifying subnetworks in a given pathway; ii) linking networks across different levels of granularity to address difficulties related to incomplete knowledge representation at single level; and iii) identifying common patterns between pathways in the database. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1394-x) contains supplementary material, which is available to authorized users
Reproducible computational biology experiments with SED-ML - The Simulation Experiment Description Markup Language
Background: The increasing use of computational simulation experiments to inform modern biological research creates new challenges to annotate, archive, share and reproduce such experiments. The recently published Minimum Information About a Simulation Experiment (MIASE) proposes a minimal set of information that should be provided to allow the reproduction of simulation experiments among users and software tools.
Results: In this article, we present the Simulation Experiment Description Markup Language (SED-ML). SED-ML encodes in a computer-readable exchange format the information required by MIASE to enable reproduction of simulation experiments. It has been developed as a community project and it is defined in a detailed technical specification and additionally provides an XML schema. The version of SED-ML described in this publication is Level 1 Version 1. It covers the description of the most frequent type of simulation experiments in the area, namely time course simulations. SED-ML documents specify which models to use in an experiment, modifications to apply on the models before using them, which simulation procedures to run on each model, what analysis results to output, and how the results should be presented. These descriptions are independent of the underlying model implementation. SED-ML is a software-independent format for encoding the description of simulation experiments; it is not specific to particular simulation tools. Here, we demonstrate that with the growing software support for SED-ML we can effectively exchange executable simulation descriptions.
Conclusions: With SED-ML, software can exchange simulation experiment descriptions, enabling the validation and reuse of simulation experiments in different tools. Authors of papers reporting simulation experiments can make their simulation protocols available for other scientists to reproduce the results. Because SED-ML is agnostic about exact modeling language(s) used, experiments covering models from different fields of research can be accurately described and combined
Specifications of Standards in Systems and Synthetic Biology: Status and Developments in 2016
Standards are essential to the advancement of science and technology. In systems and synthetic biology, numerous standards and associated tools have been developed over the last 16 years. This special issue of the Journal of Integrative Bioinformatics aims to support the exchange, distribution and archiving of these standards, as well as to provide centralised and easily citable access to them
Experiences From FAIRifying Community Data and FAIR Infrastructure in Biomedical Research Domains
FAIR data is considered good data. However, it can be difficult to quantify data FAIRness objectively, without appropriate tooling. To address this issue, FAIR metrics were developed in the early days of the FAIR era. However, to be truly informative, these metrics must be carefully interpreted in the context of a specific domain, and sometimes even of a project. Here, we share our experience with FAIR assessments and FAIRification processes in the biomedical domain. We aim to raise the awareness that “being FAIR” is not an easy goal, neither the principles are easily implemented. FAIR goes far beyond technical implementations: it requires time, expertise, communication and a shift in mindset. 
Combining computational models, semantic annotations, and 1 associated simulation experiments in a graph database PrePrints
Abstract 13 Model repositories such as the BioModels Database or the CellML Model Repository are 14 frequently accessed to retrieve computational models describing biological systems. However, 15 the current designs of these databases limit the types of supported queries, and many data 16 in these repositories cannot easily be accessed. Computational methods for model retrieval 17 cannot be applied. In this paper we present a storage concept that meets this challenge. It 18 grounds on a graph database, reects the models' structure, incorporates semantic annotations 19 and experiment descriptions, and ultimately connects dierent types of model-related data. 20 The connections between heterogeneous model-related data and bio-ontologies enable ecient 21 search via biological facts and grant access to new model features such as network structure. 22 The introduced concept notably improves the access of computational models and associate
- …
