69 research outputs found
Identifiers.org and MIRIAM Registry: perennial identifiers for crossreferencing purposes
The MIRIAM Registry provides unique, perennial and location independent identifiers for data used in the biomedical domain. At its core is a shared catalogue of data collections, for each of which a unique namespace is created, and extensive metadata recorded. This namespace allows the generation of Uniform Resource Identifiers (URIs) to uniquely identify any record in a collection. We present here the new Identifiers.org service which is built upon the information stored in the Registry and which provides directly resolvable identifiers, in the form of Uniform Resource Locators (URLs). The flexibility of the identification scheme and resolving system allows its use in many different fields, where unambiguous and perennial identification of data entities is necessary
MIRIAM Resources: next steps
This presentation gives a short introduction of MIRIAM Resources ("http://www.ebi.ac.uk/miriam/":http://www.ebi.ac.uk/miriam/), a robust annotation and cross referencing framework, based on URNs.

The presentation also explains some potential issues with the current system and presents a possible solution.

Finally, the presentation introduces some forthcoming and suggested updates and extensions to the current infrastructure
Management and provision of computational models
Quantitative models of biological systems provide an understanding of chemical and biological phenomena based on their underlying mechanisms. Moreover, they can be used for example, to predict the behaviour of a system under given conditions or direct future experiments. This has made quantitative models the perfect tools to answer a variety of questions in the biological sciences and has lead to a steady growth of the number of published models.

To maximise the benefits of this growing body of models, the field needs centralised model repositories that will encourage, facilitate and promote model dissemination and reuse. BioModels Database(http://www.ebi.ac.uk/biomodels/) has been developed to exactly fulfil those needs. In order to ensure the correctness of the models distributed, their structure and behaviour are thoroughly checked. To ease their understanding, the model elements are annotated with terms from controlled vocabularies as well as linked to relevant data resources. Finally, to allow their reuse, the models are provided encoded in community supported and standardised formats.

However, the modelling field is constantly evolving and data providers, like BioModels Database, are faced with new challenges. For example, models are getting more and more complex (with for instance the availability of whole organism metabolic network reconstructions) and this has a direct impact on the performance of hosting infrastructures and annotation procedures. Also, models are now being developed collaboratively: this requires new methodologies and systems, akin to the ones used in software development (with for example versioned repositories of models). Moreover, very different kinds of models are being developed by diverse communities, but ultimately their data management needs are very similar.

This talk will introduce the needs which lead to the development of BioModels Database, present the resource and its current infrastructure and finally discuss the challenges that we are facing today and the plans to overcome them
BioModels Database: Next generation model repository
Public announce that the software system running BioModels Database ("http://www.ebi.ac.uk/biomodels/":http://www.ebi.ac.uk/biomodels/) will evolve from open source to a community developed project
Kinetic Simulation Algorithm Ontology
To enable the accurate and repeatable execution of a computational simulation task, it is important to identify both the algorithm used and the initial setup. These minimum information requirements are described by the MIASE guidelines. Since the details of some algorithms are not always publicly available, and many are implemented only in a limited number of simulation tools, it is crucial to identify alternative algorithms with similar characteristics that may be used to provide comparable results in an equivalent simulation experiment. The Kinetic Simulation Algorithm Ontology (KiSAO) was developed to address this issue by describing existing algorithms and their inter-relationships through their characteristics and parameters. The use of KiSAO in conjunction with simulation descriptions, such as SED-ML, will allow simulation software to automatically choose the best algorithm available to perform a simulation. The availability of algorithm parameters, together with their type may permit the automatic generation of user-interfaces to configure simulators. To enable making queries to KiSAO programmaticaly, from simulation experiment description editors and simulation tools, a java library libKiSAO was implemented
BioModels Database: An enhanced, curated and annotated resource for published quantitative kinetic models
Background: Quantitative models of biochemical and cellular systems are used to answer a variety of questions in the
biological sciences. The number of published quantitative models is growing steadily thanks to increasing interest in
the use of models as well as the development of improved software systems and the availability of better, cheaper
computer hardware. To maximise the benefits of this growing body of models, the field needs centralised model
repositories that will encourage, facilitate and promote model dissemination and reuse. Ideally, the models stored in
these repositories should be extensively tested and encoded in community-supported and standardised formats. In
addition, the models and their components should be cross-referenced with other resources in order to allow their
unambiguous identification.
Description: BioModels Database http://www.ebi.ac.uk/biomodels/ is aimed at addressing exactly these needs. It is a
freely-accessible online resource for storing, viewing, retrieving, and analysing published, peer-reviewed quantitative
models of biochemical and cellular systems. The structure and behaviour of each simulation model distributed by
BioModels Database are thoroughly checked; in addition, model elements are annotated with terms from controlled
vocabularies as well as linked to relevant data resources. Models can be examined online or downloaded in various
formats. Reaction network diagrams generated from the models are also available in several formats. BioModels
Database also provides features such as online simulation and the extraction of components from large scale models
into smaller submodels. Finally, the system provides a range of web services that external software systems can use to
access up-to-date data from the database.
Conclusions: BioModels Database has become a recognised reference resource for systems biology. It is being used by
the community in a variety of ways; for example, it is used to benchmark different simulation systems, and to study the
clustering of models based upon their annotations. Model deposition to the database today is advised by several
publishers of scientific journals. The models in BioModels Database are freely distributed and reusable; the underlying
software infrastructure is also available from SourceForge https://sourceforge.net/projects/biomodels/ under the GNU
General Public License
SPARQL-enabled identifier conversion with Identifiers.org
Motivation: On the semantic web, in life sciences in particular, data is often distributed via multiple resources. Each of these sources is likely to use their own International Resource Identifier for conceptually the same resource or database record. The lack of correspondence between identifiers introduces a barrier when executing federated SPARQL queries across life science data. Results: We introduce a novel SPARQL-based service to enable on-the-fly integration of life science data. This service uses the identifier patterns defined in the Identifiers.org Registry to generate a plurality of identifier variants, which can then be used to match source identifiers with target identifiers. We demonstrate the utility of this identifier integration approach by answering queries across major producers of life science Linked Data. Availability and implementation: The SPARQL-based identifier conversion service is available without restriction at http://identifiers.org/services/sparql. Contact: [email protected]
The EBI RDF platform: linked open data for the life sciences
Motivation: Resource description framework (RDF) is an emerging technology for describing, publishing and linking life science data. As a major provider of bioinformatics data and services, the European Bioinformatics Institute (EBI) is committed to making data readily accessible to the community in ways that meet existing demand. The EBI RDF platform has been developed to meet an increasing demand to coordinate RDF activities across the institute and provides a new entry point to querying and exploring integrated resources available at the EBI. Availability: http://www.ebi.ac.uk/rdf Contact: [email protected]
One file to share them all: Using the COMBINE Archive and the OMEX format to share all information about a modeling project
Background: With the ever increasing use of computational models in the
biosciences, the need to share models and reproduce the results of published
studies efficiently and easily is becoming more important. To this end, various
standards have been proposed that can be used to describe models, simulations,
data or other essential information in a consistent fashion. These constitute
various separate components required to reproduce a given published scientific
result.
Results: We describe the Open Modeling EXchange format (OMEX). Together with
the use of other standard formats from the Computational Modeling in Biology
Network (COMBINE), OMEX is the basis of the COMBINE Archive, a single file that
supports the exchange of all the information necessary for a modeling and
simulation experiment in biology. An OMEX file is a ZIP container that includes
a manifest file, listing the content of the archive, an optional metadata file
adding information about the archive and its content, and the files describing
the model. The content of a COMBINE Archive consists of files encoded in
COMBINE standards whenever possible, but may include additional files defined
by an Internet Media Type. Several tools that support the COMBINE Archive are
available, either as independent libraries or embedded in modeling software.
Conclusions: The COMBINE Archive facilitates the reproduction of modeling and
simulation experiments in biology by embedding all the relevant information in
one file. Having all the information stored and exchanged at once also helps in
building activity logs and audit trails. We anticipate that the COMBINE Archive
will become a significant help for modellers, as the domain moves to larger,
more complex experiments such as multi-scale models of organs, digital
organisms, and bioengineering.Comment: 3 figures, 1 tabl
BioModels: ten-year anniversary
BioModels (http://www.ebi.ac.uk/biomodels/) is a repository of mathematical models of biological processes. A large set of models is curated to verify both correspondence to the biological process that the model seeks to represent, and reproducibility of the simulation results as described in the corresponding peer-reviewed publication. Many models submitted to the database are annotated, cross-referencing its components to external resources such as database records, and terms from controlled vocabularies and ontologies. BioModels comprises two main branches: one is composed of models derived from literature, while the second is generated through automated processes. BioModels currently hosts over 1200 models derived directly from the literature, as well as in excess of 140 000 models automatically generated from pathway resources. This represents an approximate 60-fold growth for literature-based model numbers alone, since BioModels’ first release a decade ago. This article describes updates to the resource over this period, which include changes to the user interface, the annotation profiles of models in the curation pipeline, major infrastructure changes, ability to perform online simulations and the availability of model content in Linked Data form. We also outline planned improvements to cope with a diverse array of new challenges
- …