The OBO Foundry: Coordinated Evolution of Ontologies to Support Biomedical Data Integration

Author: Ashburner Michael
Bard Jonathan
Bug William
Ceusters Werner
Eilbeck Karen
Goldberg Louis J
Ireland Amelia
Leontis Neocles
Lewis Suzanna
Mungall Christopher J
OBI Consortium The
Rocca-Serra Philippe
Rosse Cornelius
Ruttenberg Alan
Sansone Susanna-Assunta
Shah Nigam
Smith Barry
Whetzel Patricia L
Publication date: 01/01/2007

The value of any kind of data is greatly enhanced when it exists in a form that allows it to be integrated with other data. One approach to integration is through the annotation of multiple bodies of data using common controlled vocabularies or ‘ontologies’. Unfortunately, the very success of this approach has led to a proliferation of ontologies, which itself creates obstacles to integration. The Open Biomedical Ontologies (OBO) consortium has set in train a strategy to overcome this problem. Existing OBO ontologies, including the Gene Ontology, are undergoing a process of coordinated reform, and new ontologies are being created, on the basis of an evolving set of shared principles governing ontology development. The result is an expanding family of ontologies designed to be interoperable, logically well-formed, and to incorporate accurate representations of biological reality. We describe the OBO Foundry initiative, and provide guidelines for those who might wish to become involved in the future.

University of Washington Structural Informatics Group Publications

E-BioFlow: Different Perspectives on Scientific Workflows

Author: Breit T.
Nijholt A.
Rauwerda H.
Vet P.E. van der
Wassink I.
Publication venue: Springer Verlag
Publication date: 01/01/2008

We introduce a new type of workflow design system called e-BioFlow and illustrate it by means of a simple sequence alignment workflow. E-BioFlow, intended to model advanced scientific workflows, enables the user to model a workflow from three different but strongly coupled perspectives: the control flow perspective, the data flow perspective, and the resource perspective. All three perspectives are of equal importance, but workflow designers from different domains prefer different perspectives as entry points for their design, and a single workflow designer may prefer different perspectives in different stages of workflow design. Each perspective provides its own type of information, visualisation and support for validation. Combining these three perspectives in a single application provides a new and flexible way of modelling workflows.

University of Twente Research Information

International Migration, Integration and Social Cohesion online publications

Survey-based naming conventions for use in OBO Foundry ontology development

Author: Kusnierczyk Waclaw
Lewis Suzanna
Lomax Jane
Mungall Chris
Rocca-Serra Philippe
Sansone Susanna-Assunta
Schober Daniel
Smith Barry
Taylor Chris F.
Publication date: 01/01/2009

A wide variety of ontologies relevant to the biological and medical domains are available through the OBO Foundry portal, and their number is growing rapidly. Integration of these ontologies, while requiring considerable effort, is extremely desirable. However, heterogeneities in format and style pose serious obstacles to such integration. In particular, inconsistencies in naming conventions can impair the readability and navigability of ontology class hierarchies, and hinder their alignment and integration. While other sources of diversity are tremendously complex and challenging, agreeing a set of common naming conventions is an achievable goal, particularly if those conventions are based on lessons drawn from pooled practical experience and surveys of community opinion. We summarize a review of existing naming conventions and highlight certain disadvantages with respect to general applicability in the biological domain. We also present the results of a survey carried out to establish which naming conventions are currently employed by OBO Foundry ontologies and to determine what their special requirements regarding the naming of entities might be. Lastly, we propose an initial set of typographic, syntactic and semantic conventions for labelling classes in OBO Foundry ontologies. Adherence to common naming conventions is more than just a matter of aesthetics. Such conventions provide guidance to ontology creators and help developers avoid flaws and inaccuracies when editing, and especially when interlinking, ontologies. Common naming conventions will also help consumers of ontologies to understand more readily what meanings were intended by the authors of ontologies used in annotating bodies of data.

PhilPapers

ProServer: a simple, extensible Perl DAS server

Author: Birney
D. K. Jackson
Dowell
E. Kulesha
Finn
J. Clements
J. W. Stalker
Olason
Prlic
R. D. Finn
R. Pettett
Yeats
Publication venue: Oxford University Press

Summary: The increasing size and complexity of biological databases have led to a growing trend to federate rather than duplicate them. In order to share data between federated databases, protocols for the exchange mechanism must be developed. One such data exchange protocol that is widely used is the Distributed Annotation System (DAS). For example, DAS has enabled small experimental groups to integrate their data into the Ensembl genome browser. We have developed ProServer, a simple, lightweight, Perl-based DAS server that does not depend on a separate HTTP server. The ProServer package is easily extensible, allowing data to be served from almost any underlying data model. Recent additions to the DAS protocol have enabled both structure and alignment (sequence and structural) data to be exchanged. ProServer allows both of these data types to be served.

Crossref

PubMed Central

Chemical Entities of Biological Interest: an update

Author: Adriano Dekker
Alcántara
Batchelor
Chang
Christoph Steinbeck
Corbett
Degtyarenko
Dimitropoulos
Inmaculada Spiteri
Janna Hastings
Kenneth Haug
Le Novère
Marcus Ennis
Matthews
Parkinson
Paula de Matos
Rafael Alcántara
Rijnbeek
Smith
Stein
Steinbeck
Steve Turner
Wang
Wittig
Publication venue: Oxford University Press

Chemical Entities of Biological Interest (ChEBI) is a freely available dictionary of molecular entities focused on ‘small’ chemical compounds. The molecular entities in question are either natural products or synthetic products used to intervene in the processes of living organisms. Genome-encoded macromolecules (nucleic acids, proteins and peptides derived from proteins by cleavage) are not as a rule included in ChEBI. In addition to molecular entities, ChEBI contains groups (parts of molecular entities) and classes of entities. ChEBI includes an ontological classification, whereby the relationships between molecular entities or classes of entities and their parents and/or children are specified. ChEBI is available online at http://www.ebi.ac.uk/chebi/. This article reports on new features in ChEBI since the last NAR report in 2007, including substructure and similarity searching, a submission tool for authoring of ChEBI datasets by the community and a 30-fold increase in the number of chemical structures stored in ChEBI.

Crossref

PubMed Central

Finding and sharing: new approaches to registries of databases and services for the biomedical sciences

The recent explosion of biological data and the concomitant proliferation of distributed databases make it challenging for biologists and bioinformaticians to discover the best data resources for their needs, and the most efficient way to access and use them. Despite a rapid acceleration in uptake of syntactic and semantic standards for interoperability, it is still difficult for users to find which databases support the standards and interfaces that they need. To solve these problems, several groups are developing registries of databases that capture key metadata describing the biological scope, utility, accessibility, ease-of-use and existence of web services allowing interoperability between resources. Here, we describe some of these initiatives, including a novel formalism, the Database Description Framework, for describing database operations and functionality and encouraging good database practice. We expect such approaches will result in improved discovery, uptake and utilization of data resources. Database URL: http://www.casimir.org.uk/casimir_dd

Aberystwyth Research Portal

University of Groningen

University of Birmingham Research Portal

Spiral - Imperial College Digital Repository

ART

Proceedings - University of Groningen

Crossref

ARTS repository - University of Groningen

PubMed Central

Apollo (Cambridge)

Open Repository and Bibliography - Luxembourg

Leicester Research Archive

Dissertations of the University of Groningen

Finding representative nodes in probabilistic graphs

Author: Langohr Laura
Toivonen Hannu
Publication date: 01/01/2009

Peer reviewed

CiteSeerX

Helsingin yliopiston digitaalinen arkisto

SNP-Converter: an Ontology-Based solution to Reconcile Heterogeneous SNP Descriptions for Pharmacogenomic Studies

Author: Benlian Pascale
Coulet Adrien
Devignes Marie-Dominique
Napoli Amedeo
Smaïl-Tabbone Malika
Publication venue: HAL CCSD
Publication date: 20/07/2006

Pharmacogenomics explores the impact of individual genomic variations in health problems such as adverse drug reactions. Records of millions of genomic variations, mostly known as Single Nucleotide Polymorphisms (SNPs), are available today in various overlapping and heterogeneous databases. Selecting and extracting a proper set of polymorphisms from these databases or from private sources are the first steps of a KDD (Knowledge Discovery in Databases) process in pharmacogenomics. It is, however, a tedious task hampered by the heterogeneity of SNP nomenclatures and annotations. Standards for representing genomic variants have been proposed by the Human Genome Variation Society (HGVS). The SNP-Converter application is aimed at converting any SNP description into an HGVS-compliant pivot description and vice versa. Used in the frame of a knowledge system, the SNP-Converter application contributes as a wrapper to semantic data integration and enrichment.

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

Meta-All: a system for managing metabolic pathway information

Author: B Boehm
BH Junker
Björn H Junker
C Lemer
Christian Klukas
CJ Date
Dirk Koschützki
DL Wheeler
Falk Schreiber
G Joshi-Tope
HM Sauro
I Schomburg
Ivo Grosse
LBM Ellis
M Fowler
M Hucka
M Kanehisa
MY Galperin
P Dadam
P Geigenberger
P Mendes
P Zhang
S Philippi
Stephan Weise
The Plant Ontology Consortium
U Wittig
Uwe Scholz
Publication venue: BioMed Central
Publication date: 01/01/2006

BACKGROUND: Many attempts are being made to understand biological subjects at a systems level. A major resource for these approaches is biological databases, which store manifold information about DNA, RNA and protein sequences including their functional and structural motifs, molecular markers, mRNA expression levels, metabolite concentrations, protein-protein interactions, phenotypic traits or taxonomic relationships. The use of these databases is often hampered by the fact that they are designed for special application areas and thus lack universality. Databases on metabolic pathways, which provide an increasingly important foundation for many analyses of biochemical processes at a systems level, are no exception to the rule. Data stored in central databases such as KEGG, BRENDA or SABIO-RK are often limited to read-only access. If experimentalists want to store their own data, possibly still under investigation, there are two possibilities. They can either develop their own information system for managing their own data, which is very time-consuming and costly, or they can try to store their data in existing systems, which is often restricted. Hence, an out-of-the-box information system for managing metabolic pathway data is needed.
RESULTS: We have designed META-ALL, an information system that allows the management of metabolic pathways, including reaction kinetics, detailed locations, environmental factors and taxonomic information. Data can be stored together with quality tags and in different parallel versions. META-ALL uses Oracle DBMS and Oracle Application Express. We provide the META-ALL information system for download and use. In this paper, we describe the database structure and give information about the tools for submitting and accessing the data. As a first application of META-ALL, we show how the information contained in a detailed kinetic model can be stored and accessed.
CONCLUSION: META-ALL is a system for managing information about metabolic pathways. It facilitates the handling of pathway-related data and is designed to help biochemists and molecular biologists in their daily research. It is available on the Web at and can be downloaded free of charge and installed locally.

KOPS - The Institutional Repository of the University of Konstanz

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Survey-based naming conventions for use in OBO Foundry ontology development

Author: Kusnierczyk Waclaw
Lewis Suzanna E
Lomax Jane
Mungall Chris
Rocca-Serra Philippe
Sansone Susanna-Assunta
Schober Daniel
Smith Barry
Taylor Chris F
Publication venue: BioMed Central
Publication date: 01/01/2009

BACKGROUND: A wide variety of ontologies relevant to the biological and medical domains are available through the OBO Foundry portal, and their number is growing rapidly. Integration of these ontologies, while requiring considerable effort, is extremely desirable. However, heterogeneities in format and style pose serious obstacles to such integration. In particular, inconsistencies in naming conventions can impair the readability and navigability of ontology class hierarchies, and hinder their alignment and integration. While other sources of diversity are tremendously complex and challenging, agreeing a set of common naming conventions is an achievable goal, particularly if those conventions are based on lessons drawn from pooled practical experience and surveys of community opinion.
RESULTS: We summarize a review of existing naming conventions and highlight certain disadvantages with respect to general applicability in the biological domain. We also present the results of a survey carried out to establish which naming conventions are currently employed by OBO Foundry ontologies and to determine what their special requirements regarding the naming of entities might be. Lastly, we propose an initial set of typographic, syntactic and semantic conventions for labelling classes in OBO Foundry ontologies.
CONCLUSION: Adherence to common naming conventions is more than just a matter of aesthetics. Such conventions provide guidance to ontology creators, help developers avoid flaws and inaccuracies when editing, and especially when interlinking, ontologies. Common naming conventions will also assist consumers of ontologies to more readily understand what meanings were intended by the authors of ontologies used in annotating bodies of data.

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central