Search CORE

7 research outputs found

The ontology of biological sequences.

Author: Herre Heinrich
Hoehndorf Robert
Kelso Janet
Publication venue
Publication date: 18/11/2009
Field of study

BACKGROUND: Biological sequences play a major role in molecular and computational biology. They are studied as information-bearing entities that make up DNA, RNA or proteins. The Sequence Ontology, which is part of the OBO Foundry, contains descriptions and definitions of sequences and their properties. Yet the most basic question about sequences remains unanswered: what kind of entity is a biological sequence? An answer to this question benefits formal ontologies that use the notion of biological sequences and analyses in computational biology alike. RESULTS: We provide both an ontological analysis of biological sequences and a formal representation that can be used in knowledge-based applications and other ontologies. We distinguish three distinct kinds of entities that can be referred to as "biological sequence": chains of molecules, syntactic representations such as those in biological databases, and the abstract information-bearing entities. For use in knowledge-based applications and inclusion in biomedical ontologies, we implemented the developed axiom system for use in automated theorem proving. CONCLUSION: Axioms are necessary to achieve the main goal of ontologies: to formally specify the meaning of terms used within a domain. The axiom system for the ontology of biological sequences is the first elaborate axiom system for an OBO Foundry ontology and can serve as starting point for the development of more formal ontologies and ultimately of knowledge-based applications

Aberystwyth Research Portal

PubMed Central

MPG.PuRe

Ontologies in Quantitative Biology: A Basis for Comparison, Integration, and Discovery

Author: A. G Murzin
A. H Renear
B. R Zeeberg
C Perez-Iratxeta
C. A Orengo
D. A Hosack
D. R Swanson
E. L Sonnhammer
F Al-Shahrour
G Joshi-Tope
H Ogata
J Schulz
K. D Dahlquist
L Montecchi-Palazzi
L. J Jensen
L. J Lu
Lars J. Jensen
M Campillos
M Selbach
M. E Aranguren
M. V Blagosklonny
N. L Washington
Peer Bork
R Hoehndorf
R. L Tatusov
S Kerrien
S. W Doniger
T Attwood
T. R Gruber
W. R Taylor
Publication venue: Public Library of Science
Publication date: 01/05/2010
Field of study

As biology is becoming a data-driven discipline, ontologies become increasingly important for systematically capturing the existing knowledge. This essay discusses current trends and how ontologies can also be used for discovery

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Copenhagen University Research Information System

MDC Repository

Evolution of the Sequence Ontology terms and relationships

Author: Batchelor Colin
Eilbeck Karen
Mungall Christopher J.
Publication venue: 'Elsevier BV'
Publication date: 01/02/2011
Field of study

AbstractThe Sequence Ontology is an established ontology, with a large user community, for the purpose of genomic annotation. We are reforming the ontology to provide better terms and relationships to describe the features of biological sequence, for both genomic and derived sequence. The SO is working within the guidelines of the OBO Foundry to provide interoperability between SO and the other related OBO ontologies. Here, we report changes and improvements made to SO including new relationships to better define the mereological, spatial and temporal aspects of biological sequence

Elsevier - Publisher Connector

PubMed Central

Ontology design patterns to disambiguate relations between genes and gene products in GENIA

Author: Hoehndorf Robert
Ngonga Ngomo Axel-Cyrille
Oellrich Anika
Ohta Tomoko
Pyysalo Sampo
Rebholz-Schuhmann Dietrich
Publication venue
Publication date: 01/01/2011
Field of study

MOTIVATION: Annotated reference corpora play an important role in biomedical information extraction. A semantic annotation of the natural language texts in these reference corpora using formal ontologies is challenging due to the inherent ambiguity of natural language. The provision of formal definitions and axioms for semantic annotations offers the means for ensuring consistency as well as enables the development of verifiable annotation guidelines. Consistent semantic annotations facilitate the automatic discovery of new information through deductive inferences. RESULTS: We provide a formal characterization of the relations used in the recent GENIA corpus annotations. For this purpose, we both select existing axiom systems based on the desired properties of the relations within the domain and develop new axioms for several relations. To apply this ontology of relations to the semantic annotation of text corpora, we implement two ontology design patterns. In addition, we provide a software application to convert annotated GENIA abstracts into OWL ontologies by combining both the ontology of relations and the design patterns. As a result, the GENIA abstracts become available as OWL ontologies and are amenable for automated verification, deductive inferences and other knowledge-based applications. AVAILABILITY: Documentation, implementation and examples are available from http://www-tsujii.is.s.u-tokyo.ac.jp/GENIA/

Crossref

Aberystwyth Research Portal

Springer - Publisher Connector

PubMed Central

King's Research Portal

Apollo (Cambridge)

Preserving sequence annotations across reference sequences

Author
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Crossref

EFindSite: Improved prediction of ligand binding sites in protein models using meta-threading, machine learning and auxiliary ligands

Author: Brylinski Michal
Feinstein Wei P.
Publication venue: LSU Digital Commons
Publication date: 01/06/2013
Field of study

Molecular structures and functions of the majority of proteins across different species are yet to be identified. Much needed functional annotation of these gene products often benefits from the knowledge of protein-ligand interactions. Towards this goal, we developed eFindSite, an improved version of FINDSITE, designed to more efficiently identify ligand binding sites and residues using only weakly homologous templates. It employs a collection of effective algorithms, including highly sensitive meta-threading approaches, improved clustering techniques, advanced machine learning methods and reliable confidence estimation systems. Depending on the quality of target protein structures, eFindSite outperforms geometric pocket detection algorithms by 15-40 % in binding site detection and by 5-35 % in binding residue prediction. Moreover, compared to FINDSITE, it identifies 14 % more binding residues in the most difficult cases. When multiple putative binding pockets are identified, the ranking accuracy is 75-78 %, which can be further improved by 3-4 % by including auxiliary information on binding ligands extracted from biomedical literature. As a first across-genome application, we describe structure modeling and binding site prediction for the entire proteome of Escherichia coli. Carefully calibrated confidence estimates strongly indicate that highly reliable ligand binding predictions are made for the majority of gene products, thus eFindSite holds a significant promise for large-scale genome annotation and drug development projects. eFindSite is freely available to the academic community at http://www.brylinski.org/efindsite. © 2013 Springer Science+Business Media Dordrecht

Crossref

Louisiana State University

Cancer signaling networks and their implications for personalized medicine

Author: Creixell Pau
Publication venue: Technical University of Denmark
Publication date: 01/01/2013
Field of study

Online Research Database In Technology