Search CORE

678 research outputs found

Automatic annotation of bioinformatics workflows with biomedical ontologies

Author: B. Smith
B.P. Vandervalk
D. Sáchez
D. Withers
J. Ison
M.D. Wilkinson
M.D. Wilkinson
P. Lord
P. Rice
S. Harispe
T. Oinn
U. Radetzki
Publication venue
Publication date: 01/01/2014
Field of study

Legacy scientific workflows, and the services within them, often present scarce and unstructured (i.e. textual) descriptions. This makes it difficult to find, share and reuse them, thus dramatically reducing their value to the community. This paper presents an approach to annotating workflows and their subcomponents with ontology terms, in an attempt to describe these artifacts in a structured way. Despite a dearth of even textual descriptions, we automatically annotated 530 myExperiment bioinformatics-related workflows, including more than 2600 workflow-associated services, with relevant ontological terms. Quantitative evaluation of the Information Content of these terms suggests that, in cases where annotation was possible at all, the annotation quality was comparable to manually curated bioinformatics resources.Comment: 6th International Symposium on Leveraging Applications (ISoLA 2014 conference), 15 pages, 4 figure

arXiv.org e-Print Archive

Crossref

Application of Semantics to Solve Problems in Life Sciences

Author: García Godoy María Jesús
Publication venue: UMA Editorial
Publication date: 01/01/2018
Field of study

Fecha de lectura de Tesis: 10 de diciembre de 2018La cantidad de información que se genera en la Web se ha incrementado en los últimos años. La mayor parte de esta información se encuentra accesible en texto, siendo el ser humano el principal usuario de la Web. Sin embargo, a pesar de todos los avances producidos en el área del procesamiento del lenguaje natural, los ordenadores tienen problemas para procesar esta información textual. En este cotexto, existen dominios de aplicación en los que se están publicando grandes cantidades de información disponible como datos estructurados como en el área de las Ciencias de la Vida. El análisis de estos datos es de vital importancia no sólo para el avance de la ciencia, sino para producir avances en el ámbito de la salud. Sin embargo, estos datos están localizados en diferentes repositorios y almacenados en diferentes formatos que hacen difícil su integración. En este contexto, el paradigma de los Datos Vinculados como una tecnología que incluye la aplicación de algunos estándares propuestos por la comunidad W3C tales como HTTP URIs, los estándares RDF y OWL. Haciendo uso de esta tecnología, se ha desarrollado esta tesis doctoral basada en cubrir los siguientes objetivos principales: 1) promover el uso de los datos vinculados por parte de la comunidad de usuarios del ámbito de las Ciencias de la Vida 2) facilitar el diseño de consultas SPARQL mediante el descubrimiento del modelo subyacente en los repositorios RDF 3) crear un entorno colaborativo que facilite el consumo de Datos Vinculados por usuarios finales, 4) desarrollar un algoritmo que, de forma automática, permita descubrir el modelo semántico en OWL de un repositorio RDF, 5) desarrollar una representación en OWL de ICD-10-CM llamada Dione que ofrezca una metodología automática para la clasificación de enfermedades de pacientes y su posterior validación haciendo uso de un razonador OWL

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Repositorio Institucional Universidad de Málaga

A Semantic Problem Solving Environment for Integrative Parasite Research: Identification of Intervention Targets for Trypanosoma cruzi

Author: A Bernstein
A Brazma
A Ruttenberg
A Ruttenberg
AA Ackermann
Aaron R. Jex
Amir H. Asiaee
Amit P. Sheth
B Chukualim
BM Good
C Aurrecoechea
C Bizer
C Blaschke
C Goble
C Goble
C Hertz-Fowler
D Xu
E Antezana
E Sirin
H Dietze
H Lam
H Tang
J Bhagat
J Luciano
J Malone
JA Atwood
JC Jeremy
K Christoph
K Eilbeck
K-H Cheung
M Ashburner
M Aslett
M Johnson
M Kanehisa
NM El-Sayed
P Hitzler
P Mendes
PR Smart
Prashant Doshi
Priti P. Parikh
R Brinkman
Rick Tarleton
Sarasi Lalithsena
Satya S. Sahoo
SS Sahoo
SS Sahoo
SS Sahoo
SS Sahoo
T Minning
TA Minning
Todd A. Minning
V Cross
V Petri
Vinh Nguyen
W Hersh
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

Effective research in parasite biology requires analyzing experimental lab data in the context of constantly expanding public data resources. Integrating lab data with public resources is particularly difficult for biologists who may not possess significant computational skills to acquire and process heterogeneous data stored at different locations. Therefore, we develop a semantic problem solving environment (SPSE) that allows parasitologists to query their lab data integrated with public resources using ontologies. An ontology specifies a common vocabulary and formal relationships among the terms that describe an organism, and experimental data and processes in this case. SPSE supports capturing and querying provenance information, which is metadata on the experimental processes and data recorded for reproducibility, and includes a visual query-processing tool to formulate complex queries without learning the query language syntax. We demonstrate the significance of SPSE in identifying gene knockout targets for T. cruzi. The overall goal of SPSE is to help researchers discover new or existing knowledge that is implicitly present in the data but not always easily detected. Results demonstrate improved usefulness of SPSE over existing lab systems and approaches, and support for complex query design that is otherwise difficult to achieve without the knowledge of query language syntax

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Scholar Commons - Institutional Repository of the University of South Carolina

CORE

AMMO-Prot: amine system project 3D-model finder

Author: Aldana-Montes José F
Montañez Raúl
Moya-García Aurelio A
Navas-Delgado Ismael
Pino-Ángeles Almudena
Sánchez-Jiménez Francisca
Urdiales José Luis
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

BACKGROUND: Amines are biogenic amino acid derivatives, which play pleiotropic and very important yet complex roles in animal physiology. For many other relevant biomolecules, biochemical and molecular data are being accumulated, which need to be integrated in order to be effective in the advance of biological knowledge in the field. For this purpose, a multidisciplinary group has started an ontology-based system named the Amine System Project (ASP) for which amine-related information is the validation bench. RESULTS: In this paper, we describe the Ontology-Based Mediator developed in the Amine System Project (http://asp.uma.es) using the infrastructure of Semantic Directories, and how this system has been used to solve a case related to amine metabolism-related protein structures. CONCLUSIONS: This infrastructure is used to publish and manage not only ontologies and their relationships, but also metadata relating to the resources committed with the ontologies. The system developed is available at http://asp.uma.es/WebMediator

Crossref

Springer - Publisher Connector

PubMed Central

UCL Discovery

BioXSD: the common data-exchange format for everyday bioinformatics web services

Author: A. Joseph
A. Topfer
Altschul
Ashburner
C. Blanchet
Duckert
E. Bartaseviciute
Eilbeck
Garnier
Gouveia-Oliveira
Han
Hermjakob
Hucka
Hull
I. Jonassen
J. C. Bryne
J. Ison
K. Rapacki
Kottmann
M. Kalas
P. Puntervoll
P. Venkataraman
Prlic
S. Pettifer
Seibel
Stein
Wilkinson
Publication venue: Oxford University Press
Publication date: 01/01/2010
Field of study

Motivation: The world-wide community of life scientists has access to a large number of public bioinformatics databases and tools, which are developed and deployed using diverse technologies and designs. More and more of the resources offer programmatic web-service interface. However, efficient use of the resources is hampered by the lack of widely used, standard data-exchange formats for the basic, everyday bioinformatics data types

University of Bergen

Crossref

PubMed Central

The University of Manchester - Institutional Repository

NORA - Norwegian Open Research Archives

Online Research Database In Technology

SSWAP: A Simple Semantic Web Architecture and Protocol for semantic web services

Author: B Adida
B McBride
CA Joslyn
CA Joslyn
Christopher D Town
D De Roure
Damian DG Gessler
David Grant
DDG Gessler
E Sayers
E Sirin
G Benson
G Klyne
Gary S Schiltz
Gene Ontology Consortium
Greg D May
I Kant
M Wilkinson
O Ritter
P Karp
P Lord
Rex T Nelson
RJ Robbins
S Pillai
SA McIlraith
SB Davidson
Shulamit Avraham
SY Rhee
T Berners-Lee
VM Markowitz
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background SSWAP (Simple Semantic Web Architecture and Protocol; pronounced "swap") is an architecture, protocol, and platform for using reasoning to semantically integrate heterogeneous disparate data and services on the web. SSWAP was developed as a hybrid semantic web services technology to overcome limitations found in both pure web service technologies and pure semantic web technologies. Results There are currently over 2400 resources published in SSWAP. Approximately two dozen are custom-written services for QTL (Quantitative Trait Loci) and mapping data for legumes and grasses (grains). The remaining are wrappers to Nucleic Acids Research Database and Web Server entries. As an architecture, SSWAP establishes how clients (users of data, services, and ontologies), providers (suppliers of data, services, and ontologies), and discovery servers (semantic search engines) interact to allow for the description, querying, discovery, invocation, and response of semantic web services. As a protocol, SSWAP provides the vocabulary and semantics to allow clients, providers, and discovery servers to engage in semantic web services. The protocol is based on the W3C-sanctioned first-order description logic language OWL DL. As an open source platform, a discovery server running at <url>http://sswap.info</url> (as in to "swap info") uses the description logic reasoner Pellet to integrate semantic resources. The platform hosts an interactive guide to the protocol at <url>http://sswap.info/protocol.jsp</url>, developer tools at <url>http://sswap.info/developer.jsp</url>, and a portal to third-party ontologies at <url>http://sswapmeet.sswap.info</url> (a "swap meet"). Conclusion SSWAP addresses the three basic requirements of a semantic web services architecture (<it>i.e</it>., a common syntax, shared semantic, and semantic discovery) while addressing three technology limitations common in distributed service systems: <it>i.e</it>., <it>i</it>) the fatal mutability of traditional interfaces, <it>ii</it>) the rigidity and fragility of static subsumption hierarchies, and <it>iii</it>) the confounding of content, structure, and presentation. SSWAP is novel by establishing the concept of a canonical yet mutable OWL DL graph that allows data and service providers to describe their resources, to allow discovery servers to offer semantically rich search engines, to allow clients to discover and invoke those resources, and to allow providers to respond with semantically tagged data. SSWAP allows for a mix-and-match of terms from both new and legacy third-party ontologies in these graphs.</p

Crossref

Cold Spring Harbor Laboratory Institutional Repository

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

The University of Arizona

Semantic-JSON: a lightweight web service interface for Semantic Web contents integrating multiple life science databases

Author: Akihiro Matsushima
Antezana
Ashburner
Belleau
Berners-Lee
Berners-Lee
Demir
Hamosh
Iida
Jaiswal
Jenkinson
Kawaji
Laibe
Manabu Ishii
Masuya
Norio Kobayashi
Satoshi Takahashi
Swarbreck
Tetsuro Toyoda
The UniProt Consortium
Toyoda
Yoshiki Mochizuki
Publication venue: Oxford University Press
Publication date
Field of study

Global cloud frameworks for bioinformatics research databases become huge and heterogeneous; solutions face various diametric challenges comprising cross-integration, retrieval, security and openness. To address this, as of March 2011 organizations including RIKEN published 192 mammalian, plant and protein life sciences databases having 8.2 million data records, integrated as Linked Open or Private Data (LOD/LPD) using SciNetS.org, the Scientists' Networking System. The huge quantity of linked data this database integration framework covers is based on the Semantic Web, where researchers collaborate by managing metadata across public and private databases in a secured data space. This outstripped the data query capacity of existing interface tools like SPARQL. Actual research also requires specialized tools for data analysis using raw original data. To solve these challenges, in December 2009 we developed the lightweight Semantic-JSON interface to access each fragment of linked and raw life sciences data securely under the control of programming languages popularly used by bioinformaticians such as Perl and Ruby. Researchers successfully used the interface across 28 million semantic relationships for biological applications including genome design, sequence processing, inference over phenotype databases, full-text search indexing and human-readable contents like ontology and LOD tree viewers. Semantic-JSON services of SciNetS.org are provided at http://semanticjson.org

Crossref

PubMed Central

Knowledge sharing and collaboration in translational research, and the DC-THERA Directory

Author: A. Splendiani
Ashburner
Ball
Bard
Bard
Barrett
Bourne
Brazma
Brazma
Burks
C. Scognamiglio
Ciccarese
D. Cavalieri
Demir
Etzold
Gardiner-Garden
Gardner
Goble
Grenon
Hoehndorf
Howe
Hu
Hucka
Hull
J. M. Austyn
Kinoshita
M. Brandizi
M. Gundel
Masci
Mons
Mons
Rebholz-Schuhmann
Roberts
Rubin
Shotton
Silvertown
Smith
Stein
Stein
Szalay
Webster
Wilkinson
Publication venue: Oxford University Press
Publication date: 01/01/2011
Field of study

Biomedical research relies increasingly on large collections of data sets and knowledge whose generation, representation and analysis often require large collaborative and interdisciplinary efforts. This dimension of ‘big data’ research calls for the development of computational tools to manage such a vast amount of data, as well as tools that can improve communication and access to information from collaborating researchers and from the wider community. Whenever research projects have a defined temporal scope, an additional issue of data management arises, namely how the knowledge generated within the project can be made available beyond its boundaries and life-time. DC-THERA is a European ‘Network of Excellence’ (NoE) that spawned a very large collaborative and interdisciplinary research community, focusing on the development of novel immunotherapies derived from fundamental research in dendritic cell immunobiology. In this article we introduce the DC-THERA Directory, which is an information system designed to support knowledge management for this research community and beyond. We present how the use of metadata and Semantic Web technologies can effectively help to organize the knowledge generated by modern collaborative research, how these technologies can enable effective data management solutions during and beyond the project lifecycle, and how resources such as the DC-THERA Directory fit into the larger context of e-science

Crossref

PubMed Central

Oxford University Research Archive

Rothamsted Repository

Interval Neutrosophic Sets and Logic: Theory and Applications in Computing

Author: Wang Haibin
Publication venue: ScholarWorks @ Georgia State University
Publication date: 01/01/2005
Field of study

A neutrosophic set is a part of neutrosophy that studies the origin, nature, and scope of neutralities, as well as their interactions with different ideational spectra. The neutrosophic set is a powerful general formal framework that has been recently proposed. However, the neutrosophic set needs to be specified from a technical point of view. Here, we define the set-theoretic operators on an instance of a neutrosophic set, and call it an Interval Neutrosophic Set (INS). We prove various properties of INS, which are connected to operations and relations over INS. We also introduce a new logic system based on interval neutrosophic sets. We study the interval neutrosophic propositional calculus and interval neutrosophic predicate calculus. We also create a neutrosophic logic inference system based on interval neutrosophic logic. Under the framework of the interval neutrosophic set, we propose a data model based on the special case of the interval neutrosophic sets called Neutrosophic Data Model. This data model is the extension of fuzzy data model and paraconsistent data model. We generalize the set-theoretic operators and relation-theoretic operators of fuzzy relations and paraconsistent relations to neutrosophic relations. We propose the generalized SQL query constructs and tuple-relational calculus for Neutrosophic Data Model. We also design an architecture of Semantic Web Services agent based on the interval neutrosophic logic and do the simulation study

CiteSeerX

ScholarWorks @ Georgia State University