Search CORE

4,874 research outputs found

Bridging the gap between social tagging and semantic annotation: E.D. the Entity Describer

Author: Benjamin M. Good
Edward A. Kawas
Mark D. Wilkinson
Publication venue
Publication date: 07/09/2007
Field of study

Semantic annotation enables the development of efficient computational methods for analyzing and interacting with information, thus maximizing its value. With the already substantial and constantly expanding data generation capacity of the life sciences as well as the concomitant increase in the knowledge distributed in scientific articles, new ways to produce semantic annotations of this information are crucial. While automated techniques certainly facilitate the process, manual annotation remains the gold standard in most domains. In this manuscript, we describe a prototype mass-collaborative semantic annotation system that, by distributing the annotation workload across the broad community of biomedical researchers, may help to produce the volume of meaningful annotations needed by modern biomedical science. We present E.D., the Entity Describer, a mashup of the Connotea social tagging system, an index of semantic web-accessible controlled vocabularies, and a new public RDF database for storing social semantic annotations

Nature Precedings

The development of non-coding RNA ontology

Author: Blake Judith
de Silva Nisansa
Dou Deijing
Eilbeck Karen
Huan Jun
Huang Jingshan
Huang Weili
Jiang Guoqian
Kasukurthi Mohan Vamsi
Lin Yu
Natale Darren
Ruttenberg Alan
Smith Barry
Strachan Harrison
Wu Bin
Zimmermann Michael
Publication venue
Publication date: 01/01/2016
Field of study

Identification of non-coding RNAs (ncRNAs) has been significantly improved over the past decade. On the other hand, semantic annotation of ncRNA data is facing critical challenges due to the lack of a comprehensive ontology to serve as common data elements and data exchange standards in the field. We developed the Non-Coding RNA Ontology (NCRO) to handle this situation. By providing a formally defined ncRNA controlled vocabulary, the NCRO aims to fill a specific and highly needed niche in semantic annotation of large amounts of ncRNA biological and clinical data

PhilPapers

Searching with Tags: Do Tags Help Users Find Things?

Author: Campbell D. Grant
Kipp Margaret E.I.
Publication venue
Publication date: 01/01/2010
Field of study

This study examines the question of whether tags can be useful in the process of information retrieval. Participants searched a social bookmarking tool specialising in academic articles (CiteULike) and an online journal database (Pubmed). Participant actions were captured using screen capture software and they were asked to describe their search process. Users did make use of tags in their search process, as a guide to searching and as hyperlinks to potentially useful articles. However, users also made use of controlled vocabularies in the journal database to locate useful search terms and of links to related articles supplied by the database

E-LIS

Infectious Disease Ontology

Technological developments have resulted in tremendous increases in the volume and diversity of the data and information that must be processed in the course of biomedical and clinical research and practice. Researchers are at the same time under ever greater pressure to share data and to take steps to ensure that data resources are interoperable. The use of ontologies to annotate data has proven successful in supporting these goals and in providing new possibilities for the automated processing of data and information. In this chapter, we describe different types of vocabulary resources and emphasize those features of formal ontologies that make them most useful for computational applications. We describe current uses of ontologies and discuss future goals for ontology-based computing, focusing on its use in the field of infectious diseases. We review the largest and most widely used vocabulary resources relevant to the study of infectious diseases and conclude with a description of the Infectious Disease Ontology (IDO) suite of interoperable ontology modules that together cover the entire infectious disease domain

PhilPapers

CiteSeerX

Crossref

Open Data Platform for Knowledge Access in Plant Health Domain : VESPA Mining

Author: Andro Mathieu
Corbière Roselyne
Phan Tien T.
Turenne Nicolas
Publication venue
Publication date: 01/01/2015
Field of study

Important data are locked in ancient literature. It would be uneconomic to produce these data again and today or to extract them without the help of text mining technologies. Vespa is a text mining project whose aim is to extract data on pest and crops interactions, to model and predict attacks on crops, and to reduce the use of pesticides. A few attempts proposed an agricultural information access. Another originality of our work is to parse documents with a dependency of the document architecture

arXiv.org e-Print Archive

CiteSeerX

ProdInra

Hal-Diderot

HAL-Ecole des Ponts ParisTech

HAL-Rennes 1

HAL - UPEC / UPEM

NASA automatic subject analysis technique for extracting retrievable multi-terms (NASA TERM) system

Author: Kirschbaum J.
Williamson R. E.
Publication venue
Publication date
Field of study

Current methods for information processing and retrieval used at the NASA Scientific and Technical Information Facility are reviewed. A more cost effective computer aided indexing system is proposed which automatically generates print terms (phrases) from the natural text. Satisfactory print terms can be generated in a primarily automatic manner to produce a thesaurus (NASA TERMS) which extends all the mappings presently applied by indexers, specifies the worth of each posting term in the thesaurus, and indicates the areas of use of the thesaurus entry phrase. These print terms enable the computer to determine which of several terms in a hierarchy is desirable and to differentiate ambiguous terms. Steps in the NASA TERMS algorithm are discussed and the processing of surrogate entry phrases is demonstrated using four previously manually indexed STAR abstracts for comparison. The simulation shows phrase isolation, text phrase reduction, NASA terms selection, and RECON display

NASA Technical Reports Server

Developing the Quantitative Histopathology Image Ontology : A case study using the hot spot detection problem

Author: Doyle Scott
Gurcan Metin
James A.
John Overton
N. Tomaszewski
Ruttenberg Alan
Smith Barry
Publication venue
Publication date: 01/01/2017
Field of study

Interoperability across data sets is a key challenge for quantitative histopathological imaging. There is a need for an ontology that can support effective merging of pathological image data with associated clinical and demographic data. To foster organized, cross-disciplinary, information-driven collaborations in the pathological imaging field, we propose to develop an ontology to represent imaging data and methods used in pathological imaging and analysis, and call it Quantitative Histopathological Imaging Ontology – QHIO. We apply QHIO to breast cancer hot-spot detection with the goal of enhancing reliability of detection by promoting the sharing of data between image analysts

PhilPapers

SLIS Student Research Journal, Vol. 5, Iss. 1

Author
Publication venue: SJSU ScholarWorks
Publication date: 24/10/2016
Field of study

SJSU ScholarWorks

Discovering Beaten Paths in Collaborative Ontology-Engineering Projects using Markov Chains

Author: Musen Mark A.
Noy Natalya F.
Singer Philipp
Strohmaier Markus
Tudorache Tania
Walk Simon
Publication venue: 'Elsevier BV'
Publication date: 29/02/2016
Field of study

Biomedical taxonomies, thesauri and ontologies in the form of the International Classification of Diseases (ICD) as a taxonomy or the National Cancer Institute Thesaurus as an OWL-based ontology, play a critical role in acquiring, representing and processing information about human health. With increasing adoption and relevance, biomedical ontologies have also significantly increased in size. For example, the 11th revision of the ICD, which is currently under active development by the WHO contains nearly 50,000 classes representing a vast variety of different diseases and causes of death. This evolution in terms of size was accompanied by an evolution in the way ontologies are engineered. Because no single individual has the expertise to develop such large-scale ontologies, ontology-engineering projects have evolved from small-scale efforts involving just a few domain experts to large-scale projects that require effective collaboration between dozens or even hundreds of experts, practitioners and other stakeholders. Understanding how these stakeholders collaborate will enable us to improve editing environments that support such collaborations. We uncover how large ontology-engineering projects, such as the ICD in its 11th revision, unfold by analyzing usage logs of five different biomedical ontology-engineering projects of varying sizes and scopes using Markov chains. We discover intriguing interaction patterns (e.g., which properties users subsequently change) that suggest that large collaborative ontology-engineering projects are governed by a few general principles that determine and drive development. From our analysis, we identify commonalities and differences between different projects that have implications for project managers, ontology editors, developers and contributors working on collaborative ontology-engineering projects and tools in the biomedical domain.Comment: Published in the Journal of Biomedical Informatic

arXiv.org e-Print Archive

CiteSeerX

Applications of the ACGT Master Ontology on Cancer

Author: A. Cali
A. Gómez-Pérez
A.L. Rector
B. Smith
B. Smith
C. Wittekind
J.E. Andrews
J.J. Cimino
K.H. Buetow
L. Martín
M. Tsiknakis
P. Grenon
W. Ceusters
W.A. Schulz
Publication venue
Publication date: 01/01/2008
Field of study

In this paper we present applications of the ACGT Master Ontology (MO) which is a new terminology resource for a transnational network providing data exchange in oncology, emphasizing the integration of both clinical and molecular data. The development of a new ontology was necessary due to problems with existing biomedical ontologies in oncology. The ACGT MO is a test case for the application of best practices in ontology development. This paper provides an overview of the application of the ontology within the ACGT project thus far