Search CORE

7 research outputs found

Semantic annotation of morphological descriptions: an overall strategy

Author: A Taylor
D Kirkup
E Riloff
G Curry
G Diggs
G Sautter
H Cui
H Cui
H Cui
H Cui
H Cui
H Cui
Hong Cui
MM Wood
R Abascal
S Lydon
S Soderland
X Tang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Introducing Explorer of Taxon Concepts with a case study on spider measurement matrix building

Author
Publication venue: BioMed Central
Publication date: 17/11/2016
Field of study

Springer - Publisher Connector

Applications of Natural Language Processing in Biodiversity Science

Author: Cui Hong
Mozzherin Dmitry
Thessen Anne E.
Publication venue: Hindawi Publishing Corporation
Publication date: 01/01/2012
Field of study

Centuries of biological knowledge are contained in the massive body of scientific literature, written for human-readability but too big for any one person to consume. Large-scale mining of information from the literature is necessary if biology is to transform into a data-driven science. A computer can handle the volume but cannot make sense of the language. This paper reviews and discusses the use of natural language processing (NLP) and machine-learning algorithms to extract information from systematic literature. NLP algorithms have been used for decades, but require special development for application in the biological realm due to the special nature of the language. Many tools exist for biological information extraction (cellular processes, taxonomic names, and morphological characters), but none have been applied life wide and most still require testing and development. Progress has been made in developing algorithms for automated annotation of taxonomic text, identification of taxonomic names in text, and extraction of morphological character information from taxonomic descriptions. This manuscript will briefly discuss the key steps in applying information extraction tools to enhance biodiversity science

Crossref

Woods Hole Open Access Server

Directory of Open Access Journals

PubMed Central

Recommended from our members

Ontologies as Integrative Tools for Plant Science

Author: Athreya Balaji
Cooper Laurel
Elser Justin
Gandolfo Maria A.
Jaiswal Pankaj
Mungall Christopher J.
Preece Justin
Rensing Stefan
Smith Barry
Stevenson Dennis W.
Walls Ramona L.
Publication venue: 'Botanical Society of America'
Publication date
Field of study

Premise of the study: Bio-ontologies are essential tools for accessing and analyzing the rapidly growing pool of plant genomic and phenomic data. Ontologies provide structured vocabularies to support consistent aggregation of data and a semantic framework for automated analyses and reasoning. They are a key component of the semantic web. Methods: This paper provides background on what bio-ontologies are, why they are relevant to botany, and the principles of ontology development. It includes an overview of ontologies and related resources that are relevant to plant science, with a detailed description of the Plant Ontology (PO). We discuss the challenges of building an ontology that covers all green plants (Viridiplantae). Key results: Ontologies can advance plant science in four keys areas: (1) comparative genetics, genomics, phenomics, and development; (2) taxonomy and systematics; (3) semantic applications; and (4) education. Conclusions: Bio-ontologies offer a flexible framework for comparative plant biology, based on common botanical understanding. As genomic and phenomic data become available for more species, we anticipate that the annotation of data with ontology terms will become less centralized, while at the same time, the need for cross-species queries will become more common, causing more researchers in plant science to turn to ontologies.Keywords: Bio-ontologies, Plant Ontology, Plant genomics, OBO Foundry, Plant systematics, Plant anatomy, Genome annotation, Semantic web, PhenomicsKeywords: Bio-ontologies, Plant Ontology, Plant genomics, OBO Foundry, Plant systematics, Plant anatomy, Genome annotation, Semantic web, Phenomic

ScholarsArchive@OSU

Introducing Explorer of Taxon Concepts with a case study on spider measurement matrix building

Author: A Hardisty
AE Thessen
AR Deans
BC WorkShop
Bertram Ludäscher
DG Howe
Dongfang Xu
DR Maddison
Eduardo M. Soto
F Huang
FM Labarque
H Cui
H Cui
H Cui
H Cui
H Cui
H Cui
Hong Cui
J Liu
JA Blake
JA Miller
James A. Macklin
JB Bowes
JL Salle
JP Balhoff
L Màrquez
M Palmer
M Sevenster
Martin Ramirez
MJ Ramírez
MM Wood
Nicolás Mongiardino Koch
O Uzuner
PC Sereno
RA Vos
Robert A. Morris
RW Kiger
S Aisen
S Soderland
Steven S. Chong
T Catapano
Thomas Rodenhausen
Y Bradford
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Competency evaluation of plant character ontologies against domain literature

Author: Atkinson
Chakaravarthy
Chan
Dallwitz
Dallwitz
Flora of China Editorial Committee
Gruber
Gruber
Gómez-Pérez
Illic
Lenat
Lombard
Maedche
National Information Standards Organization
Pammer
Radford
Smith
Staab
Swartout
Watson
Zhou
Publication venue: 'Wiley'
Publication date: 01/01/2010
Field of study

Crossref

Recommended from our members

The classification of gene products in the molecular biology domain: Realism, objectivity, and the limitations of the Gene Ontology

Author: Mayor Charlie
Publication venue
Publication date
Field of study

Background: Controlled vocabularies in the molecular biology domain exist to facilitate data integration across database resources. One such tool is the Gene Ontology (GO), a classification designed to act as a universal index for gene products from any species. The Gene Ontology is used extensively in annotating gene products and analysing gene expression data, yet very little research exists from a library and information science perspective exploring the design principles, philosophy and social role of ontologies in biology. Aim: To explore how molecular biologists, in creating the Gene Ontology, devised guidelines and rules for determining which scientific concepts are included in the ontology, and the criteria for how these concepts are represented. Methods: A domain analysis approach was used to devise a mixed methodology to study the design of the Gene Ontology. Concept analysis of a GO term and a critical discourse analysis of GO developer mailing list texts were used to test whether ontological realism is a tenable basis for constructing objective ontologies. A comparison of the current GO vocabulary construction guidelines and a study of the reasons why GO terms are removed from the ontology further explored the justifications for the design of the Gene Ontology. Finally, a content analysis of published GO papers examined how authors use and cite GO data and terminology. Results: Gene Ontology terms can be presented according to different epistemologies for concepts, indicating that ontological realism is not the only way objective ontologies can be designed. Social roles and the exercise of power were found to play an important role in determining ontology content, and poor synonym control, a lack of clear warrant for deciding terminology and arbitrary decisions to delete and invent new terms undermine the objectivity and universal applicability of the Gene Ontology. Authors exhibited poor compliance with GO data citation policies, and in re-wording and misquoting GO terminology, risk exacerbating the semantic problems this controlled vocabulary was designed to solve. Conclusions: The failure of the Gene Ontology to define what is meant by a molecular function, the exercise of power by GO developers in clearing contentious concepts from the ontology, and the strict adherence to ontological realism, which marginalises social and subjective ways of classifying scientific concepts, limits the utility of the ontology as a tool to unify the molecular biology domain. These limitations to the Gene Ontology design could be overcome with the development of lighter, pluralistic, user-controlled ‘open ontologies’ for gene products that can work alongside more traditional, ‘top-down’ developed vocabularies

City Research Online