Search CORE

95 research outputs found

BiOnt: Deep Learning using Multiple Biomedical Ontologies for Relation Extraction

Author: A Lamurias
AC Yu
B Xu
F Li
FM Couto
J Hastings
J Li
LM Schriml
M Ashburner
M Herrero-Zazo
R Campaner
S Köhler
T Mikolov
The Gene Ontology Consortium
W Bechtel
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 21/04/2020
Field of study

Successful biomedical relation extraction can provide evidence to researchers and clinicians about possible unknown associations between biomedical entities, advancing the current knowledge we have about those entities and their inherent mechanisms. Most biomedical relation extraction systems do not resort to external sources of knowledge, such as domain-specific ontologies. However, using deep learning methods, along with biomedical ontologies, has been recently shown to effectively advance the biomedical relation extraction field. To perform relation extraction, our deep learning system, BiOnt, employs four types of biomedical ontologies, namely, the Gene Ontology, the Human Phenotype Ontology, the Human Disease Ontology, and the Chemical Entities of Biological Interest, regarding gene-products, phenotypes, diseases, and chemical compounds, respectively. We tested our system with three data sets that represent three different types of relations of biomedical entities. BiOnt achieved, in F-score, an improvement of 4.93 percentage points for drug-drug interactions (DDI corpus), 4.99 percentage points for phenotype-gene relations (PGR corpus), and 2.21 percentage points for chemical-induced disease relations (BC5CDR corpus), relatively to the state-of-the-art. The code supporting this system is available at https://github.com/lasigeBioTM/BiOnt.Comment: ECIR 202

arXiv.org e-Print Archive

Crossref

Medical Subject Heading (MeSH) annotations illuminate maize genetics and evolution

Author: AJ Lorenz
CE Lipscomb
CN Hirsch
D Maglott
G Morota
G Morota
Gene Ontology Consortium
Gota Morota
H Wang
H Wang
J Doebley
J Dorweiler
K Tsuyuzaki
L du Plessis
LD Gottlieb
LM Schriml
M Ashburner
M Kanehisa
MB Hufford
MD Rausher
N De Leon
N Škunca
P Pavlidis
PJ Brown
PS Schnable
R Balakrishnan
R Maita
RS Sekhon
S Durinck
S Falcon
T Nakazato
T Ogura
Timothy M. Beissinger
TM Beissinger
W Huber
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Network-based analysis of genetic variants associated with hippocampal volume in Alzheimer’s disease: a study of ADNI cohorts

Author: A Hamosh
Aaron K. Wong
AE Sedeno-Cortes
Ailin Song
AJ Saykin
Andrew J. Saykin
C Ballard
Casey S. Greene
CR Jack Jr
JA Webster
JB Wright
JC Lambert
Jingwen Yan
JZ Liu
Li Shen
LM Schriml
LM Shaw
RC Petersen
S Kim
Shannon Leigh Risacher
SL Risacher
Sungeun Kim
WJ Jagust
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

The clinical trial landscape in oncology and connectivity of somatic mutational profiles to targeted therapies

Author: AB Turke
DB Costa
F Spagnolo
G Ananda
GR Brown
H Yasuda
JT Dunnen den
KA Gray
KW Robinson
LM Schriml
M Gymnopoulos
M Sana
N Roper
R Ramakrishna
S Maio Di
SA Forbes
V Petri
Y Yuan
YJ Choi
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Text Mining the History of Medicine

Author: A Henriksson
AR Aronson
C Mihăilă
Carsten Timmermann
D Lopresti
D McClosky
Elizabeth Toon
G Hripcsak
G Schneider
Georgios Kontonatsios
H Moen
H Suominen
J Cohen
J-D Kim
Jacob Carter
John McNaught
JR Firth
K Bontcheva
KB Wagholikar
L Kelly
LM Schriml
Luis M. Rocha
M Miwa
M Miwa
M Ruiz-Casado
M Worboys
MA Hearst
Michael Worboys
N Alnazzawi
O Bodenreider
P Murrieta-Flores
P Thompson
Paul Thompson
R Prasad
RI Dogan
Riza Theresa Batista-Navarro
S Jonnalagadda
S Pyysalo
S Zhang
Sophia Ananiadou
T Hitchcock
TH Tanner
Y Tsuruoka
Y Tsuruoka
Y Tsuruoka
Y Wang
Z Liu
ZS Harris
Ö Uzuner
Ö Uzuner
Ö Uzuner
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 06/01/2016
Field of study

Historical text archives constitute a rich and diverse source of information, which is becoming increasingly readily accessible, due to large-scale digitisation efforts. However, it can be difficult for researchers to explore and search such large volumes of data in an efficient manner. Text mining (TM) methods can help, through their ability to recognise various types of semantic information automatically, e.g., instances of concepts (places, medical conditions, drugs, etc.), synonyms/variant forms of concepts, and relationships holding between concepts (which drugs are used to treat which medical conditions, etc.). TM analysis allows search systems to incorporate functionality such as automatic suggestions of synonyms of user-entered query terms, exploration of different concepts mentioned within search results or isolation of documents in which concepts are related in specific ways. However, applying TM methods to historical text can be challenging, according to differences and evolutions in vocabulary, terminology, language structure and style, compared to more modern text. In this article, we present our efforts to overcome the various challenges faced in the semantic analysis of published historical medical text dating back to the mid 19th century. Firstly, we used evidence from diverse historical medical documents from different periods to develop new resources that provide accounts of the multiple, evolving ways in which concepts, their variants and relationships amongst them may be expressed. These resources were employed to support the development of a modular processing pipeline of TM tools for the robust detection of semantic information in historical medical documents with varying characteristics. We applied the pipeline to two large-scale medical document archives covering wide temporal ranges as the basis for the development of a publicly accessible semantically-oriented search system. The novel resources are available for research purposes, while the processing pipeline and its modules may be used and configured within the Argo TM platform

Crossref

Directory of Open Access Journals

Edge Hill University Research Information Repository

PubMed Central

The University of Manchester - Institutional Repository

Using WormBase: A Genome Biology Resource for Caenorhabditis elegans and Related Nematodes

Author: A Kalderimis
A Mitchell
AG Alexander
AJ Bretscher
AJ Vilella
C Camacho
C Trapnell
C Trapnell
D Angeles-Albores
DB Rhee
E Culetto
G Schindelman
Gene Ontology Consortium
H Li
H Motenko
I Greenwald
I Lee
I Lee
J Giacomotto
J Li
J Zheng
J-F Rual
JS Amberger
K-W Park
KL Howe
LD Stein
LM Schriml
LP O’Reilly
M Artal-Sanz
MB Gerstein
ME Skinner
OE Blacque
P Gaudet
R Balakrishnan
R Lyne
R O’Hagan
RC Edgar
RD Finn
RN Smith
RP Huntley
RP Huntley
RS Kamath
RYN Lee
S Burge
S Contrino
S Powell
S-J Lee
SF Altschul
SF Altschul
The Gene Ontology Consortium
TW Harris
W Zhong
WA Kibbe
WJ Kent
Y Nakamura
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 15/05/2018
Field of study

WormBase (www.wormbase.org) provides the nematode research community with a centralized database for information pertaining to nematode genes and genomes. As more nematode genome sequences are becoming available and as richer data sets are published, WormBase strives to maintain updated information, displays, and services to facilitate efficient access to and understanding of the knowledge generated by the published nematode genetics literature. This chapter aims to provide an explanation of how to use basic features of WormBase, new features, and some commonly used tools and data queries. Explanations of the curated data and step-by-step instructions of how to access the data via the WormBase website and available data mining tools are provided

Crossref

Caltech Authors

Standardized metadata for human pathogen/vector genomic sequences

Author: Barrett T
Birren B
Brinkac L
Bruno VM
Caler E
Chapman S
Collins FH
Cuomo CA
Di Francesco V
Dugan VG
Durkin S
Emrich SJ
Eppinger M
Feldgarden M
Fraser C
Fricke WF
Giovanni M
Giraldo-Calderón GI
Harb OS
Henn MR
Hine E
Hotopp JD
Karsch-Mizrachi I
Kissinger JC
Lee EM
Mathur P
Mongodin EF
Murphy CI
Myers G
Neafsey DE
Nelson KE
Newman RM
Nierman WC
Pickett BE
Puzak J
Rasko D
Roos DS
Sadzewicz L
Scheuermann RH
Schriml LM
Silva JC
Singh I
Sobral B
Squires RB
Stevens RL
Stockwell TB
Stoeckert CJ
Sullivan DE
Tallon L
Tettelin H
Ward DV
Wentworth D
White O
Will R
Wortman J
Yao A
Zhang Y
Zheng J
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 17/06/2014
Field of study

High throughput sequencing has accelerated the determination of genome sequences for thousands of human infectious disease pathogens and dozens of their vectors. The scale and scope of these data are enabling genotype-phenotype association studies to identify genetic determinants of pathogen virulence and drug/insecticide resistance, and phylogenetic studies to track the origin and spread of disease outbreaks. To maximize the utility of genomic sequences for these purposes, it is essential that metadata about the pathogen/vector isolate characteristics be collected and made available in organized, clear, and consistent formats. Here we report the development of the GSCID/BRC Project and Sample Application Standard, developed by representatives of the Genome Sequencing Centers for Infectious Diseases (GSCIDs), the Bioinformatics Resource Centers (BRCs) for Infectious Diseases, and the U.S. National Institute of Allergy and Infectious Diseases (NIAID), part of the National Institutes of Health (NIH), informed by interactions with numerous collaborating scientists. It includes mapping to terms from other data standards initiatives, including the Genomic Standards Consortium's minimal information (MIxS) and NCBI's BioSample/BioProjects checklists and the Ontology for Biomedical Investigations (OBI). The standard includes data fields about characteristics of the organism or environmental source of the specimen, spatial-temporal information about the specimen isolation event, phenotypic characteristics of the pathogen/vector isolated, and project leadership and support. By modeling metadata fields into an ontology-based semantic framework and reusing existing ontologies and minimum information checklists, the application standard can be extended to support additional project-specific data fields and integrated with other data represented with comparable standards. The use of this metadata standard by all ongoing and future GSCID sequencing projects will provide a consistent representation of these data in the BRC resources and other repositories that leverage these data, allowing investigators to identify relevant genomic sequences and perform comparative genomics analyses that are both statistically meaningful and biologically relevant

OPUS - University of Technology Sydney