Search CORE

47 research outputs found

Sortal anaphora resolution to enhance relation extraction from biomedical literature

Author: A Haghighi
A Rahman
AR Aronson
AR Aronson
AT McCray
BJ Grosz
C Gasperin
CD Manning
CM Miller
D Hristovski
D Weissenbacher
E Hovy
G Hripscak
G Rosemblat
Graciela Rosemblat
H Kilicoglu
H Kilicoglu
H Kilicoglu
H Kilicoglu
H Kilicoglu
H Kilicoglu
H Lee
Halil Kilicoglu
I Segura-Bedmar
J Castaño
J Cohen
J D’Souza
J Zheng
JD Kim
JJ Kim
K Yoshikawa
KB Cohen
LH Smith
M Choi
M Miwa
M Torii
Marcelo Fiszman
NLT Nguyen
O Bodenreider
P Stenetorp
P Thompson
S Bergsma
S Lappin
S Pradhan
T Lavergne
TC Rindflesch
Thomas C. Rindflesch
V Ng
V Ng
WM Soon
X Yang
Y Kim
Y Xu
Ö Uzuner
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Themes in biomedical natural language processing: BioNLP08

Author: K Verspoor
A Airola
A Roberts
P Corbett
Y Sasaki
X Wang
M Stevenson
Y Tsuruoka
V Vincze
H Kilicoglu
A Neveol
Publication venue: BioMed Central
Publication date: 01/01/1991
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Edinburgh Research Explorer

Institute of Mathematics AS CR, v. v. i.

Themes in biomedical natural language processing: BioNLP08

Author: A Airola
A Neveol
A Roberts
Bonnie Webber
Dina Demner-Fushman
H Kilicoglu
John Pestian
Jun'ichi Tsujii
K Bretonnel Cohen
K Verspoor
M Stevenson
P Corbett
Sophia Ananiadou
V Vincze
X Wang
Y Sasaki
Y Tsuruoka
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Edinburgh Research Explorer

Clustering cliques for graph-based summarization of the biomedical research literature

Author: A Naud
A Nenkova
A Ozgür
A Pons-Porrata
AR Aronson
AT McCray
AT McCray
Bartlomiej Wilkowski
C Wartena
Dongwook Shin
F Lerch
G Erkan
G Liu
GC Stein
H Kilicoglu
H Kilicoglu
H Yu
H Zhang
Han Zhang
I Mani
I Yoo
J Ah-Pine
J Goodwin
J Yang
JB Kruskal
K Sparck Jones
KW Boyack
L Smith
LH Reeve
LH Reeve
M Bundschus
M Fiszman
M Fiszman
M Kan
M Lee
Marcelo Fiszman
MG Everett
MJ Norusis
O Bodenreider
P Langfelder
P Tan
PJ Rousseeuw
R Mihalcea
SP Borgatti
T Matsunage
TC Rindflesch
TC Rindflesch
Thomas C Rindflesch
V an der Spek P Klusener S
V Batagelj
VD Blondel
X Liu
X Zhang
Y Yamamoto
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

BACKGROUND: Graph-based notions are increasingly used in biomedical data mining and knowledge discovery tasks. In this paper, we present a clique-clustering method to automatically summarize graphs of semantic predications produced from PubMed citations (titles and abstracts). RESULTS: SemRep is used to extract semantic predications from the citations returned by a PubMed search. Cliques were identified from frequently occurring predications with highly connected arguments filtered by degree centrality. Themes contained in the summary were identified with a hierarchical clustering algorithm based on common arguments shared among cliques. The validity of the clusters in the summaries produced was compared to the Silhouette-generated baseline for cohesion, separation and overall validity. The theme labels were also compared to a reference standard produced with major MeSH headings. CONCLUSIONS: For 11 topics in the testing data set, the overall validity of clusters from the system summary was 10% better than the baseline (43% versus 33%). While compared to the reference standard from MeSH headings, the results for recall, precision and F-score were 0.64, 0.65, and 0.65 respectively

Crossref

Springer - Publisher Connector

PubMed Central

Online Research Database In Technology

Detecting modification of biomedical events using a deep parsing approach

Author: A Copestake
A Copestake
A Copestake
A Frank
A MacKinlay
Andrew MacKinlay
B Medlock
C Pollard
D Flickinger
David Martinez
E Briscoe
E Buyko
E Velldal
G Móra
H Kilicoglu
H Uszkoreit
I Solt
J Björne
J Hakenberg
JD Kim
KB Cohen
P Adolphs
R Farkas
S Van Landeghem
Timothy Baldwin
U Callmeier
V Vincze
WW Chapman
Y Tsuruoka
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

Abstract Background This work describes a system for identifying event mentions in bio-molecular research abstracts that are either speculative (e.g. <it>analysis of IkappaBalpha phosphorylation</it>, where it is not specified whether phosphorylation did or did not occur) or negated (e.g. <it>inhibition of IkappaBalpha phosphorylation</it>, where phosphorylation did <it>not </it>occur). The data comes from a standard dataset created for the BioNLP 2009 Shared Task. The system uses a machine-learning approach, where the features used for classification are a combination of shallow features derived from the words of the sentences and more complex features based on the semantic outputs produced by a deep parser. Method To detect event modification, we use a Maximum Entropy learner with features extracted from the data relative to the trigger words of the events. The shallow features are bag-of-words features based on a small sliding context window of 3-4 tokens on either side of the trigger word. The deep parser features are derived from parses produced by the English Resource Grammar and the <it>RASP </it>parser. The outputs of these parsers are converted into the Minimal Recursion Semantics formalism, and from this, we extract features motivated by linguistics and the data itself. All of these features are combined to create training or test data for the machine learning algorithm. Results Over the test data, our methods produce approximately a 4% absolute increase in F-score for detection of event modification compared to a baseline based only on the shallow bag-of-words features. Conclusions Our results indicate that grammar-based techniques can enhance the accuracy of methods for detecting event modification.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

University of Melbourne Institutional Repository

Improving EGM2008 by GPS and leveling data at local scale

Author: ABBAK R.A.
BENAHMED DAHOA S.A.
BENAHMED DAHOA S.A.
CORCHETE V.
EROL B.
FEATHERSTONE W. E.
FEATHERSTONE W.E.
FOGEL D.N
HARDY R. L.
KIAMEHR R.
KILICOGLU A.
KOTSAKIS C.
LAZZARO D
LEMOINE F. G.
MORITZ H.
OLLIKAINEN M.
PAVLIS N. K.
PAVLIS N.K
POTTMANN H.
SOYCAN A.
SOYCAN A.
SOYCAN M.
SOYCAN M.
TORGE W.
Publication venue: 'FapUNIFESP (SciELO)'
Publication date
Field of study

Crossref

Overview of the ID, EPI and REL tasks of BioNLP Shared Task 2011

Author: A Morgan
A Riggs
A Vlachos
A Yeh
Bruno Sobral
C Arighi
C Nédellec
C Quirk
C Wang
CH Wei
CH Wu
Chunhong Mao
Chunxia Wang
D Barford
D McClosky
D McClosky
D McClosky
D McClosky
D Rebholz-Schuhmann
D Tikk
Dan Sullivan
DD Sleator
E Buyko
E Charniak
ES Witze
EW Noreen
H Kilicoglu
H Kilicoglu
H Kilicoglu
H Lee
H Liu
H Liu
H Poon
J Björne
J Björne
J Björne
J Björne
J Hakenberg
J Stock
J Tsujii
J Wermter
J Wilbur
JD Kim
JD Kim
JD Kim
JD Kim
JD Kim
JD Kim
Jun'ichi Tsujii
K Yoshikawa
L Hirschman
L McGrath
L Tanabe
M Ashburner
M Gerner
M Glickman
M Krallinger
M Miwa
M Miwa
M Narayanaswamy
M Ongenaert
M Porter
M Porter
MC de Marneffe
ME Winston
MS Simpson
N Chinchor
N Chinchor
N Nguyen
O Bodenreider
P Corbett
P Stenetorp
P Stenetorp
P Thomason
P Thompson
P Zweigenbaum
Q Le Minh
R Farkas
R Hoehndorf
R Holliday
R Holliday
R Jaenisch
R Leaman
Rafal Rak
S Ananiadou
S Ananiadou
S Ananiadou
S Pyysalo
S Pyysalo
S Pyysalo
S Pyysalo
S Pyysalo
S Pyysalo
S Pyysalo
S Riedel
S Riedel
S Riedel
S Riedel
S Strassel
S Van Landeghem
S Van Landeghem
S Van Landeghem
S Van Landeghem
Sampo Pyysalo
Sophia Ananiadou
T Krell
T Mascher
T Ohta
T Ohta
T Ohta
T Ohta
T Ohta
T Ohta
T Ohta
Tomoko Ohta
V Vincze
W Hersh
X Yuan
Y Gotoh
Y Sasaki
Y Tateisi
Y Wang
ZZ Hu
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

We present the preparation, resources, results and analysis of three tasks of the BioNLP Shared Task 2011: the main tasks on Infectious Diseases (ID) and Epigenetics and Post-translational Modifications (EPI), and the supporting task on Entity Relations (REL). The two main tasks represent extensions of the event extraction model introduced in the BioNLP Shared Task 2009 (ST'09) to two new areas of biomedical scientific literature, each motivated by the needs of specific biocuration tasks. The ID task concerns the molecular mechanisms of infection, virulence and resistance, focusing in particular on the functions of a class of signaling systems that are ubiquitous in bacteria. The EPI task is dedicated to the extraction of statements regarding chemical modifications of DNA and proteins, with particular emphasis on changes relating to the epigenetic control of gene expression. By contrast to these two application-oriented main tasks, the REL task seeks to support extraction in general by separating challenges relating to part-of relations into a subproblem that can be addressed by independent systems. Seven groups participated in each of the two main tasks and four groups in the supporting task. The participating systems indicated advances in the capability of event extraction methods and demonstrated generalization in many aspects: from abstracts to full texts, from previously considered subdomains to new ones, and from the ST'09 extraction targets to other entities and events. The highest performance achieved in the supporting task REL, 58% F-score, is broadly comparable with levels reported for other relation extraction tasks. For the ID task, the highest-performing system achieved 56% F-score, comparable to the state-of-the-art performance at the established ST'09 task. In the EPI task, the best result was 53% F-score for the full set of extraction targets and 69% F-score for a reduced set of core extraction targets, approaching a level of performance sufficient for user-facing applications. In this study, we extend on previously reported results and perform further analyses of the outputs of the participating systems. We place specific emphasis on aspects of system performance relating to real-world applicability, considering alternate evaluation metrics and performing additional manual analysis of system outputs. We further demonstrate that the strengths of extraction systems can be combined to improve on the performance achieved by any system in isolation. The manually annotated corpora, supporting resources, and evaluation tools for all tasks are available from http://www.bionlp-st.org and the tasks continue as open challenges for all interested parties

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

The University of Manchester - Institutional Repository