53 research outputs found

Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies

Author: Björne J
Fares M
Johansson R
Oepen S
Øvrelid L
Publication date: 27/10/2022

UTUPub

The Genia Event and Protein Coreference tasks of the BioNLP Shared Task 2011

Author: A Casillas
A Vlachos
Akinori Yonezawa
C Quirk
D McClosky
D Tuggener
E Emadzadeh
H Kilicoglu
H Liu
H Poon
J Björne
JD Kim
Jin-Dong Kim
Jun'ichi Tsujii
KB Cohen
L Hirschman
M Miwa
N Chinchor
N Nguyen
Ngan Nguyen
NL Nguyen
Q Le Minh
QC Bui
S Riedel
Toshihisa Takagi
Y Kim
Yue Wang
Publication venue: BioMed Central

Crossref

PubMed Central

All-paths graph kernel for protein-protein interaction extraction with evaluation of cross-corpus learning

Author: A Airola
A Yakushiji
AB Clegg
Antti Airola
AP Bradley
C Giuliano
C Nédellec
CD Meyer
D Zelenko
Filip Ginter
J Björne
J Ding
J Heimonen
JA Hanley
JAK Suykens
Jari Björne
JD Kim
JG Caporaso
K Fundel
KB Cohen
L Hirschman
L Hunter
M Lease
M Miwa
MC de Marneffe
P Zweigenbaum
R Bunescu
R Rifkin
R Sætre
S Pyysalo
S Van Landeghem
Sampo Pyysalo
T Gärtner
T Mitsumori
T Pahikkala
Tapio Pahikkala
Tapio Salakoski
Y Miyao
Publication venue: BioMed Central
Publication date: 01/01/2008

Crossref

Springer - Publisher Connector

PubMed Central

U-Compare bio-event meta-service: compatible BioNLP event extraction services

Abstract Background Bio-molecular event extraction from literature is recognized as an important task of bio text mining and, as such, many relevant systems have been developed and made available during the last decade. While such systems provide useful services individually, there is a need for a meta-service to enable comparison and ensemble of such services, offering optimal solutions for various purposes. Results We have integrated nine event extraction systems in the U-Compare framework, making them inter-compatible and interoperable with other U-Compare components. The U-Compare event meta-service provides various meta-level features for comparison and ensemble of multiple event extraction systems. Experimental results show that the performance improvements achieved by the ensemble are significant. Conclusions While individual event extraction systems themselves provide useful features for bio text mining, the U-Compare meta-service is expected to improve the accessibility to the individual systems, and to enable meta-level uses over multiple event extraction systems such as comparison and ensemble. This research was partially supported by KAKENHI 18002007 [YK, MM, JDK, SP, TO, JT]; JST PRESTO and KAKENHI 21500130 [YK]; the Academy of Finland, with computational resources provided by CSC -- IT Center for Science Ltd [JB, FG]; the Research Foundation Flanders (FWO) [SVL]; the UK Biotechnology and Biological Sciences Research Council (BBSRC project BB/G013160/1 Automated Biological Event Extraction from the Literature for Drug Discovery) and JISC, National Centre for Text Mining [SA]; the Spanish grant BIO2010-17527 [MN, APM]; NIH Grant U54 DA021519 [AO, DRR]. Peer Reviewed

Crossref

Springer - Publisher Connector

Ghent University Academic Bibliography

PubMed Central

UCL Discovery

Digital.CSIC

The University of Manchester - Institutional Repository

NORA - Norwegian Open Research Archives

Secretaría de Estado de Cultura

Deep Blue Documents

Comparative analysis of five protein-protein interaction corpora

Author: A Rzhetsky
Antti Airola
C Blaschke
C Nédellec
D Klein
DJ Best
Filip Ginter
HL Johnson
J Ding
J Kim
Jari Björne
JD Wren
Juho Heimonen
K Fundel
KB Cohen
L Smith
M Light
N Daraselia
R Bunescu
R Ihaka
S Pyysalo
Sampo Pyysalo
Tapio Salakoski
WJ Wilbur
Publication venue: BioMed Central
Publication date: 01/01/2008

Abstract Background Growing interest in the application of natural language processing methods to biomedical text has led to an increasing number of corpora and methods targeting protein-protein interaction (PPI) extraction. However, there is no general consensus regarding PPI annotation and consequently resources are largely incompatible and methods are difficult to evaluate. Results We present the first comparative evaluation of the diverse PPI corpora, performing quantitative evaluation using two separate information extraction methods as well as detailed statistical and qualitative analyses of their properties. For the evaluation, we unify the corpus PPI annotations to a shared level of information, consisting of undirected, untyped binary interactions of non-static types with no identification of the words specifying the interaction, no negations, and no interaction certainty. We find that the F-score performance of a state-of-the-art PPI extraction method varies on average 19 percentage units and in some cases over 30 percentage units between the different evaluated corpora. The differences stemming from the choice of corpus can thus be substantially larger than differences between the performance of PPI extraction methods, which suggests definite limits on the ability to compare methods evaluated on different resources. We analyse a number of potential sources for these differences and identify factors explaining approximately half of the variance. We further suggest ways in which the difficulty of the PPI extraction tasks codified by different corpora can be determined to advance comparability. Our analysis also identifies points of agreement and disagreement in PPI corpus annotation that are rarely explicitly stated by the authors of the corpora. Conclusions Our comparative analysis uncovers key similarities and differences between the diverse PPI corpora, thus taking an important step towards standardization. 
In the course of this study we have created a major practical contribution in converting the corpora into a shared format. The conversion software is freely available at http://mars.cs.utu.fi/PPICorpora.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Biological event composition

Author: A Andreevskaia
A Haghighi
A Kennedy
A Vlachos
AR Aronson
AT McCray
B Settles
C Fellbaum
D McClosky
E Charniak
E Miltsakaki
FR Palmer
H Kilicoglu
H Poon
Halil Kilicoglu
IA Mel'čuk
J Björne
J Lyons
J Sowa
JD Kim
K Baker
K Moilanen
K Yoshikawa
KB Cohen
L Danlos
L Polanyi
M Miwa
MC de Marneffe
N Asher
P Thompson
R Bossy
R Nairn
R Power
R Saurí
S Nirenburg
S Pyysalo
Sabine Bergler
T Cohen
T Wilson
WC Mann
Y Kano
Y Kim
Publication venue: BioMed Central

Crossref

PubMed Central

Classifying protein-protein interaction articles using word and syntactic features

Author: A Ceol
B Aranda
B Settles
C Blaschke
D Rebholz-Schuhmann
E Buyko
GD Bader
GEAPA Batista
H Jang
HJ Lowe
J Björne
JR Curran
K Sugiyama
L Salwinski
L Tanabe
LH Smith
M Huang
M Krallinger
M Kubat
MF Porter
P Baldi
RK Ando
S Kim
S Nash
Sun Kim
T Mitsumori
T Zhang
VN Vapnik
W John Wilbur
Y Miyao
Y Niu
Publication venue: BioMed Central
Publication date: 01/01/2011

Crossref

Springer - Publisher Connector

PubMed Central

A context-blocks model for identifying clinical relationships in patient records

Author: A Névéol
A Roberts
AK McCallum
AM Cohen
AR Aronson
Aurélie Névéol
C Friedman
ES Chen
F Leitner
H Shatkay
H Xu
J Aberdeen
J Björne
J Lafferty
L Smith
L Tanabe
M Bundschus
M Craven
M Krallinger
N Ponomareva
O Uzuner
R Harpaz
R Islamaj Doğan
Rezarta Islamaj Doğan
SM Meystre
SV Pakhomov
TC Rindflesch
X Wang
Zhiyong Lu
Publication venue: BioMed Central
Publication date: 01/01/2011

Crossref

Springer - Publisher Connector

PubMed Central

Overview of the ID, EPI and REL tasks of BioNLP Shared Task 2011

Author: A Morgan
A Riggs
A Vlachos
A Yeh
Bruno Sobral
C Arighi
C Nédellec
C Quirk
C Wang
CH Wei
CH Wu
Chunhong Mao
Chunxia Wang
D Barford
D McClosky
D Rebholz-Schuhmann
D Tikk
Dan Sullivan
DD Sleator
E Buyko
E Charniak
ES Witze
EW Noreen
H Kilicoglu
H Lee
H Liu
H Poon
J Björne
J Hakenberg
J Stock
J Tsujii
J Wermter
J Wilbur
JD Kim
Jun'ichi Tsujii
K Yoshikawa
L Hirschman
L McGrath
L Tanabe
M Ashburner
M Gerner
M Glickman
M Krallinger
M Miwa
M Narayanaswamy
M Ongenaert
M Porter
MC de Marneffe
ME Winston
MS Simpson
N Chinchor
N Nguyen
O Bodenreider
P Corbett
P Stenetorp
P Thomason
P Thompson
P Zweigenbaum
Q Le Minh
R Farkas
R Hoehndorf
R Holliday
R Jaenisch
R Leaman
Rafal Rak
S Ananiadou
S Pyysalo
S Riedel
S Strassel
S Van Landeghem
Sampo Pyysalo
Sophia Ananiadou
T Krell
T Mascher
T Ohta
Tomoko Ohta
V Vincze
W Hersh
X Yuan
Y Gotoh
Y Sasaki
Y Tateisi
Y Wang
ZZ Hu
Publication venue: BioMed Central
Publication date: 01/01/2012

We present the preparation, resources, results and analysis of three tasks of the BioNLP Shared Task 2011: the main tasks on Infectious Diseases (ID) and Epigenetics and Post-translational Modifications (EPI), and the supporting task on Entity Relations (REL). The two main tasks represent extensions of the event extraction model introduced in the BioNLP Shared Task 2009 (ST'09) to two new areas of biomedical scientific literature, each motivated by the needs of specific biocuration tasks. The ID task concerns the molecular mechanisms of infection, virulence and resistance, focusing in particular on the functions of a class of signaling systems that are ubiquitous in bacteria. The EPI task is dedicated to the extraction of statements regarding chemical modifications of DNA and proteins, with particular emphasis on changes relating to the epigenetic control of gene expression. By contrast to these two application-oriented main tasks, the REL task seeks to support extraction in general by separating challenges relating to part-of relations into a subproblem that can be addressed by independent systems. Seven groups participated in each of the two main tasks and four groups in the supporting task. The participating systems indicated advances in the capability of event extraction methods and demonstrated generalization in many aspects: from abstracts to full texts, from previously considered subdomains to new ones, and from the ST'09 extraction targets to other entities and events. The highest performance achieved in the supporting task REL, 58% F-score, is broadly comparable with levels reported for other relation extraction tasks. For the ID task, the highest-performing system achieved 56% F-score, comparable to the state-of-the-art performance at the established ST'09 task. 
In the EPI task, the best result was 53% F-score for the full set of extraction targets and 69% F-score for a reduced set of core extraction targets, approaching a level of performance sufficient for user-facing applications. In this study, we extend previously reported results and perform further analyses of the outputs of the participating systems. We place specific emphasis on aspects of system performance relating to real-world applicability, considering alternate evaluation metrics and performing additional manual analysis of system outputs. We further demonstrate that the strengths of extraction systems can be combined to improve on the performance achieved by any system in isolation. The manually annotated corpora, supporting resources, and evaluation tools for all tasks are available from http://www.bionlp-st.org and the tasks continue as open challenges for all interested parties.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

The University of Manchester - Institutional Repository