Search CORE

99 research outputs found

The Genia Event and Protein Coreference tasks of the BioNLP Shared Task 2011

Author: A Casillas
A Vlachos
A Vlachos
Akinori Yonezawa
C Quirk
D McClosky
D Tuggener
E Emadzadeh
H Kilicoglu
H Kilicoglu
H Liu
H Poon
J Björne
J Björne
J Björne
JD Kim
JD Kim
JD Kim
JD Kim
Jin-Dong Kim
Jun'ichi Tsujii
KB Cohen
L Hirschman
M Miwa
M Miwa
N Chinchor
N Nguyen
Ngan Nguyen
NL Nguyen
Q Le Minh
QC Bui
S Riedel
S Riedel
S Riedel
Toshihisa Takagi
Y Kim
Yue Wang
Publication venue: BioMed Central
Publication date
Field of study

Crossref

PubMed Central

Comparable Study of Event Extraction in Newswire and Biomedical Domains

Author: Ananiadou Sophia
Korkontzelos Yannis
Miwa Makoto
Thompson Paul
Publication venue
Publication date: 01/08/2014
Field of study

Edge Hill University Research Information Repository

Biological event composition

Author: A Andreevskaia
A Haghighi
A Kennedy
A Vlachos
AR Aronson
AT McCray
B Settles
C Fellbaum
D McClosky
E Charniak
E Miltsakaki
FR Palmer
H Kilicoglu
H Kilicoglu
H Kilicoglu
H Kilicoglu
H Poon
Halil Kilicoglu
IA Mel'čuk
J Björne
J Björne
J Lyons
J Sowa
JD Kim
JD Kim
JD Kim
K Baker
K Moilanen
K Yoshikawa
KB Cohen
L Danlos
L Polanyi
M Miwa
MC de Marneffe
N Asher
P Thompson
R Bossy
R Nairn
R Power
R Saurí
S Nirenburg
S Pyysalo
Sabine Bergler
T Cohen
T Wilson
WC Mann
Y Kano
Y Kim
Publication venue: BioMed Central
Publication date
Field of study

Crossref

PubMed Central

New Resources and Perspectives for Biomedical Event Extraction

Author: Ananiadou S
Kim J-D
Ohta T
Pyysalo S
Stenetorp P
Publication venue
Publication date: 01/01/2012
Field of study

Event extraction is a major focus of recent work in biomedical information extraction. Despite substantial advances, many challenges still remain for reliable automatic extraction of events from text. We introduce a new biomedical event extraction resource consisting of analyses automatically created by systems participating in the recent BioNLP Shared Task (ST) 2011. In providing for the first time the outputs of a broad set of state-ofthe-art event extraction systems, this resource opens many new opportunities for studying aspects of event extraction, from the identification of common errors to the study of effective approaches to combining the strengths of systems. We demonstrate these opportunities through a multi-system analysis on three BioNLP ST 2011 main tasks, focusing on events that none of the systems can successfully extract. We further argue for new perspectives to the performance evaluation of domain event extraction systems, considering a document-level, “off-the-page ” representation and evaluation to complement the mentionlevel evaluations pursued in most recent work.

CiteSeerX

The University of Manchester - Institutional Repository

Boosting automatic event extraction from the literature using domain adaptation and coreference resolution

Author: Ananiadou
Bodenreider
Graf
Kim
M. Miwa
Miwa
P. Thompson
Pyysalo
S. Ananiadou
Thompson
Publication venue: Oxford University Press
Publication date: 01/01/2012
Field of study

Motivation: In recent years, several biomedical event extraction (EE) systems have been developed. However, the nature of the annotated training corpora, as well as the training process itself, can limit the performance levels of the trained EE systems. In particular, most event-annotated corpora do not deal adequately with coreference. This impacts on the trained systems' ability to recognize biomedical entities, thus affecting their performance in extracting events accurately. Additionally, the fact that most EE systems are trained on a single annotated corpus further restricts their coverage

CiteSeerX

Crossref

PubMed Central

The University of Manchester - Institutional Repository

University of Turku in the BioNLP'11 Shared Task

Author: A Jimeno Yepes
D McClosky
D McClosky
de Marneffe
E Buyko
E Charniak
Filip Ginter
H Kilicoglu
H Kilicoglu
I Tsochantaridis
J Björne
J Björne
J Björne
J Heimonen
J Jourde
Jari Björne
JD Kim
JD Kim
JD Kim
JP Euzéby
M Miwa
M Miwa
MC de Marneffe
MF Porter
N Nguyen
P Stenetorp
R Bossy
S Pyysalo
S Pyysalo
S Riedel
S Riedel
S Riedel
S Van Landeghem
S Van Landeghem
T Ohta
Tapio Salakoski
Y Kim
Z Ratkovic
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Coreference based event-argument relation extraction on biomedical text

Author: Asahara Masayuki
Hirao Tsutomu
Matsumoto Yuji
Riedel Sebastian
Yoshikawa Katsumasa
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

This paper presents a new approach to exploit coreference information for extracting event-argument (E-A) relations from biomedical documents. This approach has two advantages: (1) it can extract a large number of valuable E-A relations based on the concept of salience in discourse; (2) it enables us to identify E-A relations over sentence boundaries (cross-links) using transitivity of coreference relations. We propose two coreference-based models: a pipeline based on Support Vector Machine (SVM) classifiers, and a joint Markov Logic Network (MLN). We show the effectiveness of these models on a biomedical event corpus. Both models outperform the systems that do not use coreference information. When the two proposed models are compared to each other, joint MLN outperforms pipeline SVM with gold coreference information

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

UCL Discovery

Biomedical Event Extraction with Machine Learning

Author: Björne Jari
Publication venue: TUCS Dissertations
Publication date: 28/10/2022
Field of study

Biomedical natural language processing (BioNLP) is a subfield of natural language processing, an area of computational linguistics concerned with developing programs that work with natural language: written texts and speech. Biomedical relation extraction concerns the detection of semantic relations such as protein--protein interactions (PPI) from scientific texts. The aim is to enhance information retrieval by detecting relations between concepts, not just individual concepts as with a keyword search. In recent years, events have been proposed as a more detailed alternative for simple pairwise PPI relations. Events provide a systematic, structural representation for annotating the content of natural language texts. Events are characterized by annotated trigger words, directed and typed arguments and the ability to nest other events. For example, the sentence ``Protein A causes protein B to bind protein C'' can be annotated with the nested event structure CAUSE(A, BIND(B, C)). Converted to such formal representations, the information of natural language texts can be used by computational applications. Biomedical event annotations were introduced by the BioInfer and GENIA corpora, and event extraction was popularized by the BioNLP'09 Shared Task on Event Extraction. In this thesis we present a method for automated event extraction, implemented as the Turku Event Extraction System (TEES). A unified graph format is defined for representing event annotations and the problem of extracting complex event structures is decomposed into a number of independent classification tasks. These classification tasks are solved using SVM and RLS classifiers, utilizing rich feature representations built from full dependency parsing.  Building on earlier work on pairwise relation extraction and using a generalized graph representation, the resulting TEES system is capable of detecting binary relations as well as complex event structures. We show that this event extraction system has good performance, reaching the first place in the BioNLP'09 Shared Task on Event Extraction. Subsequently, TEES has achieved several first ranks in the BioNLP'11 and BioNLP'13 Shared Tasks, as well as shown competitive performance in the binary relation Drug-Drug Interaction Extraction 2011 and 2013 shared tasks. The Turku Event Extraction System is published as a freely available open-source project, documenting the research in detail as well as making the method available for practical applications. In particular, in this thesis we describe the application of the event extraction method to PubMed-scale text mining, showing how the developed approach not only shows good performance, but is generalizable and applicable to large-scale real-world text mining projects. Finally, we discuss related literature, summarize the contributions of the work and present some thoughts on future directions for biomedical event extraction. This thesis includes and builds on six original research publications. The first of these introduces the analysis of dependency parses that leads to development of TEES. The entries in the three BioNLP Shared Tasks, as well as in the DDIExtraction 2011 task are covered in four publications, and the sixth one demonstrates the application of the system to PubMed-scale text mining.</p

UTUPub

Biomedical Event Extraction with Machine Learning

Author: Björne Jari
Publication venue: Turku Centre for Computer Science
Publication date: 07/08/2014
Field of study

Biomedical natural language processing (BioNLP) is a subfield of natural language processing, an area of computational linguistics concerned with developing programs that work with natural language: written texts and speech. Biomedical relation extraction concerns the detection of semantic relations such as protein-protein interactions (PPI) from scientific texts. The aim is to enhance information retrieval by detecting relations between concepts, not just individual concepts as with a keyword search. In recent years, events have been proposed as a more detailed alternative for simple pairwise PPI relations. Events provide a systematic, structural representation for annotating the content of natural language texts. Events are characterized by annotated trigger words, directed and typed arguments and the ability to nest other events. For example, the sentence “Protein A causes protein B to bind protein C” can be annotated with the nested event structure CAUSE(A, BIND(B, C)). Converted to such formal representations, the information of natural language texts can be used by computational applications. Biomedical event annotations were introduced by the BioInfer and GENIA corpora, and event extraction was popularized by the BioNLP'09 Shared Task on Event Extraction. In this thesis we present a method for automated event extraction, implemented as the Turku Event Extraction System (TEES). A unified graph format is defined for representing event annotations and the problem of extracting complex event structures is decomposed into a number of independent classification tasks. These classification tasks are solved using SVM and RLS classifiers, utilizing rich feature representations built from full dependency parsing. Building on earlier work on pairwise relation extraction and using a generalized graph representation, the resulting TEES system is capable of detecting binary relations as well as complex event structures. We show that this event extraction system has good performance, reaching the first place in the BioNLP'09 Shared Task on Event Extraction. Subsequently, TEES has achieved several first ranks in the BioNLP'11 and BioNLP'13 Shared Tasks, as well as shown competitive performance in the binary relation Drug-Drug Interaction Extraction 2011 and 2013 shared tasks. The Turku Event Extraction System is published as a freely available open-source project, documenting the research in detail as well as making the method available for practical applications. In particular, in this thesis we describe the application of the event extraction method to PubMed-scale text mining, showing how the developed approach not only shows good performance, but is generalizable and applicable to large-scale real-world text mining projects. Finally, we discuss related literature, summarize the contributions of the work and present some thoughts on future directions for biomedical event extraction. This thesis includes and builds on six original research publications. The first of these introduces the analysis of dependency parses that leads to development of TEES. The entries in the three BioNLP Shared Tasks, as well as in the DDIExtraction 2011 task are covered in four publications, and the sixth one demonstrates the application of the system to PubMed-scale text mining.Siirretty Doriast

UTUPub