Search CORE

66 research outputs found

Sortal anaphora resolution to enhance relation extraction from biomedical literature

Author: A Haghighi
A Rahman
AR Aronson
AR Aronson
AT McCray
BJ Grosz
C Gasperin
CD Manning
CM Miller
D Hristovski
D Weissenbacher
E Hovy
G Hripscak
G Rosemblat
Graciela Rosemblat
H Kilicoglu
H Kilicoglu
H Kilicoglu
H Kilicoglu
H Kilicoglu
H Kilicoglu
H Lee
Halil Kilicoglu
I Segura-Bedmar
J Castaño
J Cohen
J D’Souza
J Zheng
JD Kim
JJ Kim
K Yoshikawa
KB Cohen
LH Smith
M Choi
M Miwa
M Torii
Marcelo Fiszman
NLT Nguyen
O Bodenreider
P Stenetorp
P Thompson
S Bergsma
S Lappin
S Pradhan
T Lavergne
TC Rindflesch
Thomas C. Rindflesch
V Ng
V Ng
WM Soon
X Yang
Y Kim
Y Xu
Ö Uzuner
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Machine learning applied to the h index of colombian authors with publications in scopus

Author: A Abrishami
A Ibáñez
G Csomós
H Kilicoglu
J Demšar
MA Quinapanta
Maritza Torres-Samuel
Maritza Torres-Samuel
Mignon L. Wuestman
Nathanael J Fast
R Rosales
Publication venue: 'Corporation Universidad de la Costa, CUC'
Publication date: 01/01/2019
Field of study

Our research aims to establish how to predict the H index of Colombian authors with publications in Scopus until 2016. The selection of the date was because, as mentioned earlier, the number of documents indexed per year exceeded 10,000 and they obtained the highest number of documents cited. To accomplish this purpose, a quantitative, nonexperimental, cross-sectional, descriptive, explanatory, and predictive research was designed using supervised learning algorithms. These were applied to information from 8,840 Colombian authors. Among the findings we can highlight that: (i) Colombia is in the fifth position in the scope of countries of South America and the Caribbean, in terms of the number of products and citations; (ii) the largest number of Colombian authors with products in Scopus until 2016, belonged mainly to the area of natural sciences, followed by medical sciences and health; (iii) most of the Colombian authors were men (64.2%, or 5,442) and they have higher H index rates than women; (iv) using random cross validation for 10 iterations, the methods with the best predictive value using R2 and the minimization of mean absolute error (MAE) correspond to: AdaBoost (96.6% and 0.397, respectively); Random Forest (96.8% and 0.431, respectively); KNN (94.4% and 0.525, respectively); Tree (94.9% and 0.53, respectively); and Neural Network (93.3% and 0.7, respectively); and (v) the variables that help predict the H index in the case of the Colombian authors, in addition to the citations, correspond to: the quantity of products, number of products in Q1, and international collaboratio

Crossref

Repositorio Digital CUC

Recognizing speculative language in biomedical research articles: a linguistically motivated perspective

Author: B Medlock
C DiMarco
C Fellbaum
C Friedman
D Klein
FR Palmer
G Lakoff
G Szarvas
G Szarvas
Halil Kilicoglu
J Pustejovsky
K Hyland
K Kipper-Schuler
M Light
MC deMarneffe
P Thompson
R Saurí
Sabine Bergler
TM Mitchell
W Chafe
WJ Wilbur
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

We explore a linguistically motivated approach to the problem of recognizing speculative language (“hedging”) in biomedical research articles. We describe a method, which draws on prior linguistic work as well as existing lexical resources and extends them by introducing syntactic patterns and a simple weighting scheme to estimate the speculation level of the sentences. We show that speculative language can be recognized successfully with such an approach, discuss some shortcomings of the method and point out future research possibilities.

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

Constructing a semantic predication gold standard from the biomedical literature

Author: A Jimeno
A Névéol
A Roberts
AR Aronson
AT McCray
B Rosario
C Bizer
C Friedman
C Nédellec
CB Ahlers
D Hristovski
D Maglott
D Rebholz-Schuhmann
G Hripcsak
Graciela Rosemblat
H Kilicoglu
Halil Kilicoglu
J Björne
J Cohen
JD Kim
JD Kim
JD Kim
JD Kim
JP Pestian
L Tanabe
LH Smith
M Bada
M Fiszman
Marcelo Fiszman
O Bodenreider
P Thompson
R Bunescu
S Pyysalo
T Cohen
T Wattarujeekrit
TC Rindflesch
TC Rindflesch
Thomas C Rindflesch
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Semantic relations increasingly underpin biomedical text mining and knowledge discovery applications. The success of such practical applications crucially depends on the quality of extracted relations, which can be assessed against a gold standard reference. Most such references in biomedical text mining focus on narrow subdomains and adopt different semantic representations, rendering them difficult to use for benchmarking independently developed relation extraction systems. In this article, we present a multi-phase gold standard annotation study, in which we annotated 500 sentences randomly selected from MEDLINE abstracts on a wide range of biomedical topics with 1371 semantic predications. The UMLS Metathesaurus served as the main source for conceptual information and the UMLS Semantic Network for relational information. We measured interannotator agreement and analyzed the annotations closely to identify some of the challenges in annotating biomedical text with relations based on an ontology or a terminology. Results We obtain fair to moderate interannotator agreement in the practice phase (0.378-0.475). With improved guidelines and additional semantic equivalence criteria, the agreement increases by 12% (0.415 to 0.536) in the main annotation phase. In addition, we find that agreement increases to 0.688 when the agreement calculation is limited to those predications that are based only on the explicitly provided UMLS concepts and relations. Conclusions While interannotator agreement in the practice phase confirms that conceptual annotation is a challenging task, the increasing agreement in the main annotation phase points out that an acceptable level of agreement can be achieved in multiple iterations, by setting stricter guidelines and establishing semantic equivalence criteria. Mapping text to ontological concepts emerges as the main challenge in conceptual annotation. Annotating predications involving biomolecular entities and processes is particularly challenging. While the resulting gold standard is mainly intended to serve as a test collection for our semantic interpreter, we believe that the lessons learned are applicable generally.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Large-scale adverse effects related to treatment evidence standardization (LAERTES): an open scalable system for linking pharmacovigilance evidence sources with clinical data

Author: A Cami
AB Hill
BT McInnes
CC Freifeld
D Cameron
EP van Puijenbroek
F Cheng
G Jiang
G Jiang
H Kilicoglu
JM Banda
K O’Connor
M Dumontier
M Liu
M Yang
MJ Schuemie
N Shang
P Avillach
R Harpaz
R Harpaz
RD Boyce
S Ayvaz
SV Iyer
VG Koutkias
Y Li
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Detecting modification of biomedical events using a deep parsing approach

Author: A Copestake
A Copestake
A Copestake
A Frank
A MacKinlay
Andrew MacKinlay
B Medlock
C Pollard
D Flickinger
David Martinez
E Briscoe
E Buyko
E Velldal
G Móra
H Kilicoglu
H Uszkoreit
I Solt
J Björne
J Hakenberg
JD Kim
KB Cohen
P Adolphs
R Farkas
S Van Landeghem
Timothy Baldwin
U Callmeier
V Vincze
WW Chapman
Y Tsuruoka
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

Abstract Background This work describes a system for identifying event mentions in bio-molecular research abstracts that are either speculative (e.g. <it>analysis of IkappaBalpha phosphorylation</it>, where it is not specified whether phosphorylation did or did not occur) or negated (e.g. <it>inhibition of IkappaBalpha phosphorylation</it>, where phosphorylation did <it>not </it>occur). The data comes from a standard dataset created for the BioNLP 2009 Shared Task. The system uses a machine-learning approach, where the features used for classification are a combination of shallow features derived from the words of the sentences and more complex features based on the semantic outputs produced by a deep parser. Method To detect event modification, we use a Maximum Entropy learner with features extracted from the data relative to the trigger words of the events. The shallow features are bag-of-words features based on a small sliding context window of 3-4 tokens on either side of the trigger word. The deep parser features are derived from parses produced by the English Resource Grammar and the <it>RASP </it>parser. The outputs of these parsers are converted into the Minimal Recursion Semantics formalism, and from this, we extract features motivated by linguistics and the data itself. All of these features are combined to create training or test data for the machine learning algorithm. Results Over the test data, our methods produce approximately a 4% absolute increase in F-score for detection of event modification compared to a baseline based only on the shallow bag-of-words features. Conclusions Our results indicate that grammar-based techniques can enhance the accuracy of methods for detecting event modification.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

University of Melbourne Institutional Repository

Caipirini: using gene sets to rank literature

Author: A Barbosa-Silva
Adriano Barbosa-Silva
AM Cohen
Ana Carolina Wanderley-Nogueira
C Nobata
GB Martin
Georgios A Pavlopoulos
GL Poulter
H Kessman
H Kilicoglu
J Lewis
J-B Morel
JF Fontaine
KA Pattin
LJ Jensen
N Polavarapu
Nina Mota Soares-Cavalcanti
PK Shah
R Altman
R Rodriguez-Esteban
Reinhard Schneider
S Yu
Seán I O'Donoghue
T Etzold
T Goetz
T Soldatos
T Tuchler
Theodoros G Soldatos
Venkata P Satagopam
W Yu
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

Abstract Background Keeping up-to-date with bioscience literature is becoming increasingly challenging. Several recent methods help meet this challenge by allowing literature search to be launched based on lists of abstracts that the user judges to be 'interesting'. Some methods go further by allowing the user to provide a second input set of 'uninteresting' abstracts; these two input sets are then used to search and rank literature by relevance. In this work we present the service 'Caipirini' (<url>http://caipirini.org</url>) that also allows two input sets, but takes the novel approach of allowing ranking of literature based on one or more sets of genes. Results To evaluate the usefulness of Caipirini, we used two test cases, one related to the human cell cycle, and a second related to disease defense mechanisms in <it>Arabidopsis thaliana</it>. In both cases, the new method achieved high precision in finding literature related to the biological mechanisms underlying the input data sets. Conclusions To our knowledge Caipirini is the first service enabling literature search directly based on biological relevance to gene sets; thus, Caipirini gives the research community a new way to unlock hidden knowledge from gene sets derived via high-throughput experiments.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

UNSWorks

MDC Repository

Open Repository and Bibliography - Luxembourg

Semi-automated screening of biomedical citations for systematic reviews

Author: A Aronson
A Blum
A Cohen
A Wilcox
B Settles
B Wallace
Byron C Wallace
C Blake
C Cole
C Counsell
Carla Brodley
Chih-Chung
Christopher H Schmid
CJL Chih-Wei Hsu
D Chen
DD Lewis
E Perrin
F Camous
G Druck
G Schohn
H Kilicoglu
Joseph Lau
K Brinker
KS Goh
KS Jones
L Breiman
L Hunter
M Barza
M Chung
M Yetisgen-Yildiz
N Japkowicz
P Wheeler
P Zweigenbaum
S Dasgupta
S Ertekin
S Kotsiantis
S Tong
T Joachims
T Terasawa
Thomas A Trikalinos
VN Vapnik
W Yu
Y Aphinyanaphongs
YAC Aphinyanaphongs
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Leukotriene biosynthesis inhibition ameliorates acute lung injury following hemorrhagic shock in rats

Abstract Background Hemorrhagic shock followed by resuscitation is conceived as an insult frequently induces a systemic inflammatory response syndrome and oxidative stress that results in multiple-organ dysfunction syndrome including acute lung injury. MK-886 is a leukotriene biosynthesis inhibitor exerts an anti inflammatory and antioxidant activity. Objectives The objective of present study was to assess the possible protective effect of MK-886 against hemorrhagic shock-induced acute lung injury via interfering with inflammatory and oxidative pathways. Materials and methods Eighteen adult Albino rats were assigned to three groups each containing six rats: group I, sham group, rats underwent all surgical instrumentation but neither hemorrhagic shock nor resuscitation was done; group II, Rats underwent hemorrhagic shock (HS) for 1 hr then resuscitated with Ringer's lactate (1 hr) (induced untreated group, HS); group III, HS + MK-886 (0.6 mg/kg i.p. injection 30 min before the induction of HS, and the same dose was repeated just before reperfusion period). At the end of experiment (2 hr after completion of resuscitation), blood samples were collected for measurement of serum tumor necrosis factor-α (TNF-α) and interleukin-6 (IL-6). The trachea was then isolated and bronchoalveolar lavage fluid (BALF) was carried out for measurement of leukotriene B4 (LTB4), leukotriene C4 (LTC4) and total protein. The lungs were harvested, excised and the left lung was homogenized for measurement of malondialdehyde (MDA) and reduced glutathione (GSH) and the right lung was fixed in 10% formalin for histological examination. Results MK-886 treatment significantly reduced the total lung injury score compared with the HS group (<it>P </it>< 0.05). MK-886 also significantly decreased serum TNF-α & IL-6; lung MDA; BALF LTB4, LTC4 & total protein compared with the HS group (<it>P </it>< 0.05). MK-886 treatment significantly prevented the decrease in the lung GSH levels compared with the HS group (<it>P </it>< 0.05). Conclusions The results of the present study reveal that MK-886 may ameliorate lung injury in shocked rats via interfering with inflammatory and oxidative pathways implicating the role of leukotrienes in the pathogenesis of hemorrhagic shock-induced lung inflammation.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central