54 research outputs found

Automatic inference of indexing rules for MEDLINE

Author: A Névéol
A Srinivasan
AR Aronson
Aurélie Névéol
CD Manning
GD Plotkin
J Lin
K Markó
LF Soualmia
ME Funk
R Agrawal
R Rak
S Muggleton
S Ozdowska
S Sohn
Sonya E Shooshan
V Claveau
Vincent Claveau
WL Buntine
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

This paper describes the use and customization of Inductive Logic Programming (ILP) to infer indexing rules from MEDLINE citations. Preliminary results suggest this method may enhance the subheading attachment module of the Medical Text Indexer, a system for assisting MEDLINE indexers.

CiteSeerX

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

A distantly supervised dataset for automated data extraction from diagnostic studies

Author: Kanoulas E.
Leeflang M.
Norman C.
Névéol A.
Spijker R.
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2019
Field of study

International Migration, Integration and Social Cohesion online publications

A distantly supervised dataset for automated data extraction from diagnostic studies

Author: Kanoulas E.
Leeflang M.
Norman C.
Névéol A.
Spijker R.
Publication venue
Publication date: 01/01/2019
Field of study

Systematic reviews are important in evidence-based medicine, but are expensive to produce. Automating or semi-automating the data extraction of index test, target condition, and reference standard from articles has the potential to decrease the cost of conducting systematic reviews of diagnostic test accuracy, but relevant training data is not available. We create a distantly supervised dataset of approximately 90,000 sentences, and let two experts manually annotate a small subset of around 1,000 sentences for evaluation. We evaluate the performance of BioBERT and logistic regression for ranking the sentences, and compare the performance for distant and direct supervision. Our results suggest that distant supervision can work as well as, or better than direct supervision on this problem, and that distantly trained models can perform as well as, or better than human annotators.

Crossref

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

International Migration, Integration and Social Cohesion online publications

UvA-DARE

CLEF 2017 eHealth evaluation lab overview

Author: Goeuriot L.
Kanoulas E.
Kelly L.
Névéol A.
Palotti J.
Robert A.
Spijker R.
Suominen H.
Zuccon G.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

International Migration, Integration and Social Cohesion online publications

Recommended from our members

Automatic indexing and retrieval of encounter-specific evidence for point-of-care support

Author: Aronson
Bakkena
Berrios
Berrios
Chambliss
Choi
Ciravegna
CW
Demner-Fushman
Dympna M. O’Sullivan
Elkin
Frants
Friedman
Friedman
Gay
Hauser
Hersh
Hersh
Huang
Ken J. Farion
King
Mao
McDonald
Nadkarni
Névéol
Névéol
Rindflesch
Sackett
Salton
Szymon A. Wilk
Westbrook
Wilk
Wojtek J. Michalowski
Yu
Publication venue: 'Elsevier BV'
Publication date: 01/08/2010
Field of study

Evidence-based medicine relies on repositories of empirical research evidence that can be used to support clinical decision making for improved patient care. However, retrieving evidence from such repositories at local sites presents many challenges. This paper describes a methodological framework for automatically indexing and retrieving empirical research evidence in the form of systematic reviews and associated studies from The Cochrane Library, where retrieved documents are specific to a patient-physician encounter and thus can be used to support evidence-based decision making at the point of care. Such an encounter is defined by three pertinent groups of concepts (diagnosis, treatment, and patient), and the framework relies on these three groups to steer indexing and retrieval of reviews and associated studies. An evaluation of the indexing and retrieval components of the proposed framework was performed using documents relevant for the pediatric asthma domain. Precision and recall values for automatic indexing of systematic reviews and associated studies were 0.93 and 0.87, and 0.81 and 0.56, respectively. Moreover, precision and recall for the retrieval of relevant systematic reviews and associated studies were 0.89 and 0.81, and 0.92 and 0.89, respectively. With minor modifications, the proposed methodological framework can be customized for other evidence repositories.

City Research Online

Elsevier - Publisher Connector

Crossref

TRAP

Aston Publications Explorer

A context-blocks model for identifying clinical relationships in patient records

Author: A Névéol
A Roberts
AK McCallum
AM Cohen
AR Aronson
Aurélie Névéol
C Friedman
ES Chen
F Leitner
H Shatkay
H Xu
J Aberdeen
J Björne
J Lafferty
L Smith
L Tanabe
M Bundschus
M Craven
M Krallinger
N Ponomareva
O Uzuner
R Harpaz
R Islamaj Doğan
Rezarta Islamaj Doğan
SM Meystre
SV Pakhomov
TC Rindflesch
X Wang
Zhiyong Lu
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Constructing a semantic predication gold standard from the biomedical literature

Author: A Jimeno
A Névéol
A Roberts
AR Aronson
AT McCray
B Rosario
C Bizer
C Friedman
C Nédellec
CB Ahlers
D Hristovski
D Maglott
D Rebholz-Schuhmann
G Hripcsak
Graciela Rosemblat
H Kilicoglu
Halil Kilicoglu
J Björne
J Cohen
JD Kim
JP Pestian
L Tanabe
LH Smith
M Bada
M Fiszman
Marcelo Fiszman
O Bodenreider
P Thompson
R Bunescu
S Pyysalo
T Cohen
T Wattarujeekrit
TC Rindflesch
Thomas C Rindflesch
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Background: Semantic relations increasingly underpin biomedical text mining and knowledge discovery applications. The success of such practical applications crucially depends on the quality of extracted relations, which can be assessed against a gold standard reference. Most such references in biomedical text mining focus on narrow subdomains and adopt different semantic representations, rendering them difficult to use for benchmarking independently developed relation extraction systems. In this article, we present a multi-phase gold standard annotation study, in which we annotated 500 sentences randomly selected from MEDLINE abstracts on a wide range of biomedical topics with 1371 semantic predications. The UMLS Metathesaurus served as the main source for conceptual information and the UMLS Semantic Network for relational information. We measured interannotator agreement and analyzed the annotations closely to identify some of the challenges in annotating biomedical text with relations based on an ontology or a terminology. Results: We obtain fair to moderate interannotator agreement in the practice phase (0.378-0.475). With improved guidelines and additional semantic equivalence criteria, the agreement increases by 12% (0.415 to 0.536) in the main annotation phase. In addition, we find that agreement increases to 0.688 when the agreement calculation is limited to those predications that are based only on the explicitly provided UMLS concepts and relations. Conclusions: While interannotator agreement in the practice phase confirms that conceptual annotation is a challenging task, the increasing agreement in the main annotation phase points out that an acceptable level of agreement can be achieved in multiple iterations, by setting stricter guidelines and establishing semantic equivalence criteria. Mapping text to ontological concepts emerges as the main challenge in conceptual annotation. Annotating predications involving biomolecular entities and processes is particularly challenging. While the resulting gold standard is mainly intended to serve as a test collection for our semantic interpreter, we believe that the lessons learned are applicable generally.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Using Noun Phrases for Navigating Biomedical Literature on Pubmed: How Many Updates Are We Losing Track of?

Author: A Névéol
A Rzhetsky
Andrey Rzhetsky
BM Fonseca
C Jacquemin
C Manning
CD Manning
D Beeferman
D Rebholz-Schuhmann
D Shotton
D Srikrishna
D Trieschnigg
Devabhaktuni Srikrishna
DR Hunter
GF Cooper
J Evans
J Lin
JPA Ioannidis
M Muin
M Weeber
Marc A. Coram
MH MacRoberts
MJ Schuemie
N Tran
O Bodenreider
P Srinivasan
PL Elkin
Q He
Q Li
R Islamaj Dogan
R Schifanella
RA DiGiacomo
S Bird
T Rindflesch
T Wachter
V Sintchenko
W Kim
Y Huang
Z Lu
Z Sun
Publication venue: Public Library of Science
Publication date: 14/09/2011
Field of study

Author-supplied citations are a fraction of the related literature for a paper. The “related citations” list on PubMed is typically dozens or hundreds of results long and offers no hints as to why these results are related. Using noun phrases derived from the sentences of the paper, we show it is possible to more transparently navigate to PubMed updates through search terms that can associate a paper with its citations. The algorithm to generate these search terms involved automatically extracting noun phrases from the paper using natural language processing tools, and ranking them by the number of occurrences in the paper compared to the number of occurrences on the web. We define search queries having at least one instance of overlap between the author-supplied citations of the paper and the top 20 search results as citation validated (CV). When the overlapping citations were written by the same authors as the paper itself, we define the query as CV-S; when they were written by different authors, we define it as CV-D. For a systematic sample of 883 papers on PubMed Central, at least one of the search terms for 86% of the papers is CV-D versus 65% for the top 20 PubMed “related citations.” We hypothesize that these quantities, computed for the 20 million papers on PubMed, would differ from these percentages by no more than 5%. Averaged across all 883 papers, 5 search terms are CV-D, 10 search terms are CV-S, and 6 unique citations validate these searches. Potentially related literature uncovered by citation-validated searches (either CV-S or CV-D) is on the order of ten items per paper; many more if the remaining searches that are not citation-validated are taken into account. The significance and relationship of each search result to the paper can only be vetted and explained by a researcher with knowledge of or interest in that paper.
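
The ranking and validation steps described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract says only that in-paper and web occurrence counts are compared, so the ratio score (with add-one smoothing) is an assumption, and the function names, the `web_counts` dictionary, and the toy identifiers are hypothetical. Noun-phrase extraction and the PubMed search itself are left out.

```python
from collections import Counter


def rank_noun_phrases(phrases, web_counts):
    """Rank phrases by in-paper frequency relative to web frequency.

    Assumption: the score is a simple ratio; the abstract does not give
    the exact formula.
    """
    paper_counts = Counter(phrases)

    def score(phrase):
        # Add 1 so phrases never seen on the web do not divide by zero.
        return paper_counts[phrase] / (web_counts.get(phrase, 0) + 1)

    return sorted(paper_counts, key=score, reverse=True)


def is_citation_validated(search_results, cited_ids, top_k=20):
    """Return True if any of the top_k results is one of the paper's citations."""
    return any(result in cited_ids for result in search_results[:top_k])


# Toy data (hypothetical): phrases extracted from a paper and their web counts.
phrases = ["noun phrase", "pubmed", "noun phrase",
           "related citations", "pubmed", "noun phrase"]
web_counts = {"noun phrase": 50, "pubmed": 10_000, "related citations": 200}

ranked = rank_noun_phrases(phrases, web_counts)
validated = is_citation_validated(["id7", "id3", "id9"], {"id3", "id42"})
```

Separating CV-S from CV-D would additionally require comparing the author lists of the overlapping citations with those of the paper, which this sketch does not attempt.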

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Un service Web pour l’annotation sémantique de données biomédicales avec des ontologies [A Web service for the semantic annotation of biomedical data with ontologies]

Author: A. Névéol
A.B. Can
C. Jonquet
D.L. Rubin
J. Euzenat
J.E. Caviedesa
K. Khelif
M.N. Kamel-Boulos
N. Bhatia
N.C. Ide
N.H. Shah
O. Bodenreider
O. Corby
R. Moskovitch
T. Pedersen
W.R. Hersh
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

Crossref