Search CORE

1,539 research outputs found

Automated Detection of Systematic Off-label Drug Use in Free Text of Electronic Medical Records.

Author: Jung Kenneth
Lependu Paea
Shah Nigam
Publication venue: eScholarship, University of California
Publication date: 01/01/2013
Field of study

Off-label use of a drug occurs when it is used in a manner that deviates from its FDA label. Studies estimate that 21% of prescriptions are off-label, with only 27% of those uses supported by evidence of safety and efficacy. We have developed methods to detect population level off-label usage using computationally efficient annotation of free text from clinical notes to generate features encoding empirical information about drug-disease mentions. By including additional features encoding prior knowledge about drugs, diseases, and known usage, we trained a highly accurate predictive model that was used to detect novel candidate off-label usages in a very large clinical corpus. We show that the candidate uses are plausible and can be prioritized for further analysis in terms of safety and efficacy

PubMed Central

eScholarship - University of California

Automatic construction of rule-based ICD-9-CM coding systems

Author: AL Berger
AR Aronson
D Lang
György Szarvas
I Goldstein
IH Witten
J Patrick
JP Pestian
K Crammer
LRS de Lima
LS Larkey
MA Moisio
Richárd Farkas
WW Chapman
Y Lussier
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Background: In this paper we focus on the problem of automatically constructing ICD-9-CM coding systems for radiology reports. ICD-9-CM codes are used for billing purposes by health institutes and are assigned to clinical records manually following clinical treatment. Since this labeling task requires expert knowledge in the field of medicine, the process itself is costly and is prone to errors as human annotators have to consider thousands of possible codes when assigning the right ICD-9-CM labels to a document. In this study we use the datasets made available for training and testing automated ICD-9-CM coding systems by the organisers of an International Challenge on Classifying Clinical Free Text Using Natural Language Processing in spring 2007. The challenge itself was dominated by entirely or partly rule-based systems that solve the coding task using a set of hand crafted expert rules. Since the feasibility of the construction of such systems for thousands of ICD codes is indeed questionable, we decided to examine the problem of automatically constructing similar rule sets that turned out to achieve a remarkable accuracy in the shared task challenge. Results: Our results are very promising in the sense that we managed to achieve comparable results with purely hand-crafted ICD-9-CM classifiers. Our best model got a 90.26 % F measure on the training dataset and an 88.93 % F measure on the challenge test dataset, using the micro-averaged Fβ=1 measure, the official evaluatio

CiteSeerX

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central