Search CORE

89 research outputs found

Discerning Tumor Status from Unstructured MRI Reports—Completeness of Information in Existing Reports and Utility of Automated Natural Language Processing

Author: AB Miller
BI Reiner
BJ Thomas
Bradley J. Erickson
C Cortes
CL Sistrom
CP Langlotz
E Galanis
G Hripcsak
G Hripcsak
G Hripcsak
GB Melton
GK Savova
Guergana K. Savova
I McCowan
IA McCowan
JC Denny
Jiaping Zheng
JL Hobby
JS Elkins
KJ Dreyer
L Berlin
L Zhou
Lionel T. E. Cheng
NR Dunnick
P Therasse
PM Hickey
R Khorasani
RK Taira
S Pakhomov
SS Naik
Y Lin
Publication venue: Springer-Verlag
Publication date: 01/01/2009
Field of study

Information in electronic medical records is often in an unstructured free-text format. This format presents challenges for expedient data retrieval and may fail to convey important findings. Natural language processing (NLP) is an emerging technique for rapid and efficient clinical data retrieval. While proven in disease detection, the utility of NLP in discerning disease progression from free-text reports is untested. We aimed to (1) assess whether unstructured radiology reports contained sufficient information for tumor status classification; (2) develop an NLP-based data extraction tool to determine tumor status from unstructured reports; and (3) compare NLP and human tumor status classification outcomes. Consecutive follow-up brain tumor magnetic resonance imaging reports (2000–2007) from a tertiary center were manually annotated using consensus guidelines on tumor status. Reports were randomized to NLP training (70%) or testing (30%) groups. The NLP tool utilized a support vector machines model with statistical and rule-based outcomes. Most reports had sufficient information for tumor status classification, although 0.8% did not describe status despite reference to prior examinations. Tumor size was unreported in 68.7% of documents, while 50.3% lacked data on change magnitude when there was detectable progression or regression. Using retrospective human classification as the gold standard, NLP achieved 80.6% sensitivity and 91.6% specificity for tumor status determination (mean positive predictive value, 82.4%; negative predictive value, 92.0%). In conclusion, most reports contained sufficient information for tumor status determination, though variable features were used to describe status. NLP demonstrated good accuracy for tumor status classification and may have novel application for automated disease status classification from electronic databases

Crossref

Springer - Publisher Connector

PubMed Central

Clinical narrative analytics challenges

Author: A Coden
A Rodríguez-González
A Rodríguez-González
AA Thomas
BL Humphreys
C Friedman
C Friedman
C Friedman
D Ferrucci
DA Hanauer
G Hripcsak
GK Savova
M Taboada
O Ben-Assuli
P Zweigenbaum
PM Pietrzyk
QT Zeng
R Costumero
R Costumero
R Costumero
SM Meystre
Y Ji
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Precision medicine or evidence based medicine is based on the extraction of knowledge from medical records to provide individuals with the appropriate treatment in the appropriate moment according to the patient features. Despite the efforts of using clinical narratives for clinical decision support, many challenges have to be faced still today such as multilinguarity, diversity of terms and formats in different services, acronyms, negation, to name but a few. The same problems exist when one wants to analyze narratives in literature whose analysis would provide physicians and researchers with highlights. In this talk we will analyze challenges, solutions and open problems and will analyze several frameworks and tools that are able to perform NLP over free text to extract medical entities by means of Named Entity Recognition process. We will also analyze a framework we have developed to extract and validate medical terms. In particular we present two uses cases: (i) medical entities extraction of a set of infectious diseases description texts provided by MedlinePlus and (ii) scales of stroke identification in clinical narratives written in Spanish

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Archivo Digital UPM

Annotation analysis for testing drug safety signals using unstructured clinical notes

Author: A Bate
C Friedman
D Classen
D Dore
D Graham
DW Bates
G Alterovitz
GK Savova
H Cao
KD Shetty
L Ohno-Machado
L Tari
MJ Goldacre
N Tatonetti
NF Noy
NH Shah
NH Shah
O Bodenreider
P Khatri
P LePendu
P LePendu
P Stang
PM Coloma
PM Nadkarni
R Harpaz
R Harpaz
R Harpaz
RP Radecki
S Paumier
S Schneeweiss
S Schneeweiss
S Weiss-Smith
SJ Reisinger
W Chapman
WW Chapman
WW Chapman
WW Chapman
X Wang
Y Liu
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

BackgroundThe electronic surveillance for adverse drug events is largely based upon the analysis of coded data from reporting systems. Yet, the vast majority of electronic health data lies embedded within the free text of clinical notes and is not gathered into centralized repositories. With the increasing access to large volumes of electronic medical data-in particular the clinical notes-it may be possible to computationally encode and to test drug safety signals in an active manner.ResultsWe describe the application of simple annotation tools on clinical text and the mining of the resulting annotations to compute the risk of getting a myocardial infarction for patients with rheumatoid arthritis that take Vioxx. Our analysis clearly reveals elevated risks for myocardial infarction in rheumatoid arthritis patients taking Vioxx (odds ratio 2.06) before 2005.ConclusionsOur results show that it is possible to apply annotation analysis methods for testing hypotheses about drug safety using electronic medical records

Crossref

Springer - Publisher Connector

PubMed Central

eScholarship - University of California

DIAL UCLouvain

Detection of sentence boundaries and abbreviations in clinical narratives

Author: B Di Eugenio
B Schölkopf
C Cortes
C Friedman
C Friedman
C Friedman
CD Manning
CM Bishop
CW Hsu
D Gillick
E Buyko
G Hripcsak
GK Savova
H Suominen
H Xu
H Xu
I Guyon
J Patrick
M Kreuzthaler
M Wiesenauer
MA Hearst
Markus Kreuzthaler
N Cristianini
N Okazaki
O Patterson
R O'Donnell
SM Meystre
Stefan Schulz
T Dunning
T Hagerup
T Joachims
T Joachims
T Kiss
T Kiss
T Kiss
Y Wu
Y Wu
Y Wu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

ContextD: an algorithm to identify contextual properties of medical terms in a Dutch clinical corpus

Author: A Vlug
C Friedman
C Friedman
E Apostolova
E Velldal
Ewoud Pons
GK Savova
H Harkema
H Kilicoglu
H Xu
I Goldin
J Cohen
Jan A Kors
L Deléger
LM Christensen
M Light
M Skeppstedt
Martijn J Schuemie
Miriam CJM Sturkenboom
Ning Kang
NP Cruz Díaz
O Bodenreider
O Uzuner
PB Jensen
PG Mutalik
PL Elkin
QT Zeng
RM Reeves
S Agarwal
S Goryachev
U Hahn
V Vincze
W Sun
WW Chapman
WW Chapman
Y Huang
Zubair Afzal
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

NOBLE – Flexible concept recognition for large-scale biomedical natural language processing

Author: A Smith
AR Aronson
C Friedman
C Friedman
C Funk
C-N Hsu
CD Manning
D Hanauer
D Tikk
Elizabeth Legowski
Eugene Tseytlin
G Divita
Girish Chavan
GK Savova
J Zheng
JJ Berman
JJ Berman
JJ Cimino
Julia Corrigan
K Liu
K Liu
K Liu
KB Cohen
Kevin Mitchell
M Bada
MA Tanenblatt
ML Zeng
NF de Keizer
NF de Keizer
NH Shah
PM Nadkarni
Rebecca S. Jacobson
RL Trask
RS Crowley
SA Stewart
T Mitsumori
TR Gruber
Z Lu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Extracting a stroke phenotype risk factor from Veteran Health Administration clinical reports: an information content analysis

Author: AN Kho
B Chapman
BR South
C Shivade
CA McCarty
DL Brown
DL Mowery
DL Mowery
E Birman-Deych
EF Gershanik
EM Cheng
G Hripcsak
G Hripcsak
GK Savova
H Harkema
I Kullo
IJ Kullo
J Pathak
JC Denny
JH Garvin
M Conway
O Bodenreider
RA Wilke
W Scuba
WK Thompson
WW Chapman
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

In the United States, 795,000 people suffer strokes each year; 10-15 % of these strokes can be attributed to stenosis caused by plaque in the carotid artery, a major stroke phenotype risk factor. Studies comparing treatments for the management of asymptom

Crossref

Springer - Publisher Connector

eScholarship - University of California

Knowledge Author: facilitating user-driven, domain content development to support clinical information extraction

Author: BE Chapman
C Friedman
C Tao
DL Mowery
E Aramaki
GK Savova
I Spasic
JH Chiang
L Hunter
L Li
MA Al-Haddad
O Bodenreider
P Moreno
QT Zeng
S Pakhomov
ST Wu
TC Rindflesch
TT Vleck Van
W Hsu
X Wang
Y Lu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Selected heterozygosity at cis-regulatory sequences increases the expression homogeneity of a cell population in humans

Author: A Auton
A Cabral
A Chess
A Mathelier
A McKenna
A Sandelin
AD Johnson
AK Tehranchi
AL Hughes
B He
B Jackson
B Lehner
BE Bernstein
BPH Metzger
BS Weir
C-M Ghim
CE Grant
Cheol-Min Ghim
D Charlesworth
D Sellis
D Smirnov
D Su
D Vlieghe
D Volfson
DB Dubal
DE Arking
DJ Penn
E Grundberg
E Roth
E Wingender
EM Leffler
F Tajima
G Pasvol
GK Marinov
J Paulsson
J Stewart-Ornstein
J Wang
JA Bailey
JA Bailey
JC Bryne
JE Wigginton
JK Pickrell
Juneil Jang
Jung Kyoon Choi
Kang Seon Lee
M DeGiorgio
M Fumagalli
M Kaern
MA Beaumont
MH Schierup
Min Kyung Sung
MT Maurano
N Friedman
O Gokcumen
P Brennecke
P Martin
PC ’t Hoen
R Andersson
R Cagliani
RE Consortium
RE Thurman
RR Hudson
S Neph
SA Schroeder
SI Wright
T Derrien
T Kambayashi
T Lappalainen
The 1000 Genomes Project Consortium
V Matys
V Matys
V Savova
Y Benjamini
Y Taniguchi
Z Tang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Background: Examples of heterozygote advantage in humans are scarce and limited to protein-coding sequences. Here, we attempt a genome-wide functional inference of advantageous heterozygosity at cis-regulatory regions. Results: The single-nucleotide polymorphisms bearing the signatures of balancing selection are enriched in active cis-regulatory regions of immune cells and epithelial cells, the latter of which provide barrier function and innate immunity. Examples associated with ancient trans-specific balancing selection are also discovered. Allelic imbalance in chromatin accessibility and divergence in transcription factor motif sequences indicate that these balanced polymorphisms cause distinct regulatory variation. However, a majority of these variants show no association with the expression level of the target gene. Instead, single-cell experimental data for gene expression and chromatin accessibility demonstrate that heterozygous sequences can lower cell-to-cell variability in proportion to selection strengths. This negative correlation is more pronounced for highly expressed genes and consistently observed when using different data and methods. Based on mathematical modeling, we hypothesize that extrinsic noise from fluctuations in transcription factor activity may be amplified in homozygotes, whereas it is buffered in heterozygotes. While high expression levels are coupled with intrinsic noise reduction, regulatory heterozygosity can contribute to the suppression of extrinsic noise. Conclusions: This mechanism may confer a selective advantage by increasing cell population homogeneity and thereby enhancing the collective action of the cells, especially of those involved in the defense systems in humansope

Crossref

Springer - Publisher Connector

PubMed Central

ScholarWorks@UNIST

Graph-based signal integration for high-throughput phenotyping

Author: A Aronson
A Malik
C Friedman
Charles F Bearden
D Widdows
Devika Subramanian
Elmer V Bernstam
EV Bernstam
GA Miller
GK Savova
J Singh
J van der Lei
Jorge R Herskovic
JR Herskovic
KP Liao
LB Chibnik
M Boyd
M Fiszman
M González-Fernández
M Nahm
MH Stanfill
ML Miller
Pamela A Bozzo-Silva
S Thirumurthi
T Cohen
T Cohen
Trevor Cohen
W Kintsch
W Widdows
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref