Search CORE

14 research outputs found

Evaluating current automatic de-identification methods with Veteran’s health administration clinical documents

Author: BA Beckwith
Brett R South
D Gupta
E Aramaki
F Jeffrey Friedlin
FJ Friedlin
G Szarvas
H Dalianis
I Neamatullah
J Aberdeen
J Gardner
JJ Berman
K Hara
Matthew H Samore
O Uzuner
O Uzuner
Oscar Ferrández
P Ohm
R Grishman
Shuying Shen
SM Meystre
SM Meystre
Stéphane M Meystre
Y Guo
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Evaluating the informatics for integrating biology and the bedside system for clinical research

Author: AJ Butte
D Box
E O'Brien
EA Zerhouni
EA Zerhouni
EA Zerhouni
GA Patton
HR Warner
Joyce A Mitchell
JS Maul
M Skolnick
MP Papazoglou
MP Papazoglou
R Nalichowski
RT Fielding
SN Murphy
SN Murphy
SN Murphy
Stéphane M Meystre
Vikrant G Deshmukh
Publication venue: BioMed Central
Publication date: 01/10/2009
Field of study

Abstract Background Selecting patient cohorts is a critical, iterative, and often time-consuming aspect of studies involving human subjects; informatics tools for helping streamline the process have been identified as important infrastructure components for enabling clinical and translational research. We describe the evaluation of a free and open source cohort selection tool from the Informatics for Integrating Biology and the Bedside (i2b2) group: the i2b2 hive. Methods Our evaluation included the usability and functionality of the i2b2 hive using several real world examples of research data requests received electronically at the University of Utah Health Sciences Center between 2006 - 2008. The hive server component and the visual query tool application were evaluated for their suitability as a cohort selection tool on the basis of the types of data elements requested, as well as the effort required to fulfill each research data request using the i2b2 hive alone. Results We found the i2b2 hive to be suitable for obtaining estimates of cohort sizes and generating research cohorts based on simple inclusion/exclusion criteria, which consisted of about 44% of the clinical research data requests sampled at our institution. Data requests that relied on post-coordinated clinical concepts, aggregate values of clinical findings, or temporal conditions in their inclusion/exclusion criteria could not be fulfilled using the i2b2 hive alone, and required one or more intermediate data steps in the form of pre- or post-processing, modifications to the hive metadata, etc. Conclusion The i2b2 hive was found to be a useful cohort-selection tool for fulfilling common types of requests for research data, and especially in the estimation of initial cohort sizes. For another institution that might want to use the i2b2 hive for clinical research, we recommend that the institution would need to have structured, coded clinical data and metadata available that can be transformed to fit the logical data models of the i2b2 hive, strategies for extracting relevant clinical data from source systems, and the ability to perform substantial pre- and post-processing of these data.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

The use of regional platforms for managing electronic health records for the production of regional public health indicators in France

Author: A Coden
A Mykowiecka
A Roberts
Agence des systèmes d'information partagés de santé
Agence des systèmes d'information partagés de santé
Agence des systèmes d'information partagés de santé
B Dean
C Brodley
C Friedman
C Grouin
C Quantin
C Quantin
C Quantin
C Schoen
Centers for Disease Control and Prevention
D Friedman
D Kalra
D Proux
D Proux
E Lau
F Farsi
F Laforest
G Batista
G Ritschard
G Saporta
G Weiss
Groupe de travail-politiques régionales de santé
H Stenzhorn
HJ Murff
I Guyon
International Organization for Standardization
JC Denny
JG Anderson
Journal Officiel de la République Française
M Apkon
M Fieschi
M Klompas
Marie-Hélène Metzger
MH Metzger
MK Obenshain
N Chawla
N Japkowicz
N Terrin
O Boussaïd
P Domingos
P Lenca
Philippe Castets
Q Gicquel
R Krishna
Roger Salamon
S Brossette
S Dudoit
S Pakhomov
S Sakji
SM Meystre
Stéphane Lallich
T Dietterich
T Durand
Thierry Durand
W Stead
WW Chapman
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Evaluating current automatic de-identification methods with Veteran’s health administration clinical documents

Author: Ferrández Oscar
Friedlin F
Meystre Stéphane M
Samore Matthew H
Shen Shuying
South Brett R
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/07/2012
Field of study

Abstract Background The increased use and adoption of Electronic Health Records (EHR) causes a tremendous growth in digital information useful for clinicians, researchers and many other operational purposes. However, this information is rich in Protected Health Information (PHI), which severely restricts its access and possible uses. A number of investigators have developed methods for automatically de-identifying EHR documents by removing PHI, as specified in the Health Insurance Portability and Accountability Act “Safe Harbor” method. This study focuses on the evaluation of existing automated text de-identification methods and tools, as applied to Veterans Health Administration (VHA) clinical documents, to assess which methods perform better with each category of PHI found in our clinical notes; and when new methods are needed to improve performance. Methods We installed and evaluated five text de-identification systems “out-of-the-box” using a corpus of VHA clinical documents. The systems based on machine learning methods were trained with the 2006 i2b2 de-identification corpora and evaluated with our VHA corpus, and also evaluated with a ten-fold cross-validation experiment using our VHA corpus. We counted exact, partial, and fully contained matches with reference annotations, considering each PHI type separately, or only one unique ‘PHI’ category. Performance of the systems was assessed using recall (equivalent to sensitivity) and precision (equivalent to positive predictive value) metrics, as well as the F2-measure. Results Overall, systems based on rules and pattern matching achieved better recall, and precision was always better with systems based on machine learning approaches. The highest “out-of-the-box” F2-measure was 67% for partial matches; the best precision and recall were 95% and 78%, respectively. Finally, the ten-fold cross validation experiment allowed for an increase of the F2-measure to 79% with partial matches. Conclusions The “out-of-the-box” evaluation of text de-identification systems provided us with compelling insight about the best methods for de-identification of VHA clinical documents. The errors analysis demonstrated an important need for customization to PHI formats specific to VHA documents. This study informed the planning and development of a “best-of-breed” automatic de-identification application for VHA clinical text.</p

Directory of Open Access Journals

Textractor: a hybrid system for medications and reason for their prescription extraction from clinical text documents

Author: Hurdle John F
Meystre Stéphane M
Shen Shuying
South Brett R
Thibault Julien
Publication venue: BMJ Group
Publication date
Field of study

Crossref

PubMed Central