Automatic de-identification of textual documents in the electronic health record: a review of recent research

Author: B Wellner
BA Beckwith
Brett R South
C Friedman
D Gupta
DA Dorr
E Aramaki
EM Fielstein
F Jeffrey Friedlin
FJ Friedlin
FP Morrison
G Szarvas
GPO U.S
H Cunningham
I Neamatullah
J Gardner
JJ Berman
K Atkinson
K Hara
L Sweeney
Matthew H Samore
NCI
NLM
O Uzuner
P Ruch
RK Taira
Shuying Shen
SM Meystre
SM Thomas
Stephane M Meystre
Y Guo
Publication venue: BioMed Central
Publication date: 01/01/2010

Abstract
Background: In the United States, the Health Insurance Portability and Accountability Act (HIPAA) protects the confidentiality of patient data and requires the informed consent of the patient and approval of the Internal Review Board to use data for research purposes, but these requirements can be waived if data is de-identified. For clinical data to be considered de-identified, the HIPAA "Safe Harbor" technique requires 18 data elements (called PHI: Protected Health Information) to be removed. The de-identification of narrative text documents is often realized manually, and requires significant resources. Well aware of these issues, several authors have investigated automated de-identification of narrative text documents from the electronic health record, and a review of recent research in this domain is presented here.
Methods: This review focuses on recently published research (after 1995), and includes relevant publications from bibliographic queries in PubMed, conference proceedings, the ACM Digital Library, and interesting publications referenced in already included papers.
Results: The literature search returned more than 200 publications. The majority focused only on structured data de-identification instead of narrative text, on image de-identification, or described manual de-identification, and were therefore excluded. Finally, 18 publications describing automated text de-identification were selected for detailed analysis of the architecture and methods used, the types of PHI detected and removed, the external resources used, and the types of clinical documents targeted. All text de-identification systems aimed to identify and remove person names, and many included other types of PHI. Most systems used only one or two specific clinical document types, and were mostly based on two different groups of methodologies: pattern matching and machine learning. Many systems combined both approaches for different types of PHI, but the majority relied only on pattern matching, rules, and dictionaries.
Conclusions: In general, methods based on dictionaries performed better with PHI that is rarely mentioned in clinical text, but are more difficult to generalize. Methods based on machine learning tend to perform better, especially with PHI that is not mentioned in the dictionaries used. Finally, the issues of anonymization, sufficient performance, and "over-scrubbing" are discussed in this publication.

Crossref

IUPUIScholarWorks

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Evaluating the extent of reusability of CYP2C19 genotype data among patients genotyped for antiplatelet therapy selection

Author: Angiolillo Dominick P.
Beitelshees Amber L.
Cavallari Larisa H.
Dillon Chrisly
El Rouby Nihal
Empey Philip E.
Fielstein Elliot M.
Franchi Francesco
Johnson Julie A.
Limdi Nita A.
Ong Henry H.
Peterson Joshua F.
Rosenman Marc B.
Skaar Todd C.
Stevenson James M.
Tuteja Sony
Publication venue: Springer Science and Business Media LLC
Publication date: 01/11/2020

Abstract
Purpose: Genotype-guided antiplatelet therapy is increasingly being incorporated into clinical care. The purpose of this study is to determine the extent to which patients initially genotyped for CYP2C19 to guide antiplatelet therapy were prescribed additional medications affected by CYP2C19.
Methods: We assembled a cohort of patients from eight sites performing CYP2C19 genotyping to inform antiplatelet therapy. Medication orders were evaluated from time of genotyping through one year. The primary endpoint was the proportion of patients prescribed two or more CYP2C19 substrates. Secondary endpoints were the proportion of patients with a drug–genotype interaction and time to receiving a CYP2C19 substrate.
Results: Nine thousand one hundred ninety-one genotyped patients (17% nonwhite) with a mean age of 68 ± 3 years were evaluated; 4701 (51%) of patients received two or more CYP2C19 substrates and 3835 (42%) of patients had a drug–genotype interaction. The average time between genotyping and CYP2C19 substrate other than antiplatelet therapy was 25 ± 10 days.
Conclusions: More than half of patients genotyped in the setting of CYP2C19-guided antiplatelet therapy received another medication impacted by CYP2C19 in the following year. Given that genotype is stable for a patient’s lifetime, this finding has implications for cost effectiveness, patient care, and treatment outcomes beyond the indication for which it was originally performed.

IUPUIScholarWorks

Fatores geradores de estresse para atletas da categoria de base do futebol de campo [Stress-generating factors for youth-category field football (soccer) athletes]

Crossref

Assessing the Impact of Magic Johnson's HIV Positive Announcement on a University Campus

Author: Bruce K. E.
DiClemente R. J.
Elmes D. G.
Fielstein E. M.
Gayle H. D.
Langer L. M.
Mickler S.
Shrum J. C.
Sigelman C. K.
Weinstein N. D.
Publication venue: Informa UK Limited

Crossref