
19 research outputs found

Relaxation Height in Energy Landscapes: an Application to Multiple Metastable States

Author: A. Bovier
D. Mehta
D.J. Wales
E. Olivieri
E.N.M. Cirillo
Emilio N. M. Cirillo
F. Hollander den
F. Manzo
F.R. Nardi
Francesca R. Nardi
G. Grinstein
H.W. Capel
J. Beltrán
L. Alonso
M. Blume
M. Bousquet-Mélou
M.I. Friedlin
R.J. Glauber
S. Bigelis
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012

The study of systems with multiple (not necessarily degenerate) metastable states presents subtle difficulties from the mathematical point of view related to the variational problem that has to be solved in these cases. We introduce the notion of relaxation height in a general energy landscape and we prove sufficient conditions which are valid even in the presence of multiple metastable states. We show how these results can be used to approach the problem of multiple metastable states via the use of the modern theories of metastability. We finally apply these general results to the Blume-Capel model for a particular choice of the parameters ensuring the existence of two multiple, and not degenerate in energy, metastable states.

arXiv.org e-Print Archive

Archivio della ricerca- Università di Roma La Sapienza

Evaluating current automatic de-identification methods with Veteran’s health administration clinical documents

Author: BA Beckwith
Brett R South
D Gupta
E Aramaki
F Jeffrey Friedlin
FJ Friedlin
G Szarvas
H Dalianis
I Neamatullah
J Aberdeen
J Gardner
JJ Berman
K Hara
Matthew H Samore
O Uzuner
Oscar Ferrández
P Ohm
R Grishman
Shuying Shen
SM Meystre
Stéphane M Meystre
Y Guo
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Automatic de-identification of textual documents in the electronic health record: a review of recent research

Author: B Wellner
BA Beckwith
Brett R South
C Friedman
D Gupta
DA Dorr
E Aramaki
EM Fielstein
F Jeffrey Friedlin
FJ Friedlin
FP Morrison
G Szarvas
GPO U.S
H Cunningham
I Neamatullah
J Gardner
JJ Berman
K Atkinson
K Hara
L Sweeney
Matthew H Samore
NCI
NLM
O Uzuner
P Ruch
RK Taira
Shuying Shen
SM Meystre
SM Thomas
Stephane M Meystre
Y Guo
Publication venue: BioMed Central
Publication date: 01/01/2010

Abstract. Background: In the United States, the Health Insurance Portability and Accountability Act (HIPAA) protects the confidentiality of patient data and requires the informed consent of the patient and approval of the Institutional Review Board to use data for research purposes, but these requirements can be waived if data is de-identified. For clinical data to be considered de-identified, the HIPAA "Safe Harbor" technique requires 18 data elements (called PHI: Protected Health Information) to be removed. The de-identification of narrative text documents is often performed manually, and requires significant resources. Well aware of these issues, several authors have investigated automated de-identification of narrative text documents from the electronic health record, and a review of recent research in this domain is presented here. Methods: This review focuses on recently published research (after 1995), and includes relevant publications from bibliographic queries in PubMed, conference proceedings, the ACM Digital Library, and interesting publications referenced in already included papers. Results: The literature search returned more than 200 publications. The majority focused only on structured data de-identification instead of narrative text, on image de-identification, or described manual de-identification, and were therefore excluded. Finally, 18 publications describing automated text de-identification were selected for detailed analysis of the architecture and methods used, the types of PHI detected and removed, the external resources used, and the types of clinical documents targeted. All text de-identification systems aimed to identify and remove person names, and many included other types of PHI. Most systems used only one or two specific clinical document types, and were mostly based on two different groups of methodologies: pattern matching and machine learning. Many systems combined both approaches for different types of PHI, but the majority relied only on pattern matching, rules, and dictionaries. Conclusions: In general, methods based on dictionaries performed better with PHI that is rarely mentioned in clinical text, but are more difficult to generalize. Methods based on machine learning tend to perform better, especially with PHI that is not mentioned in the dictionaries used. Finally, the issues of anonymization, sufficient performance, and "over-scrubbing" are discussed in this publication.

Crossref

IUPUIScholarWorks

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Fluctuations in Nonequilibrium Statistical Mechanics: Models, Mathematical Theory, Physical Mechanisms

The fluctuations in nonequilibrium systems are under intense theoretical and experimental investigation. Topical "fluctuation relations" describe symmetries of the statistical properties of certain observables, in a variety of models and phenomena. They have been derived in deterministic and, later, in stochastic frameworks. Other results first obtained for stochastic processes, and later considered in deterministic dynamics, describe the temporal evolution of fluctuations. The field has grown beyond expectation: research works and different perspectives are proposed at an ever faster pace. Indeed, understanding fluctuations is important for the emerging theory of nonequilibrium phenomena, as well as for applications, such as those of nanotechnological and biophysical interest. However, the links among the different approaches and the limitations of these approaches are not fully understood. We focus on these issues, providing: a) analysis of the theoretical models; b) discussion of the rigorous mathematical results; c) identification of the physical mechanisms underlying the validity of the theoretical predictions, for a wide range of phenomena. Comment: 44 pages, 2 figures. To appear in Nonlinearity (2007).

arXiv.org e-Print Archive

CiteSeerX

Crossref

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

PORTO Publications Open Repository TOrino

Improving sensitivity of machine learning methods for automated case identification from free-text electronic medical records

Author: A Cunningham
A Nicholson
A Vlug
C Chen
C Clark
C Drummond
C Hsu
C-C Chang
CP Chung
CX Ling
D Mease
E Apostolova
EA Garcia
Elif F Sen
FS Roque
GK Savova
GM Weiss
GN Norén
H Harkema
J Cohen
J Friedlin
J Van Hulse
JA Linder
JA Singh
Jan A Kors
Jan C van Blijderveen
JF Hurdle
JR Quinlan
K McCarthy
KP Liao
KS Boockvar
LM Taft
M Hall
Martijn J Schuemie
MH Stanfill
Miriam CJM Sturkenboom
MJ Schuemie
N Japkowicz
NV Chawla
P Domingos
P Ruch
PK Chan
PL Elkin
R Akbani
R Farkas
R Setiono
RH Perlis
S Pakhomov
SD Persell
SL Salzberg
SM Meystre
T Wang
W Adler
WW Chapman
WW Cohen
X Liu
Y Sun
Z Wang
Z Zhou
Zubair Afzal
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Evaluating current automatic de-identification methods with Veteran’s health administration clinical documents

Author: Ferrández Oscar
Friedlin F
Meystre Stéphane M
Samore Matthew H
Shen Shuying
South Brett R
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/07/2012

Abstract. Background: The increased use and adoption of Electronic Health Records (EHR) causes a tremendous growth in digital information useful for clinicians, researchers and many other operational purposes. However, this information is rich in Protected Health Information (PHI), which severely restricts its access and possible uses. A number of investigators have developed methods for automatically de-identifying EHR documents by removing PHI, as specified in the Health Insurance Portability and Accountability Act “Safe Harbor” method. This study focuses on the evaluation of existing automated text de-identification methods and tools, as applied to Veterans Health Administration (VHA) clinical documents, to assess which methods perform better with each category of PHI found in our clinical notes; and when new methods are needed to improve performance. Methods: We installed and evaluated five text de-identification systems “out-of-the-box” using a corpus of VHA clinical documents. The systems based on machine learning methods were trained with the 2006 i2b2 de-identification corpora and evaluated with our VHA corpus, and also evaluated with a ten-fold cross-validation experiment using our VHA corpus. We counted exact, partial, and fully contained matches with reference annotations, considering each PHI type separately, or only one unique ‘PHI’ category. Performance of the systems was assessed using recall (equivalent to sensitivity) and precision (equivalent to positive predictive value) metrics, as well as the F2-measure. Results: Overall, systems based on rules and pattern matching achieved better recall, and precision was always better with systems based on machine learning approaches. The highest “out-of-the-box” F2-measure was 67% for partial matches; the best precision and recall were 95% and 78%, respectively. Finally, the ten-fold cross-validation experiment allowed for an increase of the F2-measure to 79% with partial matches. Conclusions: The “out-of-the-box” evaluation of text de-identification systems provided us with compelling insight about the best methods for de-identification of VHA clinical documents. The error analysis demonstrated an important need for customization to PHI formats specific to VHA documents. This study informed the planning and development of a “best-of-breed” automatic de-identification application for VHA clinical text.
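
For reference, the F2-measure named in the abstract above is the F-beta score with beta = 2, which weights recall more heavily than precision. The following is a minimal sketch under stated assumptions: the function name `f_beta` and the input values are illustrative only and are not code or figures from the study.

```python
def f_beta(precision: float, recall: float, beta: float = 2.0) -> float:
    """Return the F-beta score for a given precision and recall.

    F_beta = (1 + beta^2) * P * R / (beta^2 * P + R)
    With beta = 2 (the F2-measure), recall counts more than precision.
    """
    if not (0.0 <= precision <= 1.0 and 0.0 <= recall <= 1.0):
        raise ValueError("precision and recall must lie in [0, 1]")
    if beta <= 0:
        raise ValueError("beta must be positive")
    denominator = beta ** 2 * precision + recall
    if denominator == 0.0:
        # Both precision and recall are zero; define the score as zero.
        return 0.0
    return (1 + beta ** 2) * precision * recall / denominator


if __name__ == "__main__":
    # Hypothetical systems (values chosen for illustration, not from the study):
    # one favors precision, the other favors recall.
    high_precision = f_beta(precision=1.0, recall=0.5)
    high_recall = f_beta(precision=0.5, recall=1.0)
    print(f"F2 with high precision, low recall: {high_precision:.4f}")
    print(f"F2 with low precision, high recall: {high_recall:.4f}")
```

Under the F2-measure, the hypothetical high-recall system scores higher than the high-precision one, which matches the usual motivation for the metric in de-identification: a missed PHI item is costlier than an unnecessary redaction.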

Directory of Open Access Journals

Text de-identification for privacy protection: A study of its impact on clinical text information content

Author: Aberdeen
Beckwith
Berman
Brett R. South
Deleger
F. Jeffrey Friedlin
Ferrandez
Friedlin
Matthew H. Samore
Meystre
Neamatullah
Ruch
Savova
Shuying Shen
Stéphane M. Meystre
Sweeney
Taira
Uzuner
Óscar Ferrández
Publication venue: 'Elsevier BV'

Crossref