Search CORE

16,361 research outputs found

A Taxonomy of Privacy-Preserving Record Linkage Techniques

Author: Christen Peter
Vatsalan Dinusha
Verykios Vassilios
Publication venue: 'Elsevier BV'
Publication date: 07/12/2015
Field of study

The process of identifying which records in two or more databases correspond to the same entity is an important aspect of data quality activities such as data pre-processing and data integration. Known as record linkage, data matching or entity resolution, this process has attracted interest from researchers in fields such as databases and data warehousing, data mining, information systems, and machine learning. Record linkage has various challenges, including scalability to large databases, accurate matching and classification, and privacy and confidentiality. The latter challenge arises because commonly personal identifying data, such as names, addresses and dates of birth of individuals, are used in the linkage process. When databases are linked across organizations, the issue of how to protect the privacy and confidentiality of such sensitive information is crucial to successful application of record linkage. In this paper we present an overview of techniques that allow the linking of databases between organizations while at the same time preserving the privacy of these data. Known as 'privacy-preserving record linkage' (PPRL), various such techniques have been developed. We present a taxonomy of PPRL techniques to characterize these techniques along 15 dimensions, and conduct a survey of PPRL techniques. We then highlight shortcomings of current techniques and discuss avenues for future research

The Australian National University

Accuracy and completeness of patient pathways – the benefits of national data linkage in Australia

Author: A Ferrante
Adrian P. Brown
Anna M. Ferrante
B Sibthorpe
BA Virnig
C Kelman
CDAJ Holman
G Bishop
G Lawrence
H Newcombe
HB Newcombe
Jacqueline K. Bauer
James B. Semmens
James H. Boyd
K Field
K Harron
K Harron
Katrina Spilsbury
Kevin McInneny
KM Campbell
LE Gill
LL Roos
MA Jaro
Margo Gillies
NCRIS
P Christen
PM Frommer
R Karmel
Sean M. Randall
SW Kendrick
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

Background - The technical challenges associated with national data linkage, and the extent of cross-border population movements, are explored as part of a pioneering research project. The project involved linking state-based hospital admission records and death registrations across Australia for a national study of hospital related deaths. Methods - The project linked over 44 million morbidity and mortality records from four Australian states between 1st July 1999 and 31st December 2009 using probabilistic methods. The accuracy of the linkage was measured through a comparison with jurisdictional keys sourced from individual states. The extent of cross-border population movement between these states was also assessed. Results - Data matching identified almost twelve million individuals across the four Australian states. The percentage of individuals from one state with records found in another ranged from 3-5 %. Using jurisdictional keys to measure linkage quality, results indicate a high matching efficiency (F measure 97 to 99 %), with linkage processing taking only a matter of days. Conclusions - The results demonstrate the feasibility and accuracy of undertaking cross jurisdictional linkage for national research. The benefits are substantial, particularly in relation to capturing the full complement of records in patient pathways as a result of cross-border population movements. The project identified a sizeable ‘mobile’ population with hospital records in more than one state. Research studies that focus on a single jurisdiction will under-enumerate the extent of hospital usage by individuals in the population. It is important that researchers understand and are aware of the impact of this missing hospital activity on their studies. The project highlights the need for an efficient and accurate data linkage system to support national research across Australia

Crossref

Springer - Publisher Connector

PubMed Central

espace@Curtin

Tree Based Scalable Indexing for Multi-Party Privacy-Preserving Record Linkage

Author: Christen Peter
Ranbaduge Thilina
Vatsalan Dinusha
Publication venue: Australian Computer Society Inc.
Publication date: 09/12/2015
Field of study

The Australian National University