Search CORE

16 research outputs found

A Game Theoretic Framework for Analyzing Re-Identification Risk

Author: Bradley A. Malin (284876)
Ellen Wright Clayton (711341)
Murat Kantarcioglu (154695)
Ranjit Ganta (711342)
Raymond Heatherly (711343)
Weiyi Xia (711340)
Yevgeniy Vorobeychik (711339)
Zhiyu Wan (711338)
Publication venue
Publication date: 01/01/2015
Field of study

<div>Given the potential wealth of insights in personal data the big databases can provide, many organizations aim to share data while protecting privacy by sharing de-identified data, but are concerned because various demonstrations show such data can be re-identified. Yet these investigations focus on how attacks can be perpetrated, not the likelihood they will be realized. This paper introduces a game theoretic framework that enables a publisher to balance re-identification risk with the value of sharing data, leveraging a natural assumption that a recipient only attempts re-identification if its potential gains outweigh the costs. We apply the framework to a real case study, where the value of the data to the publisher is the actual grant funding dollar amounts from a national sponsor and the re-identification gain of the recipient is the fine paid to a regulator for violation of federal privacy rules. There are three notable findings: 1) it is possible to achieve zero risk, in that the recipient never gains from re-identification, while sharing almost as much data as the optimal solution that allows for a small amount of risk; 2) the zero-risk solution enables sharing much more data than a commonly invoked de-identification policy of the U.S. Health Insurance Portability and Accountability Act (HIPAA); and 3) a sensitivity analysis demonstrates these findings are robust to order-of-magnitude changes in player losses and gains. In combination, these findings provide support that such a framework can enable pragmatic policy decisions about de-identified data sharing.</div

Directory of Open Access Journals

FigShare

A comparison of four de-identification policies for the case study on performance measures.

Author: Bradley A. Malin (284876)
Ellen Wright Clayton (711341)
Murat Kantarcioglu (154695)
Ranjit Ganta (711342)
Raymond Heatherly (711343)
Weiyi Xia (711340)
Yevgeniy Vorobeychik (711339)
Zhiyu Wan (711338)
Publication venue
Publication date
Field of study

SH: Safe Harbor. GI: Generalization Intensity.A comparison of four de-identification policies for the case study on performance measures.</p

FigShare

Payoff across strategies.

Author: Bradley A. Malin (284876)
Ellen Wright Clayton (711341)
Murat Kantarcioglu (154695)
Ranjit Ganta (711342)
Raymond Heatherly (711343)
Weiyi Xia (711340)
Yevgeniy Vorobeychik (711339)
Zhiyu Wan (711338)
Publication venue
Publication date
Field of study

Payoffs for the record ⟨48, Asian, Female, 38363⟩ across all strategies.</p

FigShare

Scatter-plot of Payoff Differences.

Author: Bradley A. Malin (284876)
Ellen Wright Clayton (711341)
Murat Kantarcioglu (154695)
Ranjit Ganta (711342)
Raymond Heatherly (711343)
Weiyi Xia (711340)
Yevgeniy Vorobeychik (711339)
Zhiyu Wan (711338)
Publication venue
Publication date
Field of study

Detailed distributions of the publisher’s payoff differences (left) and the adversary’s payoff differences (right) between games and HIPAA Safe Harbor (SH).</p

FigShare

DGH for Age.

Author: Bradley A. Malin (284876)
Ellen Wright Clayton (711341)
Murat Kantarcioglu (154695)
Ranjit Ganta (711342)
Raymond Heatherly (711343)
Weiyi Xia (711340)
Yevgeniy Vorobeychik (711339)
Zhiyu Wan (711338)
Publication venue
Publication date
Field of study

The Domain Generation Hierarchy (DGH) for the attribute Age in the case study.</p

FigShare

DGH for Race.

Author: Bradley A. Malin (284876)
Ellen Wright Clayton (711341)
Murat Kantarcioglu (154695)
Ranjit Ganta (711342)
Raymond Heatherly (711343)
Weiyi Xia (711340)
Yevgeniy Vorobeychik (711339)
Zhiyu Wan (711338)
Publication venue
Publication date
Field of study

The Domain Generation Hierarchy (DGH) for the attribute Race in the case study.</p

FigShare

Data used to fit the LME model after triage

Author: Bradley A. Malin (284876)
Ellen Wright Clayton (711341)
James Gaupp (5032607)
Murat Kantarcioglu (154695)
Weiyi Xia (711340)
Yevgeniy Vorobeychik (711339)
Yongtai Liu (5032601)
Zhijun Yin (5032604)
Zhiyu Wan (711338)
Publication venue
Publication date: 27/09/2017
Field of study

This dataset contains the data that is used in the statistical analysis of the paper. This dataset is obtained from performing triage as described in the paper to the dataset with title "Publication and dbGaP datasets mapping before triage"

Dryad Digital Repository (Duke University)

FigShare

Recent notable HIPAA breach violation cases as reported by the U.S. Department of Health and Human Services.

Author: Bradley A. Malin (284876)
Ellen Wright Clayton (711341)
Murat Kantarcioglu (154695)
Ranjit Ganta (711342)
Raymond Heatherly (711343)
Weiyi Xia (711340)
Yevgeniy Vorobeychik (711339)
Zhiyu Wan (711338)
Publication venue
Publication date
Field of study

Recent notable HIPAA breach violation cases as reported by the U.S. Department of Health and Human Services.</p

FigShare

Histogram of Payoff Differences.

Author: Bradley A. Malin (284876)
Ellen Wright Clayton (711341)
Murat Kantarcioglu (154695)
Ranjit Ganta (711342)
Raymond Heatherly (711343)
Weiyi Xia (711340)
Yevgeniy Vorobeychik (711339)
Zhiyu Wan (711338)
Publication venue
Publication date
Field of study

Distributions of the publisher’s payoff differences (left) and the adversary’s payoff differences (right) between games and HIPAA Safe Harbor (SH).</p

FigShare

A performance comparison of the de-identification game solving approaches.

Author: Bradley A. Malin (284876)
Ellen Wright Clayton (711341)
Murat Kantarcioglu (154695)
Ranjit Ganta (711342)
Raymond Heatherly (711343)
Weiyi Xia (711340)
Yevgeniy Vorobeychik (711339)
Zhiyu Wan (711338)
Publication venue
Publication date
Field of study

BIS: Backward Induction Search. LBS: Lattice-Based Search. Payoff difference means the absolute difference of payoff for one record between a heuristic-driven approach and the baseline BIS approach.A performance comparison of the de-identification game solving approaches.</p

FigShare