Search CORE

68,382 research outputs found

Mining Frequent Graph Patterns with Differential Privacy

Author: Geweke J.
Gilks W.
Karwa V.
Rubinstein R.
Williams O.
Yan X.
Publication venue
Publication date: 01/03/2013
Field of study

Discovering frequent graph patterns in a graph database offers valuable information in a variety of applications. However, if the graph dataset contains sensitive data of individuals such as mobile phone-call graphs and web-click graphs, releasing discovered frequent patterns may present a threat to the privacy of individuals. {\em Differential privacy} has recently emerged as the {\em de facto} standard for private data analysis due to its provable privacy guarantee. In this paper we propose the first differentially private algorithm for mining frequent graph patterns. We first show that previous techniques on differentially private discovery of frequent {\em itemsets} cannot apply in mining frequent graph patterns due to the inherent complexity of handling structural information in graphs. We then address this challenge by proposing a Markov Chain Monte Carlo (MCMC) sampling based algorithm. Unlike previous work on frequent itemset mining, our techniques do not rely on the output of a non-private mining algorithm. Instead, we observe that both frequent graph pattern mining and the guarantee of differential privacy can be unified into an MCMC sampling framework. In addition, we establish the privacy and utility guarantee of our algorithm and propose an efficient neighboring pattern counting technique as well. Experimental results show that the proposed algorithm is able to output frequent patterns with good precision

arXiv.org e-Print Archive

CiteSeerX

Crossref

Quantifying Privacy: A Novel Entropy-Based Measure of Disclosure Risk

Author: A Oganian
C Dwork
CCM Fung
CJ Skinner
D Lambert
DE Denning
F Al-Saggaf
GT Duncan
JR Griggs
L Brankovic
L Brankovic
L Brankovic
L Brankovic
L Brankovic
L Brankovic
L Brankovic
L Brankovic
L Sankar
L Willenborg
M Trottini
N Lopez
N López
NR Adam
P Horak
P Tendick
R Ahlswede
S Fletcher
S Morris
T King
V Estivill-Castro
V Estivill-Castro
WA Fuller
WE Winkler
WE Yancey
Y Al-Saggaf
Publication venue
Publication date: 07/09/2014
Field of study

It is well recognised that data mining and statistical analysis pose a serious treat to privacy. This is true for financial, medical, criminal and marketing research. Numerous techniques have been proposed to protect privacy, including restriction and data modification. Recently proposed privacy models such as differential privacy and k-anonymity received a lot of attention and for the latter there are now several improvements of the original scheme, each removing some security shortcomings of the previous one. However, the challenge lies in evaluating and comparing privacy provided by various techniques. In this paper we propose a novel entropy based security measure that can be applied to any generalisation, restriction or data modification technique. We use our measure to empirically evaluate and compare a few popular methods, namely query restriction, sampling and noise addition.Comment: 20 pages, 4 figure

arXiv.org e-Print Archive

University of Newcastle's Digital Repository

Crossref