Search CORE

1,659 research outputs found

Differentially Private Exponential Random Graphs

Author: A. Goldenberg
A. Hout
C. Dwork
C. Dwork
C.J. Geyer
D.R. Hunter
G. Robins
L. Michell
M. Morris
M. Pearson
M.S. Handcock
M.S. Handcock
O. Frank
P.S. Bearman
S.M. Goodreau
T.A.B. Snijders
V. Karwa
Y.M.J. Woo
Publication venue
Publication date: 01/01/2014
Field of study

We propose methods to release and analyze synthetic graphs in order to protect privacy of individual relationships captured by the social network. Proposed techniques aim at fitting and estimating a wide class of exponential random graph models (ERGMs) in a differentially private manner, and thus offer rigorous privacy guarantees. More specifically, we use the randomized response mechanism to release networks under

\epsilon

-edge differential privacy. To maintain utility for statistical inference, treating the original graph as missing, we propose a way to use likelihood based inference and Markov chain Monte Carlo (MCMC) techniques to fit ERGMs to the produced synthetic networks. We demonstrate the usefulness of the proposed techniques on a real data example.Comment: minor edit

arXiv.org e-Print Archive

Crossref

Research Online

Sharing Social Network Data: Differentially Private Estimation of Exponential-Family Random Graph Models

Author: Carroll R. J.
Chaudhuri A.
Duchi J. C.
Fienberg S.
Geyer C. J.
Hunter D. R.
Karwa V.
Kinney S. K.
Lu W.
Morris M.
Raghunathan T. E.
Reiter J. P.
Snijders T. A.
Zhou Y.
Publication venue
Publication date: 23/09/2016
Field of study

Motivated by a real-life problem of sharing social network data that contain sensitive personal information, we propose a novel approach to release and analyze synthetic graphs in order to protect privacy of individual relationships captured by the social network while maintaining the validity of statistical results. A case study using a version of the Enron e-mail corpus dataset demonstrates the application and usefulness of the proposed techniques in solving the challenging problem of maintaining privacy \emph{and} supporting open access to network data to ensure reproducibility of existing studies and discovering new scientific insights that can be obtained by analyzing such data. We use a simple yet effective randomized response mechanism to generate synthetic networks under

\epsilon

-edge differential privacy, and then use likelihood based inference for missing data and Markov chain Monte Carlo techniques to fit exponential-family random graph models to the generated synthetic networks.Comment: Updated, 39 page

arXiv.org e-Print Archive

Crossref

Research Online

Publishing Community-Preserving Attributed Social Graphs with a Differential Privacy Guarantee

Author: Chen Xihui
Mauw Sjouke
Ramírez-Cruz Yunior
Publication venue: 'Walter de Gruyter GmbH'
Publication date: 31/08/2019
Field of study

We present a novel method for publishing differentially private synthetic attributed graphs. Unlike preceding approaches, our method is able to preserve the community structure of the original graph without sacrificing the ability to capture global structural properties. Our proposal relies on C-AGM, a new community-preserving generative model for attributed graphs. We equip C-AGM with efficient methods for attributed graph sampling and parameter estimation. For the latter, we introduce differentially private computation methods, which allow us to release community-preserving synthetic attributed social graphs with a strong formal privacy guarantee. Through comprehensive experiments, we show that our new model outperforms its most relevant counterparts in synthesising differentially private attributed social graphs that preserve the community structure of the original graph, as well as degree sequences and clustering coefficients

arXiv.org e-Print Archive

Open Repository and Bibliography - Luxembourg

Private Graph Data Release: A Survey

Author: Li Yang
Ng Kee Siong
Purcell Michael
Rakotoarivelo Thierry
Ranbaduge Thilina
Smith David
Publication venue
Publication date: 09/07/2021
Field of study

The application of graph analytics to various domains have yielded tremendous societal and economical benefits in recent years. However, the increasingly widespread adoption of graph analytics comes with a commensurate increase in the need to protect private information in graph databases, especially in light of the many privacy breaches in real-world graph data that was supposed to preserve sensitive information. This paper provides a comprehensive survey of private graph data release algorithms that seek to achieve the fine balance between privacy and utility, with a specific focus on provably private mechanisms. Many of these mechanisms fall under natural extensions of the Differential Privacy framework to graph data, but we also investigate more general privacy formulations like Pufferfish Privacy that can deal with the limitations of Differential Privacy. A wide-ranging survey of the applications of private graph data release mechanisms to social networks, finance, supply chain, health and energy is also provided. This survey paper and the taxonomy it provides should benefit practitioners and researchers alike in the increasingly important area of private graph data release and analysis

arXiv.org e-Print Archive

Private Graphon Estimation for Sparse Graphs

Author: Borgs Christian
Chayes Jennifer T.
Smith Adam
Publication venue
Publication date: 01/01/2015
Field of study

We design algorithms for fitting a high-dimensional statistical model to a large, sparse network without revealing sensitive information of individual members. Given a sparse input graph

G

, our algorithms output a node-differentially-private nonparametric block model approximation. By node-differentially-private, we mean that our output hides the insertion or removal of a vertex and all its adjacent edges. If

G

is an instance of the network obtained from a generative nonparametric model defined in terms of a graphon

W

, our model guarantees consistency, in the sense that as the number of vertices tends to infinity, the output of our algorithm converges to

W

in an appropriate version of the

L_2

norm. In particular, this means we can estimate the sizes of all multi-way cuts in

G

. Our results hold as long as

W

is bounded, the average degree of

G

grows at least like the log of the number of vertices, and the number of blocks goes to infinity at an appropriate rate. We give explicit error bounds in terms of the parameters of the model; in several settings, our bounds improve on or match known nonprivate results.Comment: 36 page

arXiv.org e-Print Archive

CiteSeerX

Revealing Network Structure, Confidentially: Improved Rates for Node-Private Graphon Estimation

Author: Borgs Christian
Chayes Jennifer
Smith Adam
Zadik Ilias
Publication venue
Publication date: 01/01/2018
Field of study

Motivated by growing concerns over ensuring privacy on social networks, we develop new algorithms and impossibility results for fitting complex statistical models to network data subject to rigorous privacy guarantees. We consider the so-called node-differentially private algorithms, which compute information about a graph or network while provably revealing almost no information about the presence or absence of a particular node in the graph. We provide new algorithms for node-differentially private estimation for a popular and expressive family of network models: stochastic block models and their generalization, graphons. Our algorithms improve on prior work, reducing their error quadratically and matching, in many regimes, the optimal nonprivate algorithm. We also show that for the simplest random graph models (

G(n,p)

and

G(n,m)

), node-private algorithms can be qualitatively more accurate than for more complex models---converging at a rate of

\frac{1}{\epsilon^2 n^{3}}

instead of

\frac{1}{\epsilon^2 n^2}

. This result uses a new extension lemma for differentially private algorithms that we hope will be broadly useful

arXiv.org e-Print Archive

Crossref

Boston University Institutional Repository (OpenBU)