4,666 research outputs found

Differentially Private Data Analysis of Social Networks via Restricted Sensitivity

Author: Blocki Jeremiah
Blum Avrim
Datta Anupam
Sheffet Or
Publication date: 01/01/2013

We introduce the notion of restricted sensitivity as an alternative to global and smooth sensitivity to improve accuracy in differentially private data analysis. The definition of restricted sensitivity is similar to that of global sensitivity except that instead of quantifying over all possible datasets, we take advantage of any beliefs about the dataset that a querier may have, to quantify over a restricted class of datasets. Specifically, given a query f and a hypothesis H about the structure of a dataset D, we show generically how to transform f into a new query f_H whose global sensitivity (over all datasets including those that do not satisfy H) matches the restricted sensitivity of the query f. Moreover, if the belief of the querier is correct (i.e., D is in H) then f_H(D) = f(D). If the belief is incorrect, then f_H(D) may be inaccurate. We demonstrate the usefulness of this notion by considering the task of answering queries regarding social networks, which we model as a combination of a graph and a labeling of its vertices. In particular, while our generic procedure is computationally inefficient, for the specific definition of H as graphs of bounded degree, we exhibit efficient ways of constructing f_H using different projection-based techniques. We then analyze two important query classes: subgraph counting queries (e.g., number of triangles) and local profile queries (e.g., number of people who know a spy and a computer scientist who know each other). We demonstrate that the restricted sensitivity of such queries can be significantly lower than their smooth sensitivity. Thus, using restricted sensitivity we can maintain privacy whether or not D is in H, while providing more accurate results in the event that H holds true.
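To make the gap between global and restricted sensitivity concrete, the following is a minimal brute-force sketch for the triangle-counting query on very small graphs. It is not the paper's projection-based construction of f_H. It assumes edge adjacency (the distance between two graphs is the number of edges in which they differ) and reads restricted sensitivity as the largest ratio |f(D1) - f(D2)| / d(D1, D2) over pairs of graphs in H, with H taken to be graphs of maximum degree at most k. All function names are hypothetical.

```python
from itertools import combinations


def count_triangles(edge_set, n):
    """Number of vertex triples whose three edges are all present."""
    return sum(
        1
        for a, b, c in combinations(range(n), 3)
        if (a, b) in edge_set and (a, c) in edge_set and (b, c) in edge_set
    )


def max_degree(edge_set, n):
    """Largest vertex degree in the graph (0 for an empty graph)."""
    deg = [0] * n
    for u, v in edge_set:
        deg[u] += 1
        deg[v] += 1
    return max(deg) if n else 0


def restricted_sensitivity(n, k):
    """Brute-force restricted sensitivity of triangle counting over
    graphs on n vertices with maximum degree <= k (assumed hypothesis H).

    Exponential in n; only meant for tiny illustrative cases.
    """
    edges = list(combinations(range(n), 2))
    graphs = []  # (edge bitmask, triangle count) for every graph in H
    for mask in range(1 << len(edges)):
        es = frozenset(e for i, e in enumerate(edges) if (mask >> i) & 1)
        if max_degree(es, n) <= k:
            graphs.append((mask, count_triangles(es, n)))
    best = 0.0
    for (m1, t1), (m2, t2) in combinations(graphs, 2):
        dist = bin(m1 ^ m2).count("1")  # number of differing edges
        best = max(best, abs(t1 - t2) / dist)
    return best


if __name__ == "__main__":
    n = 4
    print("degree bound 2:", restricted_sensitivity(n, 2))
    print("no degree bound:", restricted_sensitivity(n, n - 1))
```

Setting k = n - 1 places every graph in H, so the same function then returns the global sensitivity. On four vertices the bounded-degree value is already smaller, and the gap widens as n grows while k stays fixed.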

arXiv.org e-Print Archive

CiteSeerX

Parameterized Complexity of the k-anonymity Problem

Author: A Gionis
A Meyerson
G Aggarwal
G Ausiello
Gianluca Della Vedova
H Alt
H Park
J Blocki
L Sweeney
P Alimonti
P Bonizzoni
P Samarati
PA Evans
Paola Bonizzoni
R Diestel
R Downey
R Niedermeier
RG Downey
Riccardo Dondi
W Du
Yuri Pirola
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 17/05/2010

The problem of publishing personal data without giving up privacy is becoming increasingly important. An interesting formalization that has been recently proposed is k-anonymity. This approach requires that the rows of a table be partitioned into clusters of size at least k and that all the rows in a cluster become the same tuple after the suppression of some entries. The natural optimization problem, where the goal is to minimize the number of suppressed entries, is known to be APX-hard even when the record values are over a binary alphabet and k=3, and when the records have length at most 8 and k=4. In this paper we study how the complexity of the problem is influenced by different parameters. We first show that the problem is W[1]-hard when parameterized by the size of the solution (and the value k). Then we exhibit a fixed-parameter algorithm when the problem is parameterized by the size of the alphabet and the number of columns. Finally, we investigate the computational (and approximation) complexity of the k-anonymity problem when restricting the instance to records having length bounded by 3 and k=3. We show that the problem remains APX-hard under this restriction. Comment: 22 pages, 2 figure
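The objective described in this abstract can be illustrated with a short sketch that evaluates one candidate clustering. It assumes rows are equal-length strings and that a column must be suppressed in every row of a cluster whenever the cluster's rows disagree on it. The function name and the sample table are hypothetical, and the sketch only scores a given partition; it does not search for an optimal one.

```python
def suppression_cost(rows, clusters, k):
    """Number of entries suppressed when each cluster is collapsed
    into a single tuple.

    rows     -- list of equal-length strings (the table)
    clusters -- list of lists of row indices, forming a partition
    k        -- minimum allowed cluster size
    """
    covered = sorted(i for cluster in clusters for i in cluster)
    if covered != list(range(len(rows))):
        raise ValueError("clusters must partition the row indices")
    cost = 0
    for cluster in clusters:
        if len(cluster) < k:
            raise ValueError("every cluster needs at least k rows")
        members = [rows[i] for i in cluster]
        for column in zip(*members):
            # A column with disagreeing values is starred out in all rows.
            if len(set(column)) > 1:
                cost += len(cluster)
    return cost


if __name__ == "__main__":
    table = ["000", "001", "010", "111", "110", "101"]
    print(suppression_cost(table, [[0, 1, 2], [3, 4, 5]], k=3))
```

In the sample table each cluster agrees only on its first column, so two columns are suppressed in each of its three rows.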

arXiv.org e-Print Archive

Crossref

Clustering in complex networks. II. Percolation properties

Author: M. Molloy
M. Ángeles Serrano
Marián Boguñá
P. Erdös
P. Grassberger
R. M. May
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/2006

The percolation properties of clustered networks are analyzed in detail. In the case of weak clustering, we present an analytical approach that allows us to find the critical threshold and the size of the giant component. Numerical simulations confirm the accuracy of our results. In more general terms, we show that weak clustering hinders the onset of the giant component whereas strong clustering favors its appearance. This is a direct consequence of the differences in the k-core structure of the networks, which are found to be totally different depending on the level of clustering. An empirical analysis of a real social network confirms our predictions. Comment: Updated reference lis
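For readers unfamiliar with the k-core mentioned in this abstract, the sketch below computes it with the standard peeling procedure: repeatedly delete vertices whose remaining degree is below k. It is a generic illustration, not the authors' analysis, and it assumes an undirected simple graph given as a symmetric dictionary of neighbor sets. The function name and the sample graph are hypothetical.

```python
def k_core(adj, k):
    """Return the vertex set of the k-core of an undirected graph.

    adj -- dict mapping each vertex to the set of its neighbors
           (assumed symmetric, without self-loops)
    """
    alive = set(adj)
    degree = {v: len(adj[v]) for v in adj}
    stack = [v for v in adj if degree[v] < k]
    while stack:
        v = stack.pop()
        if v not in alive:
            continue  # already peeled via an earlier stack entry
        alive.discard(v)
        for w in adj[v]:
            if w in alive:
                degree[w] -= 1
                if degree[w] < k:
                    stack.append(w)
    return alive


if __name__ == "__main__":
    # A triangle (0, 1, 2) with one pendant vertex 3 attached to vertex 2.
    graph = {0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3}, 3: {2}}
    print(sorted(k_core(graph, 2)))
```

Peeling removes the pendant vertex first, leaving the triangle as the 2-core. Asking for the 3-core of the same graph removes every vertex.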

arXiv.org e-Print Archive

Crossref

Diposit Digital de la Universitat de Barcelona

Dagstuhl Reports : Volume 1, Issue 2, February 2011

Author: Schloss Dagstuhl Leibniz-Zentrum für Informatik
Publication date: 09/09/2011

Online Privacy: Towards Informational Self-Determination on the Internet (Dagstuhl Perspectives Workshop 11061): Simone Fischer-Hübner, Chris Hoofnagle, Kai Rannenberg, Michael Waidner, Ioannis Krontiris and Michael Marhöfer
Self-Repairing Programs (Dagstuhl Seminar 11062): Mauro Pezzé, Martin C. Rinard, Westley Weimer and Andreas Zeller
Theory and Applications of Graph Searching Problems (Dagstuhl Seminar 11071): Fedor V. Fomin, Pierre Fraigniaud, Stephan Kreutzer and Dimitrios M. Thilikos
Combinatorial and Algorithmic Aspects of Sequence Processing (Dagstuhl Seminar 11081): Maxime Crochemore, Lila Kari, Mehryar Mohri and Dirk Nowotka
Packing and Scheduling Algorithms for Information and Communication Services (Dagstuhl Seminar 11091): Klaus Jansen, Claire Mathieu, Hadas Shachnai and Neal E. Youn

Hochschulschriftenserver - Universität Frankfurt am Main

Provenance Views for Module Privacy

Author: Davidson Susan B.
Khanna Sanjeev
Milo Tova
Panigrahi Debmalya
Roy Sudeepa
Publication date: 01/01/2011

Scientific workflow systems increasingly store provenance information about the module executions used to produce a data item, as well as the parameter settings and intermediate data items passed between module executions. However, authors/owners of workflows may wish to keep some of this information confidential. In particular, a module may be proprietary, and users should not be able to infer its behavior by seeing mappings between all data inputs and outputs. The problem we address in this paper is the following: we are given a workflow, abstractly modeled by a relation R, a privacy requirement \Gamma, and costs associated with data. The owner of the workflow decides which data (attributes) to hide, and provides the user with a view R' which is the projection of R over the attributes that have not been hidden. The goal is to minimize the cost of hidden data while guaranteeing that individual modules are \Gamma-private. We call this the "secureview" problem. We formally define the problem, study its complexity, and offer algorithmic solutions.
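The two quantities the abstract names, the view R' and the cost of the hidden data, can be sketched as follows. The sketch assumes R is a list of attribute-to-value dictionaries with mutually comparable values and that each attribute carries a numeric cost. It does not check \Gamma-privacy, whose formal definition is not given in the abstract. The function name, the attribute names, and the sample costs are hypothetical.

```python
def project_view(relation, hidden, cost):
    """Return (view, hidden_cost) for a chosen set of hidden attributes.

    relation -- list of dicts, one per tuple of R
    hidden   -- set of attribute names to hide
    cost     -- dict mapping attribute name to the cost of hiding it
    """
    # Projection with set semantics: duplicate visible tuples collapse.
    visible = {
        tuple(sorted((a, v) for a, v in row.items() if a not in hidden))
        for row in relation
    }
    view = [dict(t) for t in sorted(visible)]
    hidden_cost = sum(cost[a] for a in hidden)
    return view, hidden_cost


if __name__ == "__main__":
    # Hypothetical one-module workflow computing out = a XOR b.
    R = [
        {"a": 0, "b": 0, "out": 0},
        {"a": 0, "b": 1, "out": 1},
        {"a": 1, "b": 0, "out": 1},
        {"a": 1, "b": 1, "out": 0},
    ]
    costs = {"a": 1, "b": 2, "out": 5}
    print(project_view(R, {"a", "b"}, costs))
```

Hiding both inputs leaves a view that lists only the possible outputs, at a cost equal to the sum of the two input costs. Whether such a view meets a given privacy requirement is a separate check that this sketch leaves out.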

arXiv.org e-Print Archive

CiteSeerX

Crossref

ScholarlyCommons@Penn