slides

Faster Algorithms for Privately Releasing Marginals

Abstract

We study the problem of releasing kk-way marginals of a database D∈({0,1}d)nD \in (\{0,1\}^d)^n, while preserving differential privacy. The answer to a kk-way marginal query is the fraction of DD's records x∈{0,1}dx \in \{0,1\}^d with a given value in each of a given set of up to kk columns. Marginal queries enable a rich class of statistical analyses of a dataset, and designing efficient algorithms for privately releasing marginal queries has been identified as an important open problem in private data analysis (cf. Barak et. al., PODS '07). We give an algorithm that runs in time dO(k)d^{O(\sqrt{k})} and releases a private summary capable of answering any kk-way marginal query with at most Β±.01\pm .01 error on every query as long as nβ‰₯dO(k)n \geq d^{O(\sqrt{k})}. To our knowledge, ours is the first algorithm capable of privately releasing marginal queries with non-trivial worst-case accuracy guarantees in time substantially smaller than the number of kk-way marginal queries, which is dΘ(k)d^{\Theta(k)} (for kβ‰ͺdk \ll d)

    Similar works