Beating Randomized Response on Incoherent Matrices
Computing accurate low rank approximations of large matrices is a fundamental
data mining task. In many applications, however, the matrix contains sensitive
information about individuals. In such cases we would like to release a low rank
approximation that satisfies a strong privacy guarantee such as differential
privacy. Unfortunately, to date the best known algorithm for this task that
satisfies differential privacy is based on naive input perturbation or
randomized response: Each entry of the matrix is perturbed independently by a
sufficiently large random noise variable, and a low rank approximation is then
computed on the resulting matrix.
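As a point of reference, a minimal sketch of this baseline could look as follows (a hypothetical helper; the noise scale `sigma` is an illustrative placeholder, not the calibration the privacy guarantee actually requires):

```python
import numpy as np

def randomized_response_low_rank(A, k, sigma, rng=np.random.default_rng(0)):
    """Perturb every entry of A independently, then truncate the SVD to rank k."""
    noise = rng.normal(scale=sigma, size=A.shape)  # drawing this alone is O(mn)
    U, s, Vt = np.linalg.svd(A + noise, full_matrices=False)
    return (U[:, :k] * s[:k]) @ Vt[:k, :]          # best rank-k fit of the noisy matrix
```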
We give the first significant improvements in accuracy over randomized
response under the natural and necessary assumption that the matrix has low
coherence. Our algorithm is also very efficient and finds a constant rank
approximation of an m x n matrix in time O(mn). Note that even generating the
noise matrix required for randomized response already requires time O(mn).
The Noisy Power Method: A Meta Algorithm with Applications
We provide a new robust convergence analysis of the well-known power method
for computing the dominant singular vectors of a matrix, in a variant we call
the noisy power method. Our result characterizes the convergence behavior of
the algorithm when a significant amount of noise is introduced after each
matrix-vector multiplication. The noisy power method can be seen as a
meta-algorithm that has recently found a number of important applications in a
broad range of machine learning problems including alternating minimization for
matrix completion, streaming principal component analysis (PCA), and
privacy-preserving spectral analysis. Our general analysis subsumes several
existing ad-hoc convergence bounds and resolves a number of open problems in
multiple applications including streaming PCA and privacy-preserving singular
vector computation.
Comment: NIPS 2014.
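The core iteration is simple enough to sketch (a hedged illustration, using Gaussian noise as a stand-in for whatever perturbation the application introduces; for a general rectangular matrix one would run it on A A^T):

```python
import numpy as np

def noisy_power_method(A, p, iters, noise_std, rng=np.random.default_rng(0)):
    """Subspace iteration on a symmetric matrix A, with noise injected
    after every matrix-vector multiplication."""
    n = A.shape[0]
    X, _ = np.linalg.qr(rng.normal(size=(n, p)))               # random orthonormal start
    for _ in range(iters):
        Y = A @ X + rng.normal(scale=noise_std, size=(n, p))   # noisy multiply
        X, _ = np.linalg.qr(Y)                                 # re-orthonormalize
    return X  # orthonormal basis approximating the top-p eigenspace
```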
Optimality of the Johnson-Lindenstrauss Lemma
For any integers $n, d \ge 2$ and $(\min\{n,d\})^{-0.4999} < \varepsilon < 1$, we show the
existence of a set of $n$ vectors $X \subset \mathbb{R}^d$ such that any embedding
$f : X \to \mathbb{R}^m$ satisfying
$$\forall x, y \in X,\ (1-\varepsilon)\|x-y\|_2^2 \le \|f(x)-f(y)\|_2^2 \le (1+\varepsilon)\|x-y\|_2^2$$
must have $m = \Omega(\varepsilon^{-2} \lg n)$. This lower bound matches the upper bound
given by the Johnson-Lindenstrauss lemma [JL84]. Furthermore, our lower bound holds
for nearly the full range of $\varepsilon$ of interest, since there is always an
isometric embedding into dimension $\min\{d, n\}$ (either the identity map, or
projection onto $\mathrm{span}(X)$).
Previously such a lower bound was only known to hold against linear maps $f$, and
not for such a wide range of parameters $\varepsilon$ [LN16]. The best previously
known lower bound for general $f$ was $m = \Omega(\varepsilon^{-2} \lg n / \lg(1/\varepsilon))$
[Wel74, Lev83, Alo03], which is suboptimal for any $\varepsilon = o(1)$.
Comment: v2: simplified proof, also added reference to Lev83.
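The matching upper bound is easy to see empirically: a Gaussian random projection into $m = O(\varepsilon^{-2} \lg n)$ dimensions preserves all pairwise squared distances up to $1 \pm \varepsilon$ with high probability. A small demonstration (the constant 12 in the target dimension is an illustrative choice, not a tight one):

```python
import numpy as np
from scipy.spatial.distance import pdist

rng = np.random.default_rng(0)
n, d, eps = 200, 2000, 0.3
m = int(np.ceil(12 * np.log(n) / eps**2))  # m = O(eps^-2 log n), here 707

X = rng.normal(size=(n, d))                # n arbitrary points in R^d
P = rng.normal(size=(d, m)) / np.sqrt(m)   # scaled Gaussian projection
Y = X @ P

ratios = pdist(Y)**2 / pdist(X)**2         # squared-distance distortions
print(ratios.min(), ratios.max())          # typically inside [1-eps, 1+eps]
```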
Wishart Mechanism for Differentially Private Principal Components Analysis
We propose a new input perturbation mechanism for publishing a covariance
matrix to achieve $(\epsilon, 0)$-differential privacy. Our mechanism uses a
Wishart distribution to generate matrix noise. In particular, we apply this
mechanism to principal component analysis. Our mechanism is able to keep the
positive semi-definiteness of the published covariance matrix. Thus, our
approach gives rise to a general publishing framework for input perturbation of
a symmetric positive semidefinite matrix. Moreover, compared with the classic
Laplace mechanism, our method has a better utility guarantee. To the best of our
knowledge, the Wishart mechanism is the best input perturbation approach for
$(\epsilon, 0)$-differentially private PCA. We also compare our work with
previous exponential mechanism algorithms in the literature and provide a
near-optimal bound while being more flexible and less computationally expensive.
Comment: A full version with technical proofs. Accepted to AAAI-16.
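A hedged sketch of the mechanism's shape (the degrees of freedom and scale below are illustrative placeholders, not the paper's privacy calibration):

```python
import numpy as np
from scipy.stats import wishart

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 20))    # data matrix, one row per individual
C = X.T @ X                        # empirical (unnormalized) covariance, PSD

d = C.shape[0]
scale = 0.1 * np.eye(d)            # placeholder; would depend on the privacy budget
E = wishart(df=d + 1, scale=scale).rvs(random_state=0)  # Wishart noise is always PSD

C_priv = C + E                     # sum of PSD matrices stays PSD
eigvals, eigvecs = np.linalg.eigh(C_priv)
components = eigvecs[:, -5:]       # top-5 private principal components
```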