1,172 research outputs found
A Sparse Johnson--Lindenstrauss Transform
Dimension reduction is a key algorithmic tool with many applications
including nearest-neighbor search, compressed sensing and linear algebra in the
streaming model. In this work we obtain a {\em sparse} version of the
fundamental tool in dimension reduction --- the Johnson--Lindenstrauss
transform. Using hashing and local densification, we construct a sparse
projection matrix with just $\tilde{O}(\frac{1}{\epsilon})$ non-zero entries
per column. We also show a matching lower bound on the sparsity for a large
class of projection matrices. Our bounds are somewhat surprising, given the
known lower bounds of $\Omega(\frac{1}{\epsilon^2})$ both on the number of rows
of any projection matrix and on the sparsity of projection matrices generated
by natural constructions.
Using this, we achieve an $\tilde{O}(\frac{1}{\epsilon})$ update time per
non-zero element for a $(1\pm\epsilon)$-approximate projection, thereby
substantially outperforming the $\Omega(\frac{1}{\epsilon^2})$ update time
required by prior approaches. A variant of our method offers the same
guarantees for sparse vectors, yet its $O(d)$ worst case running time
matches the best approach of Ailon and Liberty.
Comment: 10 pages, conference version
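To make the sparsity claim concrete, the following is a minimal Python sketch of the generic sparse-JL shape: a random $m \times n$ matrix with exactly $s$ signed non-zero entries per column. It is illustrative only; the sizes n, m, s are arbitrary and the matrix is not the paper's hashing-and-local-densification construction.
```python
import numpy as np

def sparse_jl_matrix(n, m, s, rng):
    """Random m x n matrix with exactly s non-zeros (+-1/sqrt(s)) per column:
    the generic sparse-JL shape, not the paper's exact construction."""
    A = np.zeros((m, n))
    for j in range(n):
        rows = rng.choice(m, size=s, replace=False)  # "hash" column j to s rows
        A[rows, j] = rng.choice([-1.0, 1.0], size=s) / np.sqrt(s)
    return A

rng = np.random.default_rng(0)
n, m, s = 10_000, 400, 8                 # illustrative sizes, not tuned bounds
A = sparse_jl_matrix(n, m, s, rng)
x = rng.standard_normal(n)
print(abs(np.linalg.norm(A @ x) / np.linalg.norm(x) - 1.0))  # small distortion
```
Applying such a matrix costs time proportional to $s$ per non-zero coordinate of the input vector, which is why the per-column sparsity is the quantity of interest.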
Toward a unified theory of sparse dimensionality reduction in Euclidean space
Let $\Phi$ be an $m \times n$ sparse Johnson-Lindenstrauss
transform [KN14] with $s$ non-zeroes per column. For a subset $T$ of the unit
sphere and a given $\varepsilon \in (0,1/2)$, we study settings for $m, s$ required to
ensure $\mathbb{E}_\Phi \sup_{x \in T} \left|\|\Phi x\|_2^2 - 1\right| < \varepsilon$,
i.e. so that $\Phi$ preserves the norm of every $x \in T$
simultaneously and multiplicatively up to $1 \pm \varepsilon$. We
introduce a new complexity parameter, which depends on the geometry of $T$, and
show that it suffices to choose $s$ and $m$ such that this parameter is small.
Our result is a sparse analog of Gordon's theorem, which was concerned with a
dense $\Phi$ having i.i.d. Gaussian entries. We qualitatively unify several
results related to the Johnson-Lindenstrauss lemma, subspace embeddings, and
Fourier-based restricted isometries. Our work also implies new results in using
the sparse Johnson-Lindenstrauss transform in numerical linear algebra,
classical and model-based compressed sensing, manifold learning, and
constrained least squares problems such as the Lasso.
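The guarantee above, $\mathbb{E}_\Phi \sup_{x \in T} |\|\Phi x\|_2^2 - 1| < \varepsilon$, can be estimated numerically for a finite $T$. Below is a small self-contained sketch that samples $T$ from a random low-dimensional subspace and measures the empirical worst-case distortion of a generic sparse embedding; the matrix follows the same illustrative sparse-JL shape as the earlier sketch, not the exact [KN14] distribution, and all sizes are arbitrary.
```python
import numpy as np

rng = np.random.default_rng(1)
n, m, s = 2_000, 300, 8           # illustrative sizes only

# Sparse embedding: s non-zeros (+-1/sqrt(s)) per column (generic shape).
Phi = np.zeros((m, n))
for j in range(n):
    rows = rng.choice(m, size=s, replace=False)
    Phi[rows, j] = rng.choice([-1.0, 1.0], size=s) / np.sqrt(s)

# T: 500 unit vectors drawn from a random 10-dimensional subspace of R^n.
basis, _ = np.linalg.qr(rng.standard_normal((n, 10)))
X = basis @ rng.standard_normal((10, 500))
T = X / np.linalg.norm(X, axis=0)                  # columns lie on the sphere

# Empirical sup over x in T of | ||Phi x||_2^2 - 1 |.
print(np.max(np.abs(np.sum((Phi @ T) ** 2, axis=0) - 1.0)))
```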
A Derandomized Sparse Johnson-Lindenstrauss Transform
Recent work of [Dasgupta-Kumar-Sarlós, STOC 2010] gave a sparse Johnson-Lindenstrauss transform and left as a main open question whether their construction could be efficiently derandomized. We answer their question affirmatively by giving an alternative proof of their result requiring only bounded independence hash functions. Furthermore, the sparsity bound obtained in our proof is improved. Our work implies the first implementation of a Johnson-Lindenstrauss transform in data streams with sublinear update time.
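The bounded independence hash functions used here are $k$-wise independent families; a standard construction (not specific to this paper) evaluates a random degree-$(k-1)$ polynomial over a prime field. The sketch below shows only that textbook family, with arbitrary illustrative parameters; how the paper chooses $k$ and combines the hashes is not reproduced.
```python
import numpy as np

P = 2_147_483_647  # Mersenne prime 2^31 - 1; domain elements must lie in [0, P)

def make_kwise_hash(k, num_buckets, rng):
    """k-wise independent hash via a random degree-(k-1) polynomial over GF(P).
    The final reduction mod num_buckets is only near-uniform, as usual."""
    coeffs = [int(c) for c in rng.integers(0, P, size=k)]
    def h(x):
        acc = 0
        for c in coeffs:          # Horner evaluation of the polynomial mod P
            acc = (acc * x + c) % P
        return acc % num_buckets
    return h

rng = np.random.default_rng(2)
h = make_kwise_hash(k=4, num_buckets=300, rng=rng)  # a 4-wise independent hash
print([h(x) for x in range(5)])
```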
Almost Optimal Unrestricted Fast Johnson-Lindenstrauss Transform
The problems of random projections and sparse reconstruction have much in
common and individually received much attention. Surprisingly, until now they
progressed in parallel and remained mostly separate. Here, we employ new tools
from probability in Banach spaces that were successfully used in the context of
sparse reconstruction to advance on an open problem in random projection. In
particular, we generalize and use an intricate result by Rudelson and Vershynin
for sparse reconstruction which uses Dudley's theorem for bounding Gaussian
processes. Our main result states that any set of $N$ real
vectors in $n$ dimensional space can be linearly mapped to a space of dimension
$k=O(\log N \,\polylog(n))$, while (1) preserving the pairwise distances among the
vectors to within any constant distortion and (2) being able to apply the
transformation in time $O(n \log n)$ on each vector. This improves on the best
known $N = 2^{O(n^{1/2})}$ achieved by Ailon and Liberty and $N = 2^{O(n^{1/3})}$ by Ailon and Chazelle.
The dependence in the distortion constant however is believed to be
suboptimal and subject to further investigation. For constant distortion, this
settles the open question posed by these authors up to a $\polylog(n)$ factor
while considerably simplifying their constructions.
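A standard concrete instance of a fast Johnson-Lindenstrauss map, useful for intuition here, is the subsampled randomized Hadamard transform: random signs, a fast Walsh-Hadamard transform, then a scaled row sample, applied in $O(n \log n)$ time. The Python sketch below shows only that classic pattern; the paper's unrestricted construction differs and proves stronger bounds.
```python
import numpy as np

def fwht(x):
    """Fast Walsh-Hadamard transform (unnormalized), O(n log n).
    len(x) must be a power of two."""
    x = x.copy()
    h = 1
    while h < len(x):
        for i in range(0, len(x), 2 * h):
            a, b = x[i:i + h].copy(), x[i + h:i + 2 * h].copy()
            x[i:i + h], x[i + h:i + 2 * h] = a + b, a - b
        h *= 2
    return x

rng = np.random.default_rng(3)
n, k = 1024, 64                          # illustrative sizes, not the paper's bounds
signs = rng.choice([-1.0, 1.0], size=n)  # random diagonal D of +-1 signs
rows = rng.choice(n, size=k, replace=False)  # random coordinate sample

def fast_jl(x):
    # x -> sqrt(n/k) * (H D x / sqrt(n))[rows]: norm-preserving in expectation.
    y = fwht(signs * x) / np.sqrt(n)     # orthonormal Hadamard rotation
    return np.sqrt(n / k) * y[rows]

x = rng.standard_normal(n)
print(abs(np.linalg.norm(fast_jl(x)) / np.linalg.norm(x) - 1.0))
```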