On Using Toeplitz and Circulant Matrices for Johnson-Lindenstrauss Transforms

Abstract

The Johnson-Lindenstrauss lemma is one of the corner stone results in dimensionality reduction. It says that given NN, for any set of NN vectors XRnX \subset \mathbb{R}^n, there exists a mapping f:XRmf : X \to \mathbb{R}^m such that f(X)f(X) preserves all pairwise distances between vectors in XX to within (1±ε)(1 \pm \varepsilon) if m=O(ε2lgN)m = O(\varepsilon^{-2} \lg N). Much effort has gone into developing fast embedding algorithms, with the Fast Johnson-Lindenstrauss transform of Ailon and Chazelle being one of the most well-known techniques. The current fastest algorithm that yields the optimal m=O(ε2lgN)m = O(\varepsilon^{-2}\lg N) dimensions has an embedding time of O(nlgn+ε2lg3N)O(n \lg n + \varepsilon^{-2} \lg^3 N). An exciting approach towards improving this, due to Hinrichs and Vyb\'iral, is to use a random m×nm \times n Toeplitz matrix for the embedding. Using Fast Fourier Transform, the embedding of a vector can then be computed in O(nlgm)O(n \lg m) time. The big question is of course whether m=O(ε2lgN)m = O(\varepsilon^{-2} \lg N) dimensions suffice for this technique. If so, this would end a decades long quest to obtain faster and faster Johnson-Lindenstrauss transforms. The current best analysis of the embedding of Hinrichs and Vyb\'iral shows that m=O(ε2lg2N)m = O(\varepsilon^{-2}\lg^2 N) dimensions suffices. The main result of this paper, is a proof that this analysis unfortunately cannot be tightened any further, i.e., there exists a set of NN vectors requiring m=Ω(ε2lg2N)m = \Omega(\varepsilon^{-2} \lg^2 N) for the Toeplitz approach to work

    Similar works