Search CORE

65 research outputs found

Fourier PCA and Robust Tensor Decomposition

Author: Anandkumar A.
Anandkumar A.
Anderson J.
Arora S.
Belkin M.
Belkin M.
Cardoso J.
Chaudhuri K.
Comon P.
Dasgupta S.
Hyvärinen A.
Kannan R.
Publication venue
Publication date: 27/06/2014
Field of study

Fourier PCA is Principal Component Analysis of a matrix obtained from higher order derivatives of the logarithm of the Fourier transform of a distribution.We make this method algorithmic by developing a tensor decomposition method for a pair of tensors sharing the same vectors in rank-

1

decompositions. Our main application is the first provably polynomial-time algorithm for underdetermined ICA, i.e., learning an

n \times m

matrix

A

from observations

y=Ax

where

x

is drawn from an unknown product distribution with arbitrary non-Gaussian components. The number of component distributions

m

can be arbitrarily higher than the dimension

n

and the columns of

A

only need to satisfy a natural and efficiently verifiable nondegeneracy condition. As a second application, we give an alternative algorithm for learning mixtures of spherical Gaussians with linearly independent means. These results also hold in the presence of Gaussian noise.Comment: Extensively revised; details added; minor errors corrected; exposition improve

arXiv.org e-Print Archive

CiteSeerX

Crossref

Max vs Min: Tensor Decomposition and ICA with nearly Linear Sample Complexity

Author: Vempala Santosh S.
Xiao Ying
Publication venue
Publication date: 01/01/2015
Field of study

We present a simple, general technique for reducing the sample complexity of matrix and tensor decomposition algorithms applied to distributions. We use the technique to give a polynomial-time algorithm for standard ICA with sample complexity nearly linear in the dimension, thereby improving substantially on previous bounds. The analysis is based on properties of random polynomials, namely the spacings of an ensemble of polynomials. Our technique also applies to other applications of tensor decompositions, including spherical Gaussian mixture models

arXiv.org e-Print Archive

CiteSeerX

Heavy-tailed Independent Component Analysis

Author: Anderson Joseph
Goyal Navin
Nandi Anupama
Rademacher Luis
Publication venue
Publication date: 02/09/2015
Field of study

Independent component analysis (ICA) is the problem of efficiently recovering a matrix

A \in \mathbb{R}^{n\times n}

from i.i.d. observations of

X=AS

where

S \in \mathbb{R}^n

is a random vector with mutually independent coordinates. This problem has been intensively studied, but all existing efficient algorithms with provable guarantees require that the coordinates

S_i

have finite fourth moments. We consider the heavy-tailed ICA problem where we do not make this assumption, about the second moment. This problem also has received considerable attention in the applied literature. In the present work, we first give a provably efficient algorithm that works under the assumption that for constant

\gamma > 0

, each

S_i

has finite

(1+\gamma)

-moment, thus substantially weakening the moment requirement condition for the ICA problem to be solvable. We then give an algorithm that works under the assumption that matrix

A

has orthogonal columns but requires no moment assumptions. Our techniques draw ideas from convex geometry and exploit standard properties of the multivariate spherical Gaussian distribution in a novel way.Comment: 30 page

arXiv.org e-Print Archive

Crossref

Probabilistic Neural Network based Approach for Handwritten Character Recognition

Author: Aradhya V.N. Manjunath
Kumar G. Hemantha
Niranjan S. K.
Publication venue: Institute for Project Management Pvt. Ltd
Publication date: 14/08/2020
Field of study

In this paper, recognition system for totally unconstrained handwritten characters for south Indian language of Kannada is proposed. The proposed feature extraction technique is based on Fourier Transform and well known Principal Component Analysis (PCA). The system trains the appropriate frequency band images followed by PCA feature extraction scheme. For subsequent classification technique, Probabilistic Neural Network (PNN) is used. The proposed system is tested on large database containing Kannada characters and also tested on standard COIL-20 object database and the results were found to be better compared to standard techniques

Interscience Research Network

The approach chosen for data dimensionality reduction affects the results of running technique clustering

Author: Cazzola Dario
Chen Xi
Preatoni Ezio
Rivadulla Adrian
Trewartha Grant
Publication venue: International Society of Biomechanics (ISB)
Publication date: 03/08/2023
Field of study

OPUS

The approach chosen for data dimensionality reduction affects the results of running technique clustering

Author: Cazzola Dario
Chen Xi
Preatoni Ezio
Rivadulla Adrian
Trewartha Grant
Publication venue: International Society of Biomechanics (ISB)
Publication date: 03/08/2023
Field of study

OPUS

Overcomplete Independent Component Analysis via SDP

Author: Bach Francis
d'Aspremont Alexandre
Perry Amelia
Podosinnikova Anastasia
Sontag David
Wein Alexander
Publication venue
Publication date: 24/01/2019
Field of study

We present a novel algorithm for overcomplete independent components analysis (ICA), where the number of latent sources k exceeds the dimension p of observed variables. Previous algorithms either suffer from high computational complexity or make strong assumptions about the form of the mixing matrix. Our algorithm does not make any sparsity assumption yet enjoys favorable computational and theoretical properties. Our algorithm consists of two main steps: (a) estimation of the Hessians of the cumulant generating function (as opposed to the fourth and higher order cumulants used by most algorithms) and (b) a novel semi-definite programming (SDP) relaxation for recovering a mixing component. We show that this relaxation can be efficiently solved with a projected accelerated gradient descent method, which makes the whole algorithm computationally practical. Moreover, we conjecture that the proposed program recovers a mixing component at the rate k < p^2/4 and prove that a mixing component can be recovered with high probability when k < (2 - epsilon) p log p when the original components are sampled uniformly at random on the hyper sphere. Experiments are provided on synthetic data and the CIFAR-10 dataset of real images.Comment: Appears in: Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS 2019). 21 page

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server