Search CORE

6,102 research outputs found

Sample Complexity Analysis for Learning Overcomplete Latent Variable Models through Tensor Methods

Author: Anandkumar Animashree
Ge Rong
Janzamin Majid
Publication venue
Publication date: 03/08/2014
Field of study

We provide guarantees for learning latent variable models emphasizing on the overcomplete regime, where the dimensionality of the latent space can exceed the observed dimensionality. In particular, we consider multiview mixtures, spherical Gaussian mixtures, ICA, and sparse coding models. We provide tight concentration bounds for empirical moments through novel covering arguments. We analyze parameter recovery through a simple tensor power update algorithm. In the semi-supervised setting, we exploit the label or prior information to get a rough estimate of the model parameters, and then refine it using the tensor method on unlabeled samples. We establish that learning is possible when the number of components scales as

k=o(d^{p/2})

, where

d

is the observed dimension, and

p

is the order of the observed moment employed in the tensor method. Our concentration bound analysis also leads to minimax sample complexity for semi-supervised learning of spherical Gaussian mixtures. In the unsupervised setting, we use a simple initialization algorithm based on SVD of the tensor slices, and provide guarantees under the stricter condition that

k\le \beta d

(where constant

\beta

can be larger than

1

), where the tensor method recovers the components under a polynomial running time (and exponential in

\beta

). Our analysis establishes that a wide range of overcomplete latent variable models can be learned efficiently with low computational and sample complexity through tensor decomposition methods.Comment: Title change

arXiv.org e-Print Archive

eScholarship - University of California

Spectral Norm of Random Kernel Matrices with Applications to Privacy

Author: Kasiviswanathan Shiva Prasad
Rudelson Mark
Publication venue
Publication date: 01/01/2015
Field of study

Kernel methods are an extremely popular set of techniques used for many important machine learning and data analysis applications. In addition to having good practical performances, these methods are supported by a well-developed theory. Kernel methods use an implicit mapping of the input data into a high dimensional feature space defined by a kernel function, i.e., a function returning the inner product between the images of two data points in the feature space. Central to any kernel method is the kernel matrix, which is built by evaluating the kernel function on a given sample dataset. In this paper, we initiate the study of non-asymptotic spectral theory of random kernel matrices. These are n x n random matrices whose (i,j)th entry is obtained by evaluating the kernel function on

x_i

and

x_j

, where

x_1,...,x_n

are a set of n independent random high-dimensional vectors. Our main contribution is to obtain tight upper bounds on the spectral norm (largest eigenvalue) of random kernel matrices constructed by commonly used kernel functions based on polynomials and Gaussian radial basis. As an application of these results, we provide lower bounds on the distortion needed for releasing the coefficients of kernel ridge regression under attribute privacy, a general privacy notion which captures a large class of privacy definitions. Kernel ridge regression is standard method for performing non-parametric regression that regularly outperforms traditional regression approaches in various domains. Our privacy distortion lower bounds are the first for any kernel technique, and our analysis assumes realistic scenarios for the input, unlike all previous lower bounds for other release problems which only hold under very restrictive input settings.Comment: 16 pages, 1 Figur

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server

Concentration of weakly dependent Banach-valued sums and applications to statistical learning methods

Author: Blanchard Gilles
Zadorozhnyi Oleksandr
Publication venue
Publication date: 09/12/2018
Field of study

We obtain a Bernstein-type inequality for sums of Banach-valued random variables satisfying a weak dependence assumption of general type and under certain smoothness assumptions of the underlying Banach norm. We use this inequality in order to investigate in the asymptotical regime the error upper bounds for the broad family of spectral regularization methods for reproducing kernel decision rules, when trained on a sample coming from a

\tau-

mixing process.Comment: 39 page

arXiv.org e-Print Archive

HAL Descartes

Hal-Diderot