13 research outputs found

    Fast Conical Hull Algorithms for Near-separable Non-negative Matrix Factorization

    Full text link
The separability assumption (Donoho & Stodden, 2003; Arora et al., 2012) turns non-negative matrix factorization (NMF) into a tractable problem. Recently, a new class of provably-correct NMF algorithms has emerged under this assumption. In this paper, we reformulate the separable NMF problem as that of finding the extreme rays of the conical hull of a finite set of vectors. From this geometric perspective, we derive new separable NMF algorithms that are highly scalable and empirically noise-robust, and have several other favorable properties in relation to existing methods. A parallel implementation of our algorithm demonstrates high scalability on shared- and distributed-memory machines.
    Comment: 15 pages, 6 figures
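    The paper derives its own conical-hull algorithms, which are not reproduced here. As a minimal sketch of the same geometric idea, the following runs the classical Successive Projection Algorithm (SPA), a related greedy anchor-selection method for near-separable NMF, on a synthetic matrix; all sizes and names are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def spa(X, k):
    """Successive Projection Algorithm: greedily select k columns of X
    that approximately generate the conical hull of all columns. Under
    the separability assumption these are the extreme rays (anchors)."""
    R = X.astype(float).copy()
    anchors = []
    for _ in range(k):
        # Pick the residual column with the largest norm.
        j = int(np.argmax(np.linalg.norm(R, axis=0)))
        anchors.append(j)
        u = R[:, j] / np.linalg.norm(R[:, j])
        # Deflate: project the residual onto the orthogonal complement of u.
        R -= np.outer(u, u @ R)
    return anchors

# Tiny synthetic near-separable example: X = W @ H where H contains an
# identity block, so the first 3 columns of X are the true anchors.
rng = np.random.default_rng(0)
W = rng.random((20, 3))
H = np.hstack([np.eye(3), rng.dirichlet(np.ones(3), size=7).T])
X = W @ H + 1e-3 * rng.random((20, 10))
print(spa(X, 3))  # expected to recover indices {0, 1, 2}, in some order
```

    Under exact separability, SPA provably recovers the anchor columns; the paper's conical-hull algorithms pursue the same extreme rays with stronger scalability and noise-robustness properties.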

    CUR Algorithm for Partially Observed Matrices

    Full text link
    CUR matrix decomposition computes a low-rank approximation of a given matrix using actual rows and columns of the matrix. It has been a very useful tool for handling large matrices. One limitation of the existing algorithms for CUR matrix decomposition is that they need access to the full matrix, a requirement that can be difficult to fulfill in many real-world applications. In this work, we alleviate this limitation by developing a CUR decomposition algorithm for partially observed matrices. In particular, the proposed algorithm computes the low-rank approximation of the target matrix based on (i) randomly sampled rows and columns, and (ii) a subset of observed entries that are randomly sampled from the matrix. Our analysis establishes a relative error bound, measured in spectral norm, for the proposed algorithm when the target matrix is of full rank. We also show that only $O(nr\ln r)$ observed entries are needed by the proposed algorithm to perfectly recover a rank-$r$ matrix of size $n \times n$, which improves the sample complexity of the existing algorithms for matrix completion. Empirical studies on both synthetic and real-world datasets verify our theoretical claims and demonstrate the effectiveness of the proposed algorithm.
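    For orientation, the sketch below shows the classical, fully observed CUR skeleton estimate $A \approx C W^+ R$ with uniformly sampled rows and columns. It is a baseline under assumed sizes, not the paper's algorithm: the paper's contribution is replacing full access to $A$ with a small random subset of observed entries, which this sketch does not implement.

```python
import numpy as np

rng = np.random.default_rng(1)

# Exactly rank-r target matrix of size n x n.
n, r = 100, 5
A = rng.random((n, r)) @ rng.random((r, n))

# Uniformly sample row and column index sets. (The paper additionally
# works from a random subset of observed entries instead of reading
# all of A; here the whole matrix is available.)
idx_c = rng.choice(n, size=4 * r, replace=False)
idx_r = rng.choice(n, size=4 * r, replace=False)
C = A[:, idx_c]               # sampled columns
R = A[idx_r, :]               # sampled rows
W = A[np.ix_(idx_r, idx_c)]   # row/column intersection

# Classical CUR skeleton estimate: A ~= C W^+ R. For an exactly rank-r
# matrix this recovers A whenever the samples capture the rank.
A_hat = C @ np.linalg.pinv(W) @ R
print(np.linalg.norm(A - A_hat, 2) / np.linalg.norm(A, 2))  # spectral norm, ~0
```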

    Optimal CUR Matrix Decompositions

    Full text link
    The CUR decomposition of an $m \times n$ matrix $A$ finds an $m \times c$ matrix $C$ with a subset of $c < n$ columns of $A$, together with an $r \times n$ matrix $R$ with a subset of $r < m$ rows of $A$, as well as a $c \times r$ low-rank matrix $U$ such that the matrix $CUR$ approximates the matrix $A$; that is, $\|A - CUR\|_F^2 \le (1+\epsilon) \|A - A_k\|_F^2$, where $\|\cdot\|_F$ denotes the Frobenius norm and $A_k$ is the best rank-$k$ approximation of $A$ constructed via the SVD. We present input-sparsity-time and deterministic algorithms for constructing such a CUR decomposition where $c = O(k/\epsilon)$, $r = O(k/\epsilon)$, and $\mathrm{rank}(U) = k$. Up to constant factors, our algorithms are simultaneously optimal in $c$, $r$, and $\mathrm{rank}(U)$.
    Comment: small revision in Lemma 4.
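    The quantity the paper's bound controls can be checked numerically. The sketch below compares a naive uniformly sampled CUR against the SVD benchmark $A_k$; uniform sampling carries no $(1+\epsilon)$ guarantee and this is not the paper's input-sparsity-time algorithm (nor does its $U$ have rank exactly $k$), so the sizes and sampling scheme here are illustrative assumptions only.

```python
import numpy as np

rng = np.random.default_rng(2)
m, n, k = 80, 60, 5

# Noisy low-rank matrix.
A = rng.random((m, k)) @ rng.random((k, n)) + 0.01 * rng.standard_normal((m, n))

# Benchmark: best rank-k approximation A_k from the truncated SVD.
U_, s, Vt = np.linalg.svd(A, full_matrices=False)
A_k = (U_[:, :k] * s[:k]) @ Vt[:k]

# Naive CUR with uniform sampling and c = r = 10k (no guarantee; the
# paper achieves c = r = O(k/eps), optimal up to constant factors).
cols = rng.choice(n, size=10 * k, replace=False)
rows = rng.choice(m, size=10 * k, replace=False)
C, R = A[:, cols], A[rows, :]
U = np.linalg.pinv(C) @ A @ np.linalg.pinv(R)  # Frobenius-optimal U for this C, R

ratio = (np.linalg.norm(A - C @ U @ R, "fro") / np.linalg.norm(A - A_k, "fro")) ** 2
print("empirical (1 + eps):", ratio)
```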

    Convex and Network Flow Optimization for Structured Sparsity

    Get PDF
    We consider a class of learning problems regularized by a structured sparsity-inducing norm defined as the sum of $\ell_2$- or $\ell_\infty$-norms over groups of variables. Whereas much effort has been put into developing fast optimization techniques when the groups are disjoint or embedded in a hierarchy, we address here the case of general overlapping groups. To this end, we present two different strategies: on the one hand, we show that the proximal operator associated with a sum of $\ell_\infty$-norms can be computed exactly in polynomial time by solving a quadratic min-cost flow problem, allowing the use of accelerated proximal gradient methods. On the other hand, we use proximal splitting techniques and address an equivalent formulation with non-overlapping groups, but in higher dimension and with additional constraints. We propose efficient and scalable algorithms exploiting these two strategies, which are significantly faster than alternative approaches. We illustrate these methods on several problems such as CUR matrix factorization, multi-task learning of tree-structured dictionaries, background subtraction in video sequences, image denoising with wavelets, and topographic dictionary learning of natural image patches.
    Comment: to appear in the Journal of Machine Learning Research (JMLR).
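    The paper's flow-based machinery handles overlapping $\ell_\infty$-groups; as a simple point of reference, the sketch below shows the closed-form proximal operator in the much easier disjoint $\ell_2$-group case (block soft-thresholding), which the paper's min-cost-flow and splitting strategies generalize. The function name and example data are assumptions for illustration.

```python
import numpy as np

def prox_group_l2(v, groups, lam):
    """Proximal operator of lam * sum_g ||v_g||_2 for DISJOINT groups:
    block soft-thresholding. A closed form exists only because the
    groups do not overlap; overlapping groups need the paper's methods."""
    out = v.copy()
    for g in groups:
        norm_g = np.linalg.norm(v[g])
        # Zero out the whole group if its norm is below lam,
        # otherwise shrink the group radially toward zero.
        out[g] = 0.0 if norm_g <= lam else (1 - lam / norm_g) * v[g]
    return out

v = np.array([3.0, -4.0, 0.1, 0.2, 5.0])
groups = [np.array([0, 1]), np.array([2, 3]), np.array([4])]
print(prox_group_l2(v, groups, lam=1.0))
# group [0,1]: norm 5 -> shrunk by 0.8; group [2,3]: norm ~0.22 -> zeroed
```

    Inside an accelerated proximal gradient loop, each iteration applies this operator to a gradient step; the paper's contribution is making the analogous prox computation exact and polynomial-time when the $\ell_\infty$-groups overlap.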