Guarantees of Riemannian Optimization for Low Rank Matrix Completion
We study the Riemannian optimization methods on the embedded manifold of low
rank matrices for the problem of matrix completion, which is about recovering a
low rank matrix from its partial entries. Assume $m$ entries of an $n\times n$
rank $r$ matrix are sampled independently and uniformly with replacement. We
first prove that with high probability the Riemannian gradient descent and
conjugate gradient descent algorithms initialized by one step hard thresholding
are guaranteed to converge linearly to the measured matrix provided
\begin{align*} m\geq C_\kappa n^{1.5}r\log^{1.5}(n), \end{align*} where
$C_\kappa$ is a numerical constant depending on the condition number of the
underlying matrix. The sampling complexity has been further improved to
\begin{align*} m\geq C_\kappa nr^2\log^{2}(n) \end{align*} via the resampled
Riemannian gradient descent initialization. The analysis of the new
initialization procedure relies on an asymmetric restricted isometry property
of the sampling operator and the curvature of the low rank matrix manifold.
Numerical simulation shows that the algorithms are able to recover a low rank
matrix from nearly the minimum number of measurements.
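For concreteness, here is a minimal numerical sketch in the spirit of the abstract: a one-step hard-thresholding initialization (truncated SVD of the rescaled zero-filled observations) followed by gradient steps on the sampled entries that are truncated back to rank $r$. The function names, step size, iteration count, and the plain rank-$r$ truncation used in place of the paper's tangent-space retraction are illustrative assumptions, not the authors' exact algorithm.

```python
import numpy as np

def hard_threshold(X, r):
    """Rank-r truncation of X via the SVD (projection onto rank-r matrices)."""
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    return (U[:, :r] * s[:r]) @ Vt[:r, :]

def complete_low_rank(M_obs, mask, r, steps=500, eta=1.0):
    """Illustrative gradient scheme for matrix completion (not the paper's exact method).

    M_obs holds the observed entries (zeros elsewhere), mask is the sampling set.
    Initialization is one step of hard thresholding on the rescaled observations;
    each iteration takes a gradient step on 0.5*||P_Omega(X - M)||^2, rescaled by
    the sampling ratio p, and truncates back to rank r.
    """
    p = mask.mean()
    X = hard_threshold(M_obs / p, r)               # one-step hard-thresholding init
    for _ in range(steps):
        residual = mask * (X - M_obs)              # gradient restricted to Omega
        X = hard_threshold(X - (eta / p) * residual, r)
    return X

# toy usage on a random rank-2 matrix with roughly 30% of the entries observed
rng = np.random.default_rng(0)
n, r = 100, 2
M = rng.standard_normal((n, r)) @ rng.standard_normal((r, n))
mask = rng.random((n, n)) < 0.3
X_hat = complete_low_rank(mask * M, mask, r)
print(np.linalg.norm(X_hat - M) / np.linalg.norm(M))
```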
Alternating least squares as moving subspace correction
In this note we take a new look at the local convergence of alternating
optimization methods for low-rank matrices and tensors. Our abstract
interpretation as sequential optimization on moving subspaces yields insightful
reformulations of some known convergence conditions that focus on the interplay
between the contractivity of classical multiplicative Schwarz methods with
overlapping subspaces and the curvature of low-rank matrix and tensor
manifolds. While the verification of the abstract conditions in concrete
scenarios remains open in most cases, we are able to provide an alternative and
conceptually simple derivation of the asymptotic convergence rate of the
two-sided block power method of numerical algebra for computing the dominant
singular subspaces of a rectangular matrix. This method is equivalent to an
alternating least squares method applied to a distance function. The
theoretical results are illustrated and validated by numerical experiments.
Comment: 20 pages, 4 figures
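As one illustration of the two-sided block power method mentioned in the abstract, the sketch below alternately refines orthonormal bases for the dominant left and right singular subspaces; the function name, iteration count, and usage are illustrative assumptions, and this is the textbook subspace-iteration form rather than anything specific to the paper's analysis.

```python
import numpy as np

def block_power_svd(A, k, iters=200):
    """Two-sided block power iteration for the dominant k singular subspaces of A.

    Alternately multiplies by A and A.T and re-orthonormalizes, which is the
    subspace-iteration view of the alternating least squares scheme; names and
    parameters here are illustrative.
    """
    rng = np.random.default_rng(0)
    V, _ = np.linalg.qr(rng.standard_normal((A.shape[1], k)))   # random right basis
    for _ in range(iters):
        U, _ = np.linalg.qr(A @ V)        # refine left subspace
        V, _ = np.linalg.qr(A.T @ U)      # refine right subspace
    B = U.T @ A @ V                        # small projected matrix
    Ub, s, Vbt = np.linalg.svd(B)          # rotate bases to singular vectors
    return U @ Ub, s, V @ Vbt.T

# usage: compare against the dominant singular values from a full SVD
A = np.random.default_rng(1).standard_normal((80, 50))
U, s, V = block_power_svd(A, 5)
print(s)
print(np.linalg.svd(A, compute_uv=False)[:5])
```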
Convergence results for projected line-search methods on varieties of low-rank matrices via Łojasiewicz inequality
The aim of this paper is to derive convergence results for projected
line-search methods on the real-algebraic variety $\mathcal{M}_{\le k}$ of real
$m \times n$ matrices of rank at most $k$. Such methods extend Riemannian
optimization methods, which are successfully used on the smooth manifold
$\mathcal{M}_k$ of rank-$k$ matrices, to its closure by taking steps along
gradient-related directions in the tangent cone, and afterwards projecting back
to $\mathcal{M}_{\le k}$. Considering such a method circumvents the
difficulties which arise from the nonclosedness and the unbounded curvature of
$\mathcal{M}_k$. The pointwise convergence is obtained for real-analytic
functions on the basis of a Łojasiewicz inequality for the projection of the
antigradient to the tangent cone. If the derived limit point lies on the smooth
part of $\mathcal{M}_{\le k}$, i.e. in $\mathcal{M}_k$, this boils down to more
or less known results, but with the benefit that asymptotic convergence rate
estimates (for specific step-sizes) can be obtained without an a priori
curvature bound, simply from the fact that the limit lies on a smooth manifold.
At the same time, one can give a convincing justification for assuming critical
points to lie in $\mathcal{M}_k$: if $X$ is a critical point of $f$ on
$\mathcal{M}_{\le k}$, then either $X$ has rank $k$, or $\nabla f(X) = 0$.
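A minimal sketch of a projected line-search step in this spirit: the antigradient is projected (here only onto the tangent space at a rank-$k$ point, ignoring the extra directions the tangent cone has at rank-deficient points), a backtracking step is taken, and the iterate is mapped back to $\mathcal{M}_{\le k}$ by truncated SVD. All function names, Armijo constants, and the toy objective are illustrative assumptions rather than the paper's setup.

```python
import numpy as np

def svd_truncate(X, k):
    """Metric projection of X onto the variety of matrices of rank at most k."""
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    return (U[:, :k] * s[:k]) @ Vt[:k, :]

def tangent_project(X, G, k):
    """Project G onto the tangent space at a rank-k point X.

    At rank-deficient points the tangent cone contains additional directions,
    which this simplified sketch ignores.
    """
    U, _, Vt = np.linalg.svd(X, full_matrices=False)
    U, V = U[:, :k], Vt[:k, :].T
    PU, PV = U @ U.T, V @ V.T
    return PU @ G + G @ PV - PU @ G @ PV

def projected_line_search(f, grad, X0, k, steps=100, alpha0=1.0, beta=0.5):
    """Armijo-type projected line search on the rank-at-most-k variety (illustrative)."""
    X = svd_truncate(X0, k)
    for _ in range(steps):
        D = -tangent_project(X, grad(X), k)        # projected antigradient
        alpha, fX = alpha0, f(X)
        while f(svd_truncate(X + alpha * D, k)) > fX - 1e-4 * alpha * np.sum(D * D):
            alpha *= beta                           # backtrack until sufficient decrease
            if alpha < 1e-12:
                return X
        X = svd_truncate(X + alpha * D, k)
    return X

# usage: minimize 0.5*||X - A||^2 over rank-4 matrices (A itself has rank 4)
rng = np.random.default_rng(2)
A = rng.standard_normal((30, 4)) @ rng.standard_normal((4, 30))
f = lambda X: 0.5 * np.linalg.norm(X - A) ** 2
grad = lambda X: X - A
X_hat = projected_line_search(f, grad, rng.standard_normal(A.shape), 4)
print(np.linalg.norm(X_hat - A))
```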
Regression on fixed-rank positive semidefinite matrices: a Riemannian approach
The paper addresses the problem of learning a regression model parameterized
by a fixed-rank positive semidefinite matrix. The focus is on the nonlinear
nature of the search space and on scalability to high-dimensional problems. The
mathematical developments rely on the theory of gradient descent algorithms
adapted to the Riemannian geometry that underlies the set of fixed-rank
positive semidefinite matrices. In contrast with previous contributions in the
literature, no restrictions are imposed on the range space of the learned
matrix. The resulting algorithms maintain a linear complexity in the problem
size and enjoy important invariance properties. We apply the proposed
algorithms to the problem of learning a distance function parameterized by a
positive semidefinite matrix. Good performance is observed on classical
benchmarks.
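To make the setting concrete, the sketch below fits a fixed-rank positive semidefinite matrix $W = GG^T$ to squared-distance targets by gradient descent on the factor $G$, which keeps every iterate exactly rank-$r$ PSD and the per-step cost linear in the ambient dimension, in the spirit of the abstract. It is a plain factored-gradient illustration with assumed names and parameters, not the paper's Riemannian algorithm.

```python
import numpy as np

def learn_fixed_rank_metric(Z, targets, rank, epochs=1000, eta=1e-4):
    """Fit a rank-`rank` PSD matrix W = G @ G.T so that z^T W z matches targets.

    Gradient descent runs on the low-rank factor G, so every iterate is exactly
    rank-`rank` PSD and each step costs O(dim * rank) per sample. Illustrative
    sketch only; not the paper's exact Riemannian geometry.
    """
    n_samples, dim = Z.shape
    rng = np.random.default_rng(0)
    G = rng.standard_normal((dim, rank)) / np.sqrt(dim)
    for _ in range(epochs):
        P = Z @ G                                   # (n_samples, rank) projections
        err = np.sum(P * P, axis=1) - targets       # z^T W z - target
        grad = 2.0 * Z.T @ (err[:, None] * P) / n_samples
        G -= eta * grad
    return G @ G.T

# usage: recover a ground-truth low-rank metric from squared-distance targets
rng = np.random.default_rng(3)
dim, rank, n = 10, 2, 200
G_true = rng.standard_normal((dim, rank))
W_true = G_true @ G_true.T
Z = rng.standard_normal((n, dim))                   # pairwise difference vectors
targets = np.einsum('ij,jk,ik->i', Z, W_true, Z)    # true squared distances
W_hat = learn_fixed_rank_metric(Z, targets, rank)
print(np.linalg.norm(W_hat - W_true) / np.linalg.norm(W_true))
```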