Search CORE

360 research outputs found

Relaxed Majorization-Minimization for Non-smooth and Non-convex Optimization

Author: Lin Zhouchen
Xu Chen
Zha Hongbin
Zhao Zhenyu
Publication venue
Publication date: 25/11/2015
Field of study

We propose a new majorization-minimization (MM) method for non-smooth and non-convex programs, which is general enough to include the existing MM methods. Besides the local majorization condition, we only require that the difference between the directional derivatives of the objective function and its surrogate function vanishes when the number of iterations approaches infinity, which is a very weak condition. So our method can use a surrogate function that directly approximates the non-smooth objective function. In comparison, all the existing MM methods construct the surrogate function by approximating the smooth component of the objective function. We apply our relaxed MM methods to the robust matrix factorization (RMF) problem with different regularizations, where our locally majorant algorithm shows advantages over the state-of-the-art approaches for RMF. This is the first algorithm for RMF ensuring, without extra assumptions, that any limit point of the iterates is a stationary point.Comment: AAAI1

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

A D.C. Programming Approach to the Sparse Generalized Eigenvalue Problem

Author: Lanckriet Gert
Sriperumbudur Bharath
Torres David
Publication venue
Publication date: 01/01/2009
Field of study

In this paper, we consider the sparse eigenvalue problem wherein the goal is to obtain a sparse solution to the generalized eigenvalue problem. We achieve this by constraining the cardinality of the solution to the generalized eigenvalue problem and obtain sparse principal component analysis (PCA), sparse canonical correlation analysis (CCA) and sparse Fisher discriminant analysis (FDA) as special cases. Unlike the

\ell_1

-norm approximation to the cardinality constraint, which previous methods have used in the context of sparse PCA, we propose a tighter approximation that is related to the negative log-likelihood of a Student's t-distribution. The problem is then framed as a d.c. (difference of convex functions) program and is solved as a sequence of convex programs by invoking the majorization-minimization method. The resulting algorithm is proved to exhibit \emph{global convergence} behavior, i.e., for any random initialization, the sequence (subsequence) of iterates generated by the algorithm converges to a stationary point of the d.c. program. The performance of the algorithm is empirically demonstrated on both sparse PCA (finding few relevant genes that explain as much variance as possible in a high-dimensional gene dataset) and sparse CCA (cross-language document retrieval and vocabulary selection for music retrieval) applications.Comment: 40 page

arXiv.org e-Print Archive

CiteSeerX

Truncated Inference for Latent Variable Optimization Problems: Application to Robust Estimation and Learning

Author: AL Yuille
AP Dempster
B Scellier
C Zach
CJ Wu
CL Byrne
D Geman
DP Bertsekas
DR Hunter
F Scarselli
FJ Pineda
J Mairal
K Lange
M Razaviyayn
P Dvurechensky
PF Felzenszwalb
RM Neal
S Agarwal
SJ Wright
X Xie
Y LeCun
ZQ Luo
Publication venue
Publication date: 01/01/2020
Field of study

Optimization problems with an auxiliary latent variable structure in addition to the main model parameters occur frequently in computer vision and machine learning. The additional latent variables make the underlying optimization task expensive, either in terms of memory (by maintaining the latent variables), or in terms of runtime (repeated exact inference of latent variables). We aim to remove the need to maintain the latent variables and propose two formally justified methods, that dynamically adapt the required accuracy of latent variable inference. These methods have applications in large scale robust estimation and in learning energy-based models from labeled data.Comment: 16 page

arXiv.org e-Print Archive

Crossref

Chalmers Research