Efficient First Order Methods for Linear Composite Regularizers

Author: Argyriou Andreas
Micchelli Charles A.
Pontil Massimiliano
Shen Lixin
Xu Yuesheng
Publication date: 01/01/2011

A wide class of regularization problems in machine learning and statistics employs a regularization term which is obtained by composing a simple convex function \omega with a linear transformation. This setting includes Group Lasso methods, the Fused Lasso and other total variation methods, multi-task learning methods and many more. In this paper, we present a general approach for computing the proximity operator of this class of regularizers, under the assumption that the proximity operator of the function \omega is known in advance. Our approach builds on a recent line of research on optimal first order optimization methods and uses fixed point iterations for numerically computing the proximity operator. It is more general than current approaches and, as we show with numerical simulations, computationally more efficient than available first order methods which do not achieve the optimal rate. In particular, our method outperforms state of the art O(1/T) methods for overlapping Group Lasso and matches optimal O(1/T^2) methods for the Fused Lasso and tree structured Group Lasso.

Comment: 19 pages, 8 figures
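The abstract takes the proximity operator of the simple function \omega as known in advance. As an illustration of what "known" can mean, here is a minimal Python sketch of two standard closed-form proximity operators: soft-thresholding for the \ell_1 norm and block soft-thresholding for the Euclidean norm used in Group Lasso. The function names `prox_l1` and `prox_group_l2` are hypothetical, and this is not the paper's fixed point scheme for the composed regularizer; it only shows the building block that scheme assumes.

```python
import math


def prox_l1(x, t):
    """Proximity operator of t * ||.||_1 (soft-thresholding), applied per coordinate."""
    # Shrink each entry toward zero by t; entries with |x_i| <= t become zero.
    return [math.copysign(max(abs(xi) - t, 0.0), xi) for xi in x]


def prox_group_l2(x, t):
    """Proximity operator of t * ||.||_2 (block soft-thresholding) for one group."""
    norm = math.sqrt(sum(xi * xi for xi in x))
    if norm <= t:
        # The whole group is set to zero, which is what produces group sparsity.
        return [0.0] * len(x)
    scale = 1.0 - t / norm
    return [scale * xi for xi in x]


if __name__ == "__main__":
    print(prox_l1([3.0, -0.5, 1.0], 1.0))
    print(prox_group_l2([3.0, 4.0], 1.0))
```

Both operators cost one pass over the vector, which is why a method that only needs calls to the proximity operator of \omega can stay cheap per iteration.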

arXiv.org e-Print Archive

CiteSeerX

UCL Discovery

Syracuse University Research Facility and Collaborative Environment

Efficient First Order Methods for Linear Composite Regularizers

Author: Argyriou Andreas
Micchelli Charles A.
Pontil Massimiliano
Shen Lixin
Xu Yuesheng
Publication venue: SURFACE at Syracuse University
Publication date: 07/04/2011

A wide class of regularization problems in machine learning and statistics employs a regularization term which is obtained by composing a simple convex function \omega with a linear transformation. This setting includes Group Lasso methods, the Fused Lasso and other total variation methods, multi-task learning methods and many more. In this paper, we present a general approach for computing the proximity operator of this class of regularizers, under the assumption that the proximity operator of the function \omega is known in advance. Our approach builds on a recent line of research on optimal first order optimization methods and uses fixed point iterations for numerically computing the proximity operator. It is more general than current approaches and, as we show with numerical simulations, computationally more efficient than available first order methods which do not achieve the optimal rate. In particular, our method outperforms state of the art O(1/T) methods for overlapping Group Lasso and matches optimal O(1/T^2) methods for the Fused Lasso and tree structured Group Lasso.

Syracuse University Research Facility and Collaborative Environment

An Efficient Primal-Dual Prox Method for Non-Smooth Optimization

Author: Jin Rong
Mahdavi Mehrdad
Yang Tianbao
Zhu Shenghuo
Publication date: 26/07/2013

We study non-smooth optimization problems in machine learning, where both the loss function and the regularizer are non-smooth functions. Previous studies on efficient empirical loss minimization assume either a smooth loss function or a strongly convex regularizer, making them unsuitable for non-smooth optimization. We develop a simple yet efficient method for a family of non-smooth optimization problems where the dual form of the loss function is bilinear in primal and dual variables. We cast a non-smooth optimization problem into a minimax optimization problem, and develop a primal dual prox method that solves the minimax optimization problem at a rate of O(1/T) (assuming that the proximal step can be efficiently solved), significantly faster than a standard subgradient descent method that has an O(1/\sqrt{T}) convergence rate. Our empirical study verifies the efficiency of the proposed method for various non-smooth optimization problems that arise ubiquitously in machine learning by comparing it to the state-of-the-art first order methods.
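To make "the dual form of the loss function is bilinear in primal and dual variables" concrete, the sketch below uses the hinge loss as an assumed example (chosen here for illustration, not taken from the listing): max(0, 1 - y<w, x>) equals the maximum over \alpha in [0, 1] of \alpha(1 - y<w, x>), and the coupling term \alpha y<w, x> is bilinear in w and \alpha. The function names are hypothetical, and the code only checks the identity; it does not implement the primal dual prox method.

```python
def hinge(w, x, y):
    """Hinge loss max(0, 1 - y * <w, x>) in its usual primal form."""
    margin = y * sum(wi * xi for wi, xi in zip(w, x))
    return max(0.0, 1.0 - margin)


def hinge_dual_form(w, x, y):
    """Same loss written as a maximum over the dual variable alpha in [0, 1]."""
    margin = y * sum(wi * xi for wi, xi in zip(w, x))
    # The objective alpha * (1 - margin) is linear in alpha, so its maximum
    # over the interval [0, 1] is attained at one of the two endpoints.
    return max(alpha * (1.0 - margin) for alpha in (0.0, 1.0))


if __name__ == "__main__":
    w, x = [1.0, -2.0], [0.5, 0.5]
    print(hinge(w, x, 1.0), hinge_dual_form(w, x, 1.0))
```

Writing the loss this way turns minimization over w into a minimax problem over (w, \alpha), which is the reformulation the abstract describes.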

arXiv.org e-Print Archive

CiteSeerX

DC Proximal Newton for Non-Convex Optimization Problems

Author: Flamary Remi
Gasso Gilles
Rakotomamonjy Alain
Publication date: 01/01/2015

We introduce a novel algorithm for solving learning problems where both the loss function and the regularizer are non-convex but belong to the class of difference of convex (DC) functions. Our contribution is a new general purpose proximal Newton algorithm that is able to deal with such a situation. The algorithm consists in obtaining a descent direction from an approximation of the loss function and then in performing a line search to ensure sufficient descent. A theoretical analysis is provided showing that the iterates of the proposed algorithm admit as limit points stationary points of the DC objective function. Numerical experiments show that our approach is more efficient than the current state of the art for a problem with a convex loss function and a non-convex regularizer. We have also illustrated the benefit of our algorithm in a high-dimensional transductive learning problem where both the loss function and the regularizer are non-convex.
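For readers unfamiliar with the class of DC functions, a small Python sketch of one standard example may help. The capped-\ell_1 penalty min(|w|, \theta) is non-convex, yet it equals the difference of two convex functions, g(w) = |w| and h(w) = max(|w| - \theta, 0). The choice of this penalty and the names `capped_l1` and `dc_parts` are assumptions made for illustration; the listing does not say which regularizers the paper uses.

```python
def capped_l1(w, theta):
    """Non-convex capped-l1 penalty min(|w|, theta) for a scalar w."""
    return min(abs(w), theta)


def dc_parts(w, theta):
    """Return the convex pair (g, h) with capped_l1(w, theta) == g - h."""
    g = abs(w)                     # convex
    h = max(abs(w) - theta, 0.0)   # convex: maximum of two convex functions
    return g, h


if __name__ == "__main__":
    for w in (-4.0, 0.3, 2.5):
        g, h = dc_parts(w, 1.0)
        print(w, capped_l1(w, 1.0), g - h)
```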

arXiv.org e-Print Archive

HAL - Normandie Université

Low Complexity Regularization of Linear Inverse Problems

Author: A. Girard
A. Barron
A. Beck
A. Chambolle
A. Daniilidis
A. Montanari
A.N. Tikhonov
A.S. Bandeira
A.S. Lewis
B. Efron
B. Recht
B.A. Turlach
B.C. Vũ
B.D. Rao
B.F. Svaiter
B.K. Natarajan
B.S. Mordukhovich
C. Chaux
C. Deledalle
C. Dossal
C. Lemaréchal
C. Vonesch
C.-A. Deledalle
C.L. Mallows
C.M. Stein
D. Gabay
D. Gross
D. Needell
D. Van De Ville
D.A. Lorenz
D.A. Spielman
D.L. Donoho
E. Grave
E. Hale
E. Harchaoui
E.J. Candès
E. Richard
F. Bach
F. Luisier
F. Santosa
G. Chen
G. Davis
G. Obozinski
G. Peyré
G. Steidl
G.B. Passty
G.H. Golub
H. Akaike
H. Jégou
H. Raguet
H. Zou
H.H. Bauschke
H.L. Taylor
H.M. Hudson
I. Daubechies
J. Allen
J. Bolte
J. Chen
J. Douglas
J. Eckstein
J. Mairal
J. Tropp
J. Ye
J.-B. Hiriart-Urruty
J.-C. Pesquet
J.-F. Aujol
J.-J. Fuchs
J.-L. Starck
J.A. Tropp
J.C. Dunn
J.E. Vogt
J.F. Claerbout
J.M. Lee
K. Bredies
K. Kato
K. Knight
K.-C. Li
L. Birgé
L. Borup
L. Condat
L.I. Rudin
L.M. Briceño Arias
M. Coste
M. Elad
M. Fazel
M. Fortin
M. Frank
M. Golbabaee
M. Grasmair
M. Jaggi
M. Meyer
M. Rudelson
M. Yuan
M.J. Wainwright
M.R. Osborne
M.V. Solodov
N. Parikh
O. Scherzer
P. Hall
P. Tseng
P. Zhao
P.L. Combettes
P.L. Lions
R. Ciak
R. Giryes
R. Glowinski
R. Gribonval
R. Jenatton
R. Refregier
R. Tibshirani
R.J. Tibshirani
R.L. Dykstra
R.T. Rockafellar
S. Nam
S. Negahban
S. Ramani
S. Shalev-Shwartz
S. Vaiter
S.F. Cotter
S.G. Lingala
S.G. Mallat
S.J. Wright
S.N. Negahban
S.P. Boyd
S.S. Chen
T. Blu
T. Blumensath
T. Strohmer
T.T. Cai
V. Chandrasekaran
V. Duval
V. Solo
W.L. Hare
X. Shen
Y. de Castro
Y. Censor
Y. Chen
Y. Lyubarskii
Y. Nesterov
Y.C. Eldar
Y.C. Pati
Publication date: 01/01/2014

Inverse problems and regularization theory is a central theme in contemporary signal processing, where the goal is to reconstruct an unknown signal from partial, indirect, and possibly noisy measurements of it. A now standard method for recovering the unknown signal is to solve a convex optimization problem that enforces some prior knowledge about its structure. This has proved efficient in many problems routinely encountered in imaging sciences, statistics and machine learning. This chapter delivers a review of recent advances in the field where the regularization prior promotes solutions conforming to some notion of simplicity/low-complexity. These priors encompass as popular examples sparsity and group sparsity (to capture the compressibility of natural signals and images), total variation and analysis sparsity (to promote piecewise regularity), and low-rank (as natural extension of sparsity to matrix-valued data). Our aim is to provide a unified treatment of all these regularizations under a single umbrella, namely the theory of partial smoothness. This framework is very general and accommodates all low-complexity regularizers just mentioned, as well as many others. Partial smoothness turns out to be the canonical way to encode low-dimensional models that can be linear spaces or more general smooth manifolds. This review is intended to serve as a one stop shop toward the understanding of the theoretical properties of the so-regularized solutions. It covers a large spectrum including: (i) recovery guarantees and stability to noise, both in terms of \ell^2-stability and model (manifold) identification; (ii) sensitivity analysis to perturbations of the parameters involved (in particular the observations), with applications to unbiased risk estimation; (iii) convergence properties of the forward-backward proximal splitting scheme, that is particularly well suited to solve the corresponding large-scale regularized optimization problem.
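The forward-backward proximal splitting scheme mentioned in item (iii) alternates a gradient step on the smooth data-fidelity term with a proximal step on the regularizer. The following is a minimal pure-Python sketch for one assumed instance, the \ell_1-regularized least-squares problem min_x 0.5 ||Ax - y||^2 + \lambda ||x||_1. The function names, the fixed iteration count, and the conservative step size 1 / ||A||_F^2 are illustrative choices, not details taken from the chapter.

```python
def soft_threshold(v, t):
    """Proximity operator of t * ||.||_1, applied coordinate-wise."""
    return [max(abs(vi) - t, 0.0) * (1.0 if vi > 0 else -1.0) for vi in v]


def forward_backward_lasso(A, y, lam, n_iter=200):
    """Forward-backward iterations for 0.5 * ||A x - y||^2 + lam * ||x||_1."""
    m, n = len(A), len(A[0])
    # The squared Frobenius norm upper-bounds the Lipschitz constant of the
    # gradient, so 1 / ||A||_F^2 is a safe (if conservative) step size.
    step = 1.0 / sum(a * a for row in A for a in row)
    x = [0.0] * n
    for _ in range(n_iter):
        residual = [sum(A[i][j] * x[j] for j in range(n)) - y[i] for i in range(m)]
        grad = [sum(A[i][j] * residual[i] for i in range(m)) for j in range(n)]
        # Forward (gradient) step on the smooth term, then backward (proximal) step.
        x = soft_threshold([x[j] - step * grad[j] for j in range(n)], step * lam)
    return x


if __name__ == "__main__":
    identity = [[1.0, 0.0], [0.0, 1.0]]
    print(forward_backward_lasso(identity, [3.0, -0.5], lam=1.0))
```

With A equal to the identity, the minimizer is the soft-thresholded observation, which gives a quick sanity check on the iterations.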

arXiv.org e-Print Archive

HAL - Normandie Université

CiteSeerX

Base de publications de l'université Paris-Dauphine

Crossref

Non-convex regularization in remote sensing

Author: Barlaud Michel
Flamary Remi
Tuia Devis
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 01/01/2016

In this paper, we study the effect of different regularizers and their implications in high dimensional image classification and sparse linear unmixing. Although kernelization or sparse methods are globally accepted solutions for processing data in high dimensions, we present here a study on the impact of the form of regularization used and its parametrization. We consider regularization via traditional squared (\ell_2) and sparsity-promoting (\ell_1) norms, as well as more unconventional nonconvex regularizers (\ell_p and Log Sum Penalty). We compare their properties and advantages on several classification and linear unmixing tasks and provide advice on the choice of the best regularizer for the problem at hand. Finally, we also provide a fully functional toolbox for the community.

Comment: 11 pages, 11 figures
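The regularizers compared in this abstract can be written down in a few lines. The Python sketch below uses common textbook forms: the squared \ell_2 norm, the \ell_1 norm, the \ell_p penalty with 0 < p < 1, and a log sum penalty of the form sum_i log(1 + |w_i| / eps). The exact definitions, scalings, and parameter names in the paper and its toolbox may differ; `eps` and the function names here are assumptions.

```python
import math


def l2_squared(w):
    """Squared l2 norm: smooth, shrinks all coefficients, no sparsity."""
    return sum(wi * wi for wi in w)


def l1(w):
    """l1 norm: convex and sparsity-promoting."""
    return sum(abs(wi) for wi in w)


def lp(w, p):
    """Non-convex l_p penalty, assuming 0 < p < 1."""
    return sum(abs(wi) ** p for wi in w)


def log_sum(w, eps):
    """Non-convex log sum penalty, assuming eps > 0."""
    return sum(math.log(1.0 + abs(wi) / eps) for wi in w)


if __name__ == "__main__":
    w = [1.0, 0.0, -4.0]
    print(l2_squared(w), l1(w), lp(w, 0.5), log_sum(w, 1.0))
```

The non-convex penalties grow more slowly than the \ell_1 norm for large coefficients, so they penalize large entries less while still favoring exact zeros.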

arXiv.org e-Print Archive