Search CORE

2,479 research outputs found

On the Convergence of (Stochastic) Gradient Descent with Extrapolation for Non-Convex Optimization

Author: Jin Rong
Xu Yi
Yang Sen
Yang Tianbao
Yuan Zhuoning
Publication venue
Publication date: 05/02/2019
Field of study

Extrapolation is a well-known technique for solving convex optimization and variational inequalities and recently attracts some attention for non-convex optimization. Several recent works have empirically shown its success in some machine learning tasks. However, it has not been analyzed for non-convex minimization and there still remains a gap between the theory and the practice. In this paper, we analyze gradient descent and stochastic gradient descent with extrapolation for finding an approximate first-order stationary point in smooth non-convex optimization problems. Our convergence upper bounds show that the algorithms with extrapolation can be accelerated than without extrapolation

arXiv.org e-Print Archive

Crossref

A Smooth Primal-Dual Optimization Framework for Nonsmooth Composite Convex Minimization

Author: Cevher Volkan
Fercoq Olivier
Tran-Dinh Quoc
Publication venue
Publication date: 22/02/2018
Field of study

We propose a new first-order primal-dual optimization framework for a convex optimization template with broad applications. Our optimization algorithms feature optimal convergence guarantees under a variety of common structure assumptions on the problem template. Our analysis relies on a novel combination of three classic ideas applied to the primal-dual gap function: smoothing, acceleration, and homotopy. The algorithms due to the new approach achieve the best known convergence rate results, in particular when the template consists of only non-smooth functions. We also outline a restart strategy for the acceleration to significantly enhance the practical performance. We demonstrate relations with the augmented Lagrangian method and show how to exploit the strongly convex objectives with rigorous convergence rate guarantees. We provide numerical evidence with two examples and illustrate that the new methods can outperform the state-of-the-art, including Chambolle-Pock, and the alternating direction method-of-multipliers algorithms.Comment: 35 pages, accepted for publication on SIAM J. Optimization. Tech. Report, Oct. 2015 (last update Sept. 2016

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

Generalized Forward-Backward Splitting

Author: Bricen͂o-Arias L. M.
Combettes P. L.
Combettes P. L.
Gabay D.
Gabriel Peyré
Hugo Raguet
Jalal Fadili
Korpelevich G. M.
Lemaire B.
Moreau J.-J.
Pesquet J.-C.
Popov L. D.
Zarantonello E. H.
Publication venue: 'Society for Industrial & Applied Mathematics (SIAM)'
Publication date: 12/01/2012
Field of study

This paper introduces the generalized forward-backward splitting algorithm for minimizing convex functions of the form

F + \sum_{i=1}^n G_i

, where

F

has a Lipschitz-continuous gradient and the

G_i

's are simple in the sense that their Moreau proximity operators are easy to compute. While the forward-backward algorithm cannot deal with more than

n = 1

non-smooth function, our method generalizes it to the case of arbitrary

n

. Our method makes an explicit use of the regularity of

F

in the forward step, and the proximity operators of the

G_i

's are applied in parallel in the backward step. This allows the generalized forward backward to efficiently address an important class of convex problems. We prove its convergence in infinite dimension, and its robustness to errors on the computation of the proximity operators and of the gradient of

F

. Examples on inverse problems in imaging demonstrate the advantage of the proposed methods in comparison to other splitting algorithms.Comment: 24 pages, 4 figure

arXiv.org e-Print Archive

HAL - Normandie Université

Base de publications de l'université Paris-Dauphine

CiteSeerX

Crossref