202 research outputs found

Hessian barrier algorithms for linearly constrained optimization problems

Author: Bomze Immanuel M.
Mertikopoulos Panayotis
Schachinger Werner
Staudigl Mathias
Publication date: 01/01/2018

In this paper, we propose an interior-point method for linearly constrained optimization problems (possibly nonconvex). The method, which we call the Hessian barrier algorithm (HBA), combines a forward Euler discretization of Hessian Riemannian gradient flows with an Armijo backtracking step-size policy. In this way, HBA can be seen as an alternative to mirror descent (MD), and contains as special cases the affine scaling algorithm, regularized Newton processes, and several other iterative solution methods. Our main result is that, modulo a non-degeneracy condition, the algorithm converges to the problem's set of critical points; hence, in the convex case, the algorithm converges globally to the problem's minimum set. In the case of linearly constrained quadratic programs (not necessarily convex), we also show that the method's convergence rate is $\mathcal{O}(1/k^\rho)$ for some $\rho\in(0,1]$ that depends only on the choice of kernel function (i.e., not on the problem's primitives). These theoretical results are validated by numerical experiments on standard non-convex test functions and large-scale traffic assignment problems.

Comment: 27 pages, 6 figure
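The abstract above names affine scaling as a special case of HBA. As a rough illustration only (not the authors' implementation), the sketch below runs an affine-scaling-type iteration on the positive orthant with Armijo backtracking. It assumes the log-barrier kernel $h(x) = -\sum_i \log x_i$, so that the search direction is $v_i = -x_i^2\,\partial_i f(x)$, and it assumes no equality constraints. The function name `hba_orthant`, the interior safeguard factor 0.9, and the test problem are all hypothetical choices made for this example.

```python
def hba_orthant(f, grad, x0, steps=2000, alpha0=1.0, beta=0.5, sigma=1e-4):
    """Affine-scaling-type descent on {x > 0} with Armijo backtracking.

    Assumption: kernel h(x) = -sum(log x_i), so H(x) = diag(1 / x_i^2)
    and the direction is v = -H(x)^{-1} grad f(x) = -x_i^2 * g_i.
    """
    x = list(x0)
    for _ in range(steps):
        g = grad(x)
        v = [-(xi * xi) * gi for xi, gi in zip(x, g)]
        slope = sum(gi * vi for gi, vi in zip(g, v))  # <grad f, v> <= 0
        if slope > -1e-18:  # direction is numerically zero: stop
            break
        # Safeguard (an assumption of this sketch): cap the step so that
        # the next iterate stays strictly inside the orthant.
        alpha = alpha0
        for xi, vi in zip(x, v):
            if vi < 0.0:
                alpha = min(alpha, 0.9 * xi / -vi)
        # Armijo backtracking: shrink alpha until sufficient decrease holds.
        fx = f(x)
        while alpha > 1e-12 and (
            f([xi + alpha * vi for xi, vi in zip(x, v)])
            > fx + sigma * alpha * slope
        ):
            alpha *= beta
        x = [xi + alpha * vi for xi, vi in zip(x, v)]
    return x


# Hypothetical test problem: minimize 0.5 * ||x - c||^2 subject to x >= 0.
# The constrained minimizer is (1, 0), which lies on the boundary.
c = [1.0, -2.0]
f = lambda x: 0.5 * sum((xi - ci) ** 2 for xi, ci in zip(x, c))
grad = lambda x: [xi - ci for xi, ci in zip(x, c)]
x_final = hba_orthant(f, grad, [0.5, 0.5])
```

In this toy run the second coordinate approaches the boundary only sublinearly (roughly like $1/k$), while the iterates stay strictly positive throughout.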

arXiv.org e-Print Archive

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

MAnnheim DOCument Server

Proceedings of the second "international Traveling Workshop on Interactions between Sparse models and Technology" (iTWIST'14)

Author: Absil P. -A.
Anthoine S.
Bertin N.
Bilen C.
Boumal N.
Boursier Y.
Bundervoet S.
Cambareri V.
Chabiron O.
Chainais P.
Cornelis B.
Dankova M.
Daubechies I.
Daudet L.
Davies M.
De Mol C.
De Vleeschouwer C.
Degraux K.
Determe J. -F.
Dobigeon N.
Dooms A.
Drémeau A.
Dunson D.
Duval V.
Fadili J.
Fawzi A.
Frossard P.
Geelen B.
Gigan S.
Gillis N.
Golbabaee M.
Gribonval R.
Heas P.
Herzet C.
Horlin F.
Jacques L.
Kitic S.
Lafruit G.
Liang J.
Liutkus A.
Loris I.
Louveaux J.
Maggioni M.
Magoarou L. Le
Malgouyres F.
Martina D.
Minsker S.
Mishra B.
Mory C.
Ngole F.
Peyré G.
Pizurica A.
Rajmic P.
Richard C.
Schelkens P.
Schretter C.
Sepulchre R.
Setti G.
Soussen C.
Starck J. -L.
Strawn N.
Sudhakar P.
Tourneret J. -Y.
Vaiter S.
Vandergheynst P.
Vavasis S. A.
Vukobratovic D.
Publication date: 01/10/2014

The implicit objective of the biennial "international Traveling Workshop on Interactions between Sparse models and Technology" (iTWIST) is to foster collaboration between international scientific teams by disseminating ideas through both specific oral/poster presentations and free discussions. For its second edition, the iTWIST workshop took place in the medieval and picturesque town of Namur in Belgium, from Wednesday August 27th till Friday August 29th, 2014. The workshop was conveniently located in "The Arsenal" building within walking distance of both hotels and town center. iTWIST'14 gathered about 70 international participants and featured 9 invited talks, 10 oral presentations, and 14 posters on the following themes, all related to the theory, application and generalization of the "sparsity paradigm": Sparsity-driven data sensing and processing; Union of low dimensional subspaces; Beyond linear and convex inverse problem; Matrix/manifold/graph sensing/processing; Blind inverse problems and dictionary learning; Sparsity and computational neuroscience; Information theory, geometry and randomness; Complexity/accuracy tradeoffs in numerical methods; Sparsity? What's next?; Sparse machine learning and inference.

Comment: 69 pages, 24 extended abstracts, iTWIST'14 website: http://sites.google.com/site/itwist1

arXiv.org e-Print Archive

Global rates of convergence for nonconvex optimization on manifolds

Author: Absil P. -A.
Boumal Nicolas
Cartis Coralia
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2018

We consider the minimization of a cost function $f$ on a manifold $M$ using Riemannian gradient descent and Riemannian trust regions (RTR). We focus on satisfying necessary optimality conditions within a tolerance $\varepsilon$. Specifically, we show that, under Lipschitz-type assumptions on the pullbacks of $f$ to the tangent spaces of $M$, both of these algorithms produce points with Riemannian gradient smaller than $\varepsilon$ in $O(1/\varepsilon^2)$ iterations. Furthermore, RTR returns a point where also the Riemannian Hessian's least eigenvalue is larger than $-\varepsilon$ in $O(1/\varepsilon^3)$ iterations. There are no assumptions on initialization. The rates match their (sharp) unconstrained counterparts as a function of the accuracy $\varepsilon$ (up to constants) and hence are sharp in that sense. These are the first deterministic results for global rates of convergence to approximate first- and second-order Karush-Kuhn-Tucker points on manifolds. They apply in particular for optimization constrained to compact submanifolds of $\mathbb{R}^n$, under simpler assumptions.

Comment: 33 pages, IMA Journal of Numerical Analysis, 201
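To make the first-order stopping criterion in the abstract above concrete, here is a minimal sketch of Riemannian gradient descent on the unit sphere, a compact submanifold of $\mathbb{R}^n$. It is an illustration under assumptions, not the paper's algorithm or step-size rule: the cost $f(x) = x^\top A x$, the fixed step size, the normalization retraction, and all function names are choices made for this example.

```python
import math


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def normalize(u):
    n = math.sqrt(dot(u, u))
    return [a / n for a in u]


def matvec(A, x):
    return [dot(row, x) for row in A]


def riemannian_gd_sphere(A, x0, step=0.1, eps=1e-6, max_iter=10000):
    """Minimize f(x) = x^T A x on the unit sphere.

    Stops once the Riemannian gradient norm is at most eps.
    Returns (x, iterations used, last gradient norm).
    """
    x = normalize(x0)
    gnorm = float("inf")
    for k in range(max_iter):
        egrad = [2.0 * t for t in matvec(A, x)]  # Euclidean gradient 2Ax
        c = dot(x, egrad)
        # Riemannian gradient: project egrad onto the tangent space at x.
        rgrad = [g - c * xi for g, xi in zip(egrad, x)]
        gnorm = math.sqrt(dot(rgrad, rgrad))
        if gnorm <= eps:
            return x, k, gnorm
        # Retraction (assumed here): take the step, then renormalize.
        x = normalize([xi - step * gi for xi, gi in zip(x, rgrad)])
    return x, max_iter, gnorm


# Hypothetical test matrix: eigenvalues 1, 2, 3; the minimizers are +/- e_1.
A = [[1.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 3.0]]
x_star, iters, gnorm = riemannian_gd_sphere(A, [1.0, 1.0, 1.0])
f_star = dot(x_star, matvec(A, x_star))
```

The returned point is an approximate first-order critical point in the sense used above: its Riemannian gradient norm is below the tolerance `eps`.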

arXiv.org e-Print Archive

Oxford University Research Archive

Riemannian Stochastic Gradient Method for Nested Composition Optimization

Author: Tajbakhsh Sam Davanloo
Zhang Dewei
Publication date: 19/07/2022

This work considers the optimization of compositions of functions in a nested form over Riemannian manifolds, where each function contains an expectation. This type of problem is gaining popularity in applications such as policy evaluation in reinforcement learning or model customization in meta-learning. The standard Riemannian stochastic gradient methods for non-compositional optimization cannot be directly applied, as the stochastic approximation of the inner functions creates bias in the gradients of the outer functions. For two-level composition optimization, we present a Riemannian Stochastic Composition Gradient Descent (R-SCGD) method that finds an approximate stationary point, with expected squared Riemannian gradient smaller than $\epsilon$, in $O(\epsilon^{-2})$ calls to the stochastic gradient oracle of the outer function and the stochastic function and gradient oracles of the inner function. Furthermore, we generalize the R-SCGD algorithm to problems with multi-level nested compositional structures, with the same complexity of $O(\epsilon^{-2})$ for the first-order stochastic oracle. Finally, the performance of the R-SCGD method is numerically evaluated on a policy evaluation problem in reinforcement learning.
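The bias mentioned in the abstract above can be seen in a tiny scalar example. The sketch below is a Euclidean toy illustration, not the R-SCGD method: it assumes an outer function $f(u) = u^3$ and an inner function $g(x) = \mathbb{E}[x + \xi]$ with $\xi \in \{-1, +1\}$ equally likely, and it computes the expectations exactly by enumeration. All names and values are hypothetical.

```python
def outer_grad(u):
    # Derivative of the assumed outer function f(u) = u^3.
    return 3.0 * u * u


def inner_sample(x, xi):
    # One stochastic sample of the assumed inner function: g_xi(x) = x + xi.
    return x + xi


noise = [-1.0, 1.0]  # xi takes each value with probability 1/2, so E[xi] = 0
x = 0.5

# Exact inner value g(x) = E[x + xi] = x, and the true gradient of f(g(x)).
g_exact = sum(inner_sample(x, xi) for xi in noise) / len(noise)
true_grad = outer_grad(g_exact) * 1.0  # chain rule with dg/dx = 1

# Naive estimator: plug each inner sample directly into f', then average.
naive_grad = sum(outer_grad(inner_sample(x, xi)) for xi in noise) / len(noise)

# The gap equals 3 * Var(xi): it does not vanish under averaging.
bias = naive_grad - true_grad
```

Because $f'$ is nonlinear, averaging $f'(g_\xi(x))$ over samples does not recover $f'(\mathbb{E}[g_\xi(x)])$, which is why compositional methods treat the inner function value separately from the outer gradient.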

arXiv.org e-Print Archive