Using gradient directions to get global convergence of Newton-type methods
The renewed interest in Steepest Descent (SD) methods following the work of
Barzilai and Borwein [IMA Journal of Numerical Analysis, 8 (1988)] has driven
us to consider a globalization strategy based on SD, which is applicable to any
line-search method. In particular, we combine Newton-type directions with
scaled SD steps to have suitable descent directions. Scaling the SD directions
with a suitable step length makes a significant difference with respect to
similar globalization approaches, in terms of both theoretical features and
computational behavior. We apply our strategy to Newton's method and the BFGS
method, with computational results that appear interesting compared with the
results of well-established globalization strategies devised ad hoc for those
methods.
Comment: 22 pages, 11 figures
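The strategy described above can be illustrated with a minimal sketch: take the Newton direction when it is a suitable descent direction, and otherwise fall back to a steepest-descent step scaled by a Barzilai-Borwein-like step length. The function name, the fallback test, and the specific scaling are illustrative assumptions, not the paper's exact algorithm.

```python
import numpy as np

def globalized_newton(f, grad, hess, x0, tol=1e-8, max_iter=100):
    """Sketch: Newton's method with a scaled steepest-descent (SD)
    fallback. When the Newton system fails or its solution is not a
    descent direction, use -alpha * g with a Barzilai-Borwein-like
    step length alpha (an illustrative choice, not the paper's)."""
    x = x0.astype(float)
    x_prev, g_prev = None, None
    for _ in range(max_iter):
        g = grad(x)
        if np.linalg.norm(g) < tol:
            break
        try:
            d = np.linalg.solve(hess(x), -g)
            if g @ d >= -1e-10 * np.linalg.norm(g) * np.linalg.norm(d):
                raise np.linalg.LinAlgError  # not a descent direction
        except np.linalg.LinAlgError:
            # scaled SD fallback: BB step length alpha = (s'y)/(y'y)
            if x_prev is not None:
                s, y = x - x_prev, g - g_prev
                alpha = max(s @ y, 1e-12) / max(y @ y, 1e-12)
            else:
                alpha = 1.0 / max(np.linalg.norm(g), 1.0)
            d = -alpha * g
        # backtracking Armijo line search
        t = 1.0
        while f(x + t * d) > f(x) + 1e-4 * t * (g @ d):
            t *= 0.5
        x_prev, g_prev = x, g
        x = x + t * d
    return x
```

On a strongly convex quadratic the Newton branch is always taken and the method converges in one step; the SD branch only matters when the Hessian is singular or indefinite.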
Newton-MR: Inexact Newton Method With Minimum Residual Sub-problem Solver
We consider a variant of the inexact Newton method, called Newton-MR, in
which the least-squares sub-problems are solved approximately using the
minimum residual (MINRES) method. By construction, Newton-MR can be readily
applied for unconstrained
optimization of a class of non-convex problems known as invex, which subsumes
convexity as a sub-class. For invex optimization, instead of the classical
Lipschitz continuity assumptions on gradient and Hessian, Newton-MR's global
convergence can be guaranteed under a weaker notion of joint regularity of
Hessian and gradient. We also obtain Newton-MR's problem-independent local
convergence to the set of minima. We show that fast local/global convergence
can be guaranteed under a novel inexactness condition, which, to our
knowledge, is much weaker than those in prior related work. Numerical
results demonstrate the
performance of Newton-MR as compared with several other Newton-type
alternatives on a few machine learning problems.
Comment: 35 pages
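The core idea, an inexact Newton iteration whose sub-problem is solved with MINRES, can be sketched as follows. MINRES only requires a symmetric (possibly singular or indefinite) Hessian, which is what makes the approach applicable beyond convexity. The step-size rule and inexactness tolerance here are simplified assumptions, not the paper's conditions.

```python
import numpy as np
from scipy.sparse.linalg import minres

def newton_mr(grad, hess, x0, tol=1e-8, max_iter=50):
    """Sketch: inexact Newton iteration solving H d = -g approximately
    with MINRES. MR-type Newton variants monitor the gradient norm
    rather than the objective, so the backtracking below checks for a
    decrease of ||grad|| (a simplified stand-in for the paper's rule)."""
    x = x0.astype(float)
    for _ in range(max_iter):
        g = grad(x)
        gnorm = np.linalg.norm(g)
        if gnorm < tol:
            break
        # approximate solution of the symmetric system H d = -g
        d, _ = minres(hess(x), -g, maxiter=100)
        t = 1.0
        while (np.linalg.norm(grad(x + t * d)) > (1 - 1e-4 * t) * gnorm
               and t > 1e-8):
            t *= 0.5
        x = x + t * d
    return x
```

For a symmetric positive definite quadratic this reduces to (inexact) Newton's method and converges in a couple of iterations.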
Practical Inexact Proximal Quasi-Newton Method with Global Complexity Analysis
Recently, several methods were proposed for sparse optimization which make
careful use of second-order information [10, 28, 16, 3] to improve local
convergence rates. These methods construct a composite quadratic approximation
using Hessian information, optimize this approximation using a first-order
method, such as coordinate descent, and employ a line search to ensure
sufficient descent. Here we propose a general framework, which includes
slightly modified versions of existing algorithms and also a new algorithm,
which uses limited memory BFGS Hessian approximations, and provide a novel
global convergence rate analysis, which covers methods that solve
subproblems via coordinate descent.
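The proximal quasi-Newton step described above can be sketched in its simplest instance: for min f(x) + lam*||x||_1, use a scalar (diagonal) Hessian approximation so the composite quadratic subproblem has a closed-form soft-thresholding solution. The framework in the abstract allows L-BFGS approximations and coordinate descent on the subproblem; the scalar model, BB-style curvature update, and sufficient-decrease test below are illustrative simplifications.

```python
import numpy as np

def prox_quasi_newton_l1(f, grad, x0, lam, n_iter=200):
    """Sketch: proximal quasi-Newton for f(x) + lam*||x||_1 with a
    scalar curvature estimate h, so the subproblem
        min_z  g'(z - x) + (h/2)||z - x||^2 + lam*||z||_1
    is solved exactly by soft-thresholding (no coordinate descent
    needed in this simplified instance)."""
    x = x0.astype(float)
    h = 1.0  # scalar Hessian approximation
    F = lambda v: f(v) + lam * np.abs(v).sum()
    for _ in range(n_iter):
        g = grad(x)
        u = x - g / h
        z = np.sign(u) * np.maximum(np.abs(u) - lam / h, 0.0)
        d = z - x
        # backtracking line search for sufficient composite descent
        t = 1.0
        while F(x + t * d) > F(x) - 1e-4 * t * h * (d @ d) and t > 1e-10:
            t *= 0.5
        x_new = x + t * d
        # BB-like scalar curvature update for the next model
        s, y = x_new - x, grad(x_new) - g
        if s @ s > 0 and s @ y > 0:
            h = (s @ y) / (s @ s)
        x = x_new
    return x
```

With a separable quadratic loss the method recovers the soft-thresholding solution exactly, which makes it easy to sanity-check.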
Newton-Type Methods for Non-Convex Optimization Under Inexact Hessian Information
We consider variants of trust-region and cubic regularization methods for
non-convex optimization, in which the Hessian matrix is approximated. Under
mild conditions on the inexact Hessian, and using approximate solutions of
the corresponding sub-problems, we provide iteration complexity to achieve
ε-approximate second-order optimality, which has been shown to be tight.
Our Hessian approximation conditions constitute a major relaxation over the
existing ones in the literature. Consequently, we are able to show that such
mild conditions allow for the construction of the approximate Hessian through
various random sampling methods. In this light, we consider the canonical
problem of finite-sum minimization, provide appropriate uniform and non-uniform
sub-sampling strategies to construct such Hessian approximations, and obtain
optimal iteration complexity for the corresponding sub-sampled trust-region and
cubic regularization methods.
Comment: 32 pages
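The uniform sub-sampling construction mentioned above can be sketched directly: for a finite-sum objective f(x) = (1/n) * sum_i f_i(x), approximate the Hessian by averaging the Hessians of a random subset of the terms. The `hess_i(i, x)` interface is a hypothetical convention for this sketch; the sample-size requirements of the paper are not modeled here.

```python
import numpy as np

def subsampled_hessian(hess_i, n, x, sample_size, rng):
    """Sketch: uniformly sub-sampled Hessian for a finite-sum
    objective f(x) = (1/n) * sum_i f_i(x). Draw `sample_size`
    indices without replacement and average the per-term Hessians;
    the result is an unbiased estimator of the full Hessian."""
    idx = rng.choice(n, size=sample_size, replace=False)
    return sum(hess_i(i, x) for i in idx) / sample_size
```

Such an approximate Hessian would then be handed to the trust-region or cubic-regularization sub-problem solver in place of the exact one; when `sample_size == n` it coincides with the full Hessian.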