Search CORE

1,513 research outputs found

Stochastic Optimization for Non-convex Problem with Inexact Hessian Matrix, Gradient, and Function

Author: Hsieh Cho-Jui
Liu Liu
Liu Xuanqing
Tao Dacheng
Publication venue
Publication date: 18/10/2023
Field of study

Trust-region (TR) and adaptive regularization using cubics (ARC) have proven to have some very appealing theoretical properties for non-convex optimization by concurrently computing function value, gradient, and Hessian matrix to obtain the next search direction and the adjusted parameters. Although stochastic approximations help largely reduce the computational cost, it is challenging to theoretically guarantee the convergence rate. In this paper, we explore a family of stochastic TR and ARC methods that can simultaneously provide inexact computations of the Hessian matrix, gradient, and function values. Our algorithms require much fewer propagations overhead per iteration than TR and ARC. We prove that the iteration complexity to achieve

\epsilon

-approximate second-order optimality is of the same order as the exact computations demonstrated in previous studies. Additionally, the mild conditions on inexactness can be met by leveraging a random sampling technology in the finite-sum minimization problem. Numerical experiments with a non-convex problem support these findings and demonstrate that, with the same or a similar number of iterations, our algorithms require less computational overhead per iteration than current second-order methods.Comment: arXiv admin note: text overlap with arXiv:1809.0985

arXiv.org e-Print Archive

Newton-Type Methods for Non-Convex Optimization Under Inexact Hessian Information

Author: Mahoney Michael W.
Roosta Fred
Xu Peng
Publication venue
Publication date: 14/05/2019
Field of study

We consider variants of trust-region and cubic regularization methods for non-convex optimization, in which the Hessian matrix is approximated. Under mild conditions on the inexact Hessian, and using approximate solution of the corresponding sub-problems, we provide iteration complexity to achieve

\epsilon

-approximate second-order optimality which have shown to be tight. Our Hessian approximation conditions constitute a major relaxation over the existing ones in the literature. Consequently, we are able to show that such mild conditions allow for the construction of the approximate Hessian through various random sampling methods. In this light, we consider the canonical problem of finite-sum minimization, provide appropriate uniform and non-uniform sub-sampling strategies to construct such Hessian approximations, and obtain optimal iteration complexity for the corresponding sub-sampled trust-region and cubic regularization methods.Comment: 32 page

arXiv.org e-Print Archive

University of Queensland eSpace