Riemannian Adaptive Regularized Newton Methods with H\"older Continuous Hessians
This paper presents strong worst-case iteration and operation complexity
guarantees for Riemannian adaptive regularized Newton methods, a unified
framework encompassing both Riemannian adaptive regularization (RAR) methods
and Riemannian trust region (RTR) methods. We comprehensively characterize the
sources of approximation in second-order manifold optimization methods: the
objective function's smoothness, retraction's smoothness, and subproblem
solver's inexactness. Specifically, for a function whose Hessian is H\"older
continuous, when equipped with a retraction whose differential is H\"older
continuous and an inexact subproblem solver, both RTR and RAR with a matching
regularization exponent locate an approximate second-order stationary point
within numbers of iterations and Hessian-vector products that are bounded
explicitly in terms of the target accuracy and these smoothness and inexactness
parameters. These complexity results are novel and sharp, and they recover the
familiar iteration and operation complexities of second-order methods when the
Hessian is Lipschitz continuous.
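For reference, and not reproduced from the paper itself, the subproblem that adaptive regularization methods of this kind solve at an iterate $x_k$ on a manifold $\mathcal{M}$, when the Hessian is H\"older continuous with exponent $\nu$, typically takes the form
\[
  \min_{s \in T_{x_k}\mathcal{M}} \ \langle \operatorname{grad} f(x_k), s \rangle
  + \tfrac{1}{2} \langle \operatorname{Hess} f(x_k)[s], s \rangle
  + \tfrac{\sigma_k}{2+\nu} \, \|s\|^{2+\nu},
\]
where $\sigma_k > 0$ is the adaptively updated regularization weight; the paper's exact regularization exponent and inexactness tolerances are not reproduced here.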
An accelerated first-order method with complexity analysis for solving cubic regularization subproblems
We propose a first-order method to solve the cubic regularization subproblem
(CRS) based on a novel reformulation. The reformulation is a constrained convex
optimization problem whose feasible region admits an easily computable
projection. Our reformulation requires computing the minimum eigenvalue of the
Hessian. To avoid the expensive computation of the exact minimum eigenvalue, we
develop a surrogate problem to the reformulation where the exact minimum
eigenvalue is replaced with an approximate one. We then apply first-order
methods such as Nesterov's accelerated projected gradient method (APG) and the
projected Barzilai-Borwein method to solve the surrogate problem. As our main
theoretical contribution, we show that when an approximate minimum eigenvalue
is computed by the Lanczos method and the surrogate problem is approximately
solved by APG, our approach returns an approximate solution to CRS within a
number of matrix-vector multiplications that we bound explicitly, up to
logarithmic factors. Numerical experiments
show that our methods are comparable to and outperform the Krylov subspace
method in the easy and hard cases, respectively. We further implement our
methods as subproblem solvers of adaptive cubic regularization methods, and
numerical results show that our algorithms are comparable to the
state-of-the-art algorithms.
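As a minimal illustration of the two ingredients named above, and not of the authors' reformulation or surrogate problem, the sketch below uses SciPy's Lanczos-based eigensolver to estimate the minimum eigenvalue of the Hessian and then runs plain gradient descent directly on the CRS objective; the problem data, step size, and iteration count are placeholder choices.

import numpy as np
from scipy.sparse.linalg import eigsh

def crs_value_and_grad(x, H, g, sigma):
    """CRS objective m(x) = g^T x + 0.5 x^T H x + (sigma/3) ||x||^3 and its gradient."""
    nx = np.linalg.norm(x)
    val = g @ x + 0.5 * x @ (H @ x) + (sigma / 3.0) * nx**3
    grad = g + H @ x + sigma * nx * x
    return val, grad

rng = np.random.default_rng(0)
n = 200
M = rng.standard_normal((n, n))
H = (M + M.T) / 2.0                 # symmetric, possibly indefinite Hessian
g = rng.standard_normal(n)
sigma = 1.0

# Lanczos (ARPACK) estimate of the minimum eigenvalue; in the paper this
# quantity feeds the surrogate problem, which is not reproduced here.
lam_min = eigsh(H, k=1, which='SA', return_eigenvectors=False)[0]
print(f"approximate minimum eigenvalue: {lam_min:.4f}")

# Plain gradient descent on the CRS objective, purely to illustrate the setting.
x = np.zeros(n)
step = 1.0 / (np.linalg.norm(H, 2) + sigma)   # crude step size, illustration only
for _ in range(500):
    _, grad = crs_value_and_grad(x, H, g, sigma)
    x -= step * grad
final_val, _ = crs_value_and_grad(x, H, g, sigma)
print(f"CRS objective after gradient descent: {final_val:.4f}")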
DC Algorithm for Sample Average Approximation of Chance Constrained Programming: Convergence and Numerical Results
Chance constrained programming refers to an optimization problem with
uncertain constraints that must be satisfied with at least a prescribed
probability level. In this work, we study a class of structured chance
constrained programs in the data-driven setting, where the objective function
is a difference-of-convex (DC) function and the functions in the chance
constraint are all convex. By exploiting the structure, we reformulate it into
a DC constrained DC program. Then, we propose a proximal DC algorithm (pDCA) for
solving the reformulation. Moreover, we prove the convergence of the proposed
algorithm based on the Kurdyka-\L ojasiewicz property and derive the iteration
complexity for finding an approximate KKT point. We point out that the proposed
pDCA and its associated analysis apply to general DC constrained DC programs,
which may be of independent interest. To support and complement our
theoretical development, we show via numerical experiments that our proposed
approach is competitive with a host of existing approaches.
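For readers unfamiliar with DC programming, the sketch below shows a bare-bones proximal DC iteration on an unconstrained toy problem min_x g(x) - h(x); the chance-constraint reformulation and the DC-constrained subproblems analyzed in the paper are not reproduced, and the choices of g, h, and the proximal parameter are placeholders picked so that each subproblem has a closed-form solution.

import numpy as np

# Toy DC decomposition: minimize F(x) = g(x) - h(x), with
#   g(x) = 0.5 * ||x - a||^2   (convex and smooth)
#   h(x) = ||x||_1             (convex; sign(x) is a subgradient)
rng = np.random.default_rng(0)
n = 50
a = rng.standard_normal(n)
beta = 1.0                      # proximal parameter (placeholder)

x = np.zeros(n)
for _ in range(200):
    xi = np.sign(x)             # subgradient of h at the current iterate
    # Proximal DC step: x+ = argmin_y g(y) - <xi, y> + (beta/2) ||y - x||^2,
    # available in closed form here because g is quadratic.
    x = (a + xi + beta * x) / (1.0 + beta)

final_value = 0.5 * np.sum((x - a) ** 2) - np.sum(np.abs(x))
print("final objective value:", final_value)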
Penalty-based Methods for Simple Bilevel Optimization under H\"{o}lderian Error Bounds
This paper investigates simple bilevel optimization problems where the
upper-level objective minimizes a composite convex function over the optimal
solutions of a composite convex lower-level problem. Existing methods for such
problems either only guarantee asymptotic convergence, have slow sublinear
rates, or require strong assumptions. To address these challenges, we develop a
novel penalty-based approach that employs the accelerated proximal gradient
(APG) method. Under a H\"{o}lderian error bound condition on the lower-level
objective, our algorithm attains a solution that is approximately optimal for
both the upper- and lower-level objectives, to any prescribed accuracies,
within a number of iterations bounded explicitly in terms of those accuracies,
the error-bound exponent, the Lipschitz constant of the upper-level objective,
and the Lipschitz constants of the gradients of the smooth parts of the upper-
and lower-level objectives. If the smooth part of the upper-level
objective is strongly convex, the result improves further. We also establish
complexity results when both the upper- and lower-level objectives are general
convex nonsmooth functions. Numerical experiments demonstrate the effectiveness
of our algorithms.
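To make the penalty idea concrete, here is a minimal sketch, on a made-up smooth instance, of running a FISTA-style accelerated gradient loop on the penalized objective f(x) + rho * g(x), with f playing the role of the upper-level objective and g the lower-level one; the paper's penalty schedule, proximal steps, and accuracy guarantees are not reproduced, and f, g, and rho below are illustrative placeholders.

import numpy as np

rng = np.random.default_rng(0)
n, m = 50, 30
A = rng.standard_normal((m, n))
b = rng.standard_normal(m)
c = rng.standard_normal(n)

f = lambda x: 0.5 * np.sum((x - c) ** 2)          # upper-level objective (placeholder)
g = lambda x: 0.5 * np.sum((A @ x - b) ** 2)      # lower-level objective (placeholder)
grad_pen = lambda x, rho: (x - c) + rho * (A.T @ (A @ x - b))

rho = 100.0                                       # fixed penalty weight (placeholder)
L = 1.0 + rho * np.linalg.norm(A, 2) ** 2         # Lipschitz constant of the penalized gradient
step = 1.0 / L

# FISTA-style accelerated gradient method on the penalized objective f + rho * g.
x = np.zeros(n)
y = x.copy()
t = 1.0
for _ in range(1000):
    x_new = y - step * grad_pen(y, rho)
    t_new = 0.5 * (1.0 + np.sqrt(1.0 + 4.0 * t * t))
    y = x_new + ((t - 1.0) / t_new) * (x_new - x)
    x, t = x_new, t_new

print("upper-level value:", f(x), " lower-level value:", g(x))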