3 research outputs found

    Improving the Convergence Rate of One-Point Zeroth-Order Optimization using Residual Feedback

    Many existing zeroth-order optimization (ZO) algorithms adopt two-point feedback schemes due to their faster convergence rates compared with one-point feedback schemes. However, two-point schemes require two evaluations of the objective function at each iteration, which can be impractical in applications where the data are not all available a priori, e.g., in online optimization. In this paper, we propose a novel one-point feedback scheme that queries the function value only once at each iteration and estimates the gradient using the residual between two consecutive feedback points. When optimizing a deterministic Lipschitz function, we show that the query complexity of ZO with the proposed one-point residual feedback matches that of ZO with the existing two-point feedback schemes. Moreover, the query complexity of the proposed algorithm can be further improved when the objective function has a Lipschitz gradient. Then, for stochastic bandit optimization problems, we show that ZO with one-point residual feedback achieves the same convergence rate as ZO with two-point feedback with uncontrollable data samples. We demonstrate the effectiveness of the proposed one-point residual feedback via extensive numerical experiments.
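    The core of the scheme is a gradient estimate built from the residual between two consecutive one-point queries, followed by an ordinary gradient step. The Python sketch below illustrates this idea under stated assumptions: the function name zo_residual_feedback, the Gaussian perturbation directions, and the fixed step size and smoothing radius are illustrative choices, not necessarily the paper's exact ones.

```python
import numpy as np

def zo_residual_feedback(f, x0, delta=0.05, step=0.01, iters=1000, rng=None):
    """One-point residual-feedback ZO descent (illustrative sketch).

    f     : black-box objective, queried ONCE per iteration
    x0    : initial decision variable (1-D numpy array)
    delta : smoothing radius of the gradient estimate
    step  : step size of the gradient update
    """
    rng = np.random.default_rng() if rng is None else rng
    x = np.asarray(x0, dtype=float)
    p = x.size

    # One extra query at initialization supplies the first "previous" feedback value.
    f_prev = f(x + delta * rng.standard_normal(p))

    for _ in range(iters):
        u = rng.standard_normal(p)          # fresh random perturbation direction
        f_curr = f(x + delta * u)           # the single function query of this iteration
        g = (f_curr - f_prev) / delta * u   # residual between consecutive feedback points
        x = x - step * g                    # plain ZO gradient step
        f_prev = f_curr
    return x
```

    For instance, calling zo_residual_feedback(lambda x: float(x @ x), np.ones(5)) drives the iterate toward the origin using only one function value per iteration.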

    Convergence Analysis of Nonconvex Distributed Stochastic Zeroth-order Coordinate Method

    This paper investigates the stochastic distributed nonconvex optimization problem of minimizing a global cost function formed by the summation of $n$ local cost functions. We solve this problem via zeroth-order (ZO) information exchange and propose a ZO distributed primal-dual coordinate method (ZODIAC). At each iteration, agents query their own local stochastic ZO oracles along coordinate directions with an adaptive smoothing parameter to estimate their local gradients. We show that the proposed algorithm achieves a convergence rate of $\mathcal{O}(\sqrt{p}/\sqrt{T})$ for general nonconvex cost functions, where $p$ is the dimension of the decision variable and $T$ is the total number of iterations. We demonstrate the efficiency of the proposed algorithm through a numerical example in comparison with existing state-of-the-art centralized and distributed ZO algorithms.
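    The coordinate-wise part of the method rests on approximating each partial derivative from oracle values taken along coordinate directions. The sketch below shows one common form of such an estimate, assuming a central-difference construction and a hypothetical helper name coordinate_zo_gradient; the adaptive schedule for the smoothing parameter is not shown.

```python
import numpy as np

def coordinate_zo_gradient(F, x, xi, delta):
    """Coordinate-wise stochastic ZO gradient estimate (illustrative sketch).

    F     : local stochastic oracle F(x, xi) returning a noisy cost value
    x     : current decision variable (1-D numpy array of dimension p)
    xi    : data sample drawn for this estimate
    delta : smoothing parameter (chosen adaptively per iteration in the paper)
    """
    p = x.size
    g = np.zeros(p)
    for j in range(p):
        e_j = np.zeros(p)
        e_j[j] = 1.0
        # Two oracle values along coordinate j give a central-difference estimate.
        g[j] = (F(x + delta * e_j, xi) - F(x - delta * e_j, xi)) / (2.0 * delta)
    return g
```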

    Zeroth-Order Algorithms for Stochastic Distributed Nonconvex Optimization

    In this paper, we consider a stochastic distributed nonconvex optimization problem in which the cost function is distributed over $n$ agents that have access only to zeroth-order (ZO) information of the cost. This problem has various machine learning applications. As a solution, we propose two distributed ZO algorithms in which, at each iteration, each agent samples its local stochastic ZO oracle at two points with an adaptive smoothing parameter. We show that the proposed algorithms achieve the linear speedup convergence rate $\mathcal{O}(\sqrt{p/(nT)})$ for smooth cost functions and the rate $\mathcal{O}(p/(nT))$ when the global cost function additionally satisfies the Polyak-Łojasiewicz (P-Ł) condition, where $p$ and $T$ are the dimension of the decision variable and the total number of iterations, respectively. To the best of our knowledge, this is the first linear speedup result for distributed ZO algorithms; it means that performance can be systematically improved by adding more agents. We also show that the proposed algorithms converge linearly for deterministic centralized optimization problems under the P-Ł condition. We demonstrate through numerical experiments the efficiency of our algorithms on generating adversarial examples from deep neural networks, in comparison with baseline and recently proposed centralized and distributed ZO algorithms.
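    The two-point sampling step performed by each agent can be illustrated with a standard random-direction ZO estimator. In the sketch below, the function name two_point_zo_gradient, the sphere-sampled direction, and the scaling by the dimension $p$ are assumptions about the estimator's form, and the adaptive smoothing schedule is omitted.

```python
import numpy as np

def two_point_zo_gradient(F, x, xi, delta, rng=None):
    """Two-point random-direction stochastic ZO gradient estimate (illustrative sketch).

    F     : local stochastic oracle F(x, xi) returning a noisy cost value
    x     : current decision variable (1-D numpy array of dimension p)
    xi    : data sample shared by the two queries of this iteration
    delta : smoothing parameter (decreased adaptively over iterations)
    """
    rng = np.random.default_rng() if rng is None else rng
    p = x.size
    u = rng.standard_normal(p)
    u /= np.linalg.norm(u)              # random direction on the unit sphere
    fwd = F(x + delta * u, xi)          # first oracle query
    bwd = F(x - delta * u, xi)          # second oracle query
    return p * (fwd - bwd) / (2.0 * delta) * u
```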