Distributionally Time-Varying Online Stochastic Optimization under Polyak-Łojasiewicz Condition with Application in Conditional Value-at-Risk Statistical Learning
In this work, we consider a sequence of stochastic optimization problems
following a time-varying distribution via the lens of online optimization.
Assuming that the loss function satisfies the Polyak-Łojasiewicz condition,
we apply online stochastic gradient descent and establish its dynamic regret
bound that is composed of cumulative distribution drifts and cumulative
gradient biases caused by stochasticity. The distribution metric we adopt is
the Wasserstein distance, which remains well-defined without an absolute
continuity assumption and under a time-varying support set. We also establish a regret bound
of online stochastic proximal gradient descent when the objective function is
regularized. Moreover, we show that the above framework can be applied to the
Conditional Value-at-Risk (CVaR) learning problem. In particular, we improve an
existing proof that the CVaR objective satisfies the PL condition, which in
turn yields a regret bound for online stochastic gradient descent on this problem.
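To make the CVaR application concrete, below is a minimal sketch of online stochastic subgradient descent on the Rockafellar-Uryasev formulation of CVaR under a drifting data distribution. The squared loss, the Gaussian drift model, and the step size eta are illustrative assumptions, not the paper's exact setting.

```python
import numpy as np

rng = np.random.default_rng(0)
d, alpha, eta, T = 5, 0.95, 0.05, 2000

theta = np.zeros(d)          # model parameters
t_var = 0.0                  # auxiliary VaR variable in the CVaR objective
w_true = rng.normal(size=d)  # drifting regression target (an assumption)

for k in range(T):
    w_true += 0.001 * rng.normal(size=d)   # time-varying distribution
    x = rng.normal(size=d)
    y = x @ w_true + 0.1 * rng.normal()

    loss = (x @ theta - y) ** 2            # per-sample loss ell(theta; xi)
    tail = 1.0 if loss > t_var else 0.0    # indicator of the alpha-tail event

    # Subgradients of  t + (ell - t)_+ / (1 - alpha)  w.r.t. theta and t.
    g_theta = (tail / (1.0 - alpha)) * 2.0 * (x @ theta - y) * x
    g_t = 1.0 - tail / (1.0 - alpha)

    theta -= eta * g_theta
    t_var -= eta * g_t

print("final VaR estimate:", t_var)
```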
Social welfare and profit maximization from revealed preferences
Consider the seller's problem of finding optimal prices for her
(divisible) goods when faced with a set of consumers, given that she can
only observe their purchased bundles at posted prices, i.e., revealed
preferences. We study both social welfare and profit maximization with revealed
preferences. Although social welfare maximization is a seemingly non-convex
optimization problem in prices, we show that (i) it can be reduced to a dual
convex optimization problem in prices, and (ii) the revealed preferences can be
interpreted as supergradients of the concave conjugate of valuation, with which
subgradients of the dual function can be computed. We thereby obtain a simple
subgradient-based algorithm for strongly concave valuations and convex cost,
whose query complexity is bounded in terms of the additive difference between
the social welfare induced by our algorithm and the optimum
social welfare. We also study social welfare maximization under the online
setting, specifically the random permutation model, where consumers arrive
one-by-one in a random order. For the case where consumer valuations can be
arbitrary continuous functions, we propose a price posting mechanism whose
expected social welfare is within an additive factor of the maximum social
welfare. Finally, for profit maximization (which may be
non-convex in simple cases), we give nearly matching upper and lower bounds on
the query complexity for separable valuations and cost (i.e., each good can be
treated independently).
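As a rough illustration of the dual approach, the sketch below runs projected subgradient descent on posted prices, using the consumers' demanded bundles (the revealed preferences) as supergradients of the conjugate valuations, so that supply minus aggregate demand is a subgradient of the convex dual. The quadratic valuations and production cost are illustrative assumptions, not the paper's model.

```python
import numpy as np

rng = np.random.default_rng(1)
m, d, eta, T = 10, 4, 0.05, 500

B = rng.uniform(1.0, 3.0, size=(m, d))  # v_i(x) = b_i.x - ||x||^2 / 2
c = 2.0                                 # production cost c(y) = c ||y||^2 / 2
p = np.ones(d)                          # posted prices

for _ in range(T):
    # Revealed preferences: each consumer's demanded bundle at prices p.
    demand = np.clip(B - p, 0.0, None).sum(axis=0)
    # Seller's optimal supply y(p) = argmax_y p.y - c(y).
    supply = p / c
    # supply - demand is a subgradient of the convex dual in p;
    # project the update back onto nonnegative prices.
    p = np.clip(p - eta * (supply - demand), 0.0, None)

print("final prices:", p)
```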
Second-order Online Nonconvex Optimization
We present the online Newton's method, a single-step second-order method for
online nonconvex optimization. We analyze its performance and obtain a dynamic
regret bound that is linear in the cumulative variation between round optima.
We show that when the variation between round optima is sufficiently small,
the method achieves a constant regret bound. In the general case, the online Newton's method
outperforms online convex optimization algorithms for convex functions and
performs similarly to a specialized algorithm for strongly convex functions. We
simulate the performance of the online Newton's method on a nonlinear,
nonconvex moving target localization example and find that it outperforms a
first-order approach.
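A minimal sketch of one damped Newton step per round on a nonconvex range-based localization loss is given below. The measurement model, the target's random-walk drift, the Levenberg-style damping lam, and the finite-difference derivatives are all illustrative assumptions rather than the paper's exact method.

```python
import numpy as np

rng = np.random.default_rng(2)
anchors = np.array([[0.0, 0.0], [4.0, 0.0], [0.0, 4.0]])
target = np.array([1.0, 1.0])
x = np.array([3.0, 3.0])      # online estimate of the target position
lam, T, h = 2.0, 200, 1e-4    # damping, rounds, finite-difference step

def loss(x, ranges):
    # Nonconvex in x: squared error in measured *distances*, not positions.
    return np.sum((np.linalg.norm(anchors - x, axis=1) - ranges) ** 2)

for t in range(T):
    target = target + 0.02 * rng.normal(size=2)        # moving target
    ranges = np.linalg.norm(anchors - target, axis=1)  # range measurements

    # Finite-difference gradient and Hessian of the round-t loss (an
    # assumption standing in for analytic derivatives).
    g = np.zeros(2)
    H = np.zeros((2, 2))
    for i in range(2):
        ei = np.eye(2)[i] * h
        g[i] = (loss(x + ei, ranges) - loss(x - ei, ranges)) / (2 * h)
        for j in range(2):
            ej = np.eye(2)[j] * h
            H[i, j] = (loss(x + ei + ej, ranges) - loss(x + ei - ej, ranges)
                       - loss(x - ei + ej, ranges)
                       + loss(x - ei - ej, ranges)) / (4 * h * h)

    # One damped (Levenberg-style) Newton step per round.
    x = x - np.linalg.solve(H + lam * np.eye(2), g)

print("final estimate:", x, "target:", target)
```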