Search CORE

2,846 research outputs found

Variance-Reduced and Projection-Free Stochastic Optimization

Author: Hazan Elad
Luo Haipeng
Publication venue
Publication date: 01/01/2016
Field of study

The Frank-Wolfe optimization algorithm has recently regained popularity for machine learning applications due to its projection-free property and its ability to handle structured constraints. However, in the stochastic learning setting, it is still relatively understudied compared to the gradient descent counterpart. In this work, leveraging a recent variance reduction technique, we propose two stochastic Frank-Wolfe variants which substantially improve previous results in terms of the number of stochastic gradient evaluations needed to achieve

1-\epsilon

accuracy. For example, we improve from

O(\frac{1}{\epsilon})

O(\ln\frac{1}{\epsilon})

if the objective function is smooth and strongly convex, and from

O(\frac{1}{\epsilon^2})

O(\frac{1}{\epsilon^{1.5}})

if the objective function is smooth and Lipschitz. The theoretical improvement is also observed in experiments on real-world datasets for a multiclass classification application

arXiv.org e-Print Archive

Princeton University Open Access Repository

Towards Minimax Online Learning with Unknown Time Horizon

Author: Luo Haipeng
Schapire Robert E.
Publication venue
Publication date: 06/10/2013
Field of study

We consider online learning when the time horizon is unknown. We apply a minimax analysis, beginning with the fixed horizon case, and then moving on to two unknown-horizon settings, one that assumes the horizon is chosen randomly according to some known distribution, and the other which allows the adversary full control over the horizon. For the random horizon setting with restricted losses, we derive a fully optimal minimax algorithm. And for the adversarial horizon setting, we prove a nontrivial lower bound which shows that the adversary obtains strictly more power than when the horizon is fixed and known. Based on the minimax solution of the random horizon setting, we then propose a new adaptive algorithm which "pretends" that the horizon is drawn from a distribution from a special family, but no matter how the actual horizon is chosen, the worst-case regret is of the optimal rate. Furthermore, our algorithm can be combined and applied in many ways, for instance, to online convex optimization, follow the perturbed leader, exponential weights algorithm and first order bounds. Experiments show that our algorithm outperforms many other existing algorithms in an online linear optimization setting

arXiv.org e-Print Archive

CiteSeerX

Princeton University Open Access Repository