
    Variance-Reduced and Projection-Free Stochastic Optimization

    The Frank-Wolfe optimization algorithm has recently regained popularity for machine learning applications due to its projection-free property and its ability to handle structured constraints. However, in the stochastic learning setting, it is still relatively understudied compared to its gradient descent counterpart. In this work, leveraging a recent variance reduction technique, we propose two stochastic Frank-Wolfe variants which substantially improve previous results in terms of the number of stochastic gradient evaluations needed to achieve $1-\epsilon$ accuracy. For example, we improve from $O(\frac{1}{\epsilon})$ to $O(\ln\frac{1}{\epsilon})$ if the objective function is smooth and strongly convex, and from $O(\frac{1}{\epsilon^2})$ to $O(\frac{1}{\epsilon^{1.5}})$ if the objective function is smooth and Lipschitz. The theoretical improvement is also observed in experiments on real-world datasets for a multiclass classification application.
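    A minimal sketch of the underlying idea, not the exact algorithms from the paper: an SVRG-style variance-reduced gradient estimate plugged into the projection-free Frank-Wolfe update. The $\ell_1$-ball linear minimization oracle, the function names `grad_full` and `grad_i`, the step-size schedule, and the epoch lengths below are illustrative assumptions.

```python
import numpy as np

def lmo_l1_ball(grad, radius=1.0):
    # Linear minimization oracle over the l1-ball of the given radius:
    # argmin_{||s||_1 <= radius} <grad, s> puts all mass on the coordinate
    # with the largest absolute gradient entry -- no projection needed.
    s = np.zeros_like(grad)
    i = np.argmax(np.abs(grad))
    s[i] = -radius * np.sign(grad[i])
    return s

def variance_reduced_fw(grad_full, grad_i, x0, n, epochs=20, inner=50, radius=1.0, seed=0):
    # Sketch of a variance-reduced stochastic Frank-Wolfe loop: keep a snapshot
    # point, compute its full gradient once per epoch, and correct single-sample
    # gradients with it (SVRG-style), so the gradient estimate's variance shrinks
    # as the iterates approach the snapshot.
    rng = np.random.default_rng(seed)
    x = x0.astype(float).copy()
    for _ in range(epochs):
        snapshot = x.copy()
        mu = grad_full(snapshot)                        # full gradient at the snapshot
        for t in range(inner):
            i = rng.integers(n)                         # sample one component function
            g = grad_i(x, i) - grad_i(snapshot, i) + mu # variance-reduced estimate
            v = lmo_l1_ball(g, radius)                  # projection-free step direction
            gamma = 2.0 / (t + 2)                       # standard Frank-Wolfe step size
            x = (1.0 - gamma) * x + gamma * v
    return x
```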

    Stochastic Frank-Wolfe Methods for Nonconvex Optimization

    We study Frank-Wolfe methods for nonconvex stochastic and finite-sum optimization problems. Frank-Wolfe methods (in the convex case) have gained tremendous recent interest in machine learning and optimization communities due to their projection-free property and their ability to exploit structured constraints. However, our understanding of these algorithms in the nonconvex setting is fairly limited. In this paper, we propose nonconvex stochastic Frank-Wolfe methods and analyze their convergence properties. For objective functions that decompose into a finite-sum, we leverage ideas from variance reduction techniques for convex optimization to obtain new variance-reduced nonconvex Frank-Wolfe methods that have provably faster convergence than the classical Frank-Wolfe method. Finally, we show that the faster convergence rates of our variance-reduced methods also translate into improved convergence rates for the stochastic setting.
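    In the nonconvex constrained setting, convergence is typically measured by the Frank-Wolfe gap, which vanishes exactly at stationary points. The sketch below is a plain minibatch stochastic Frank-Wolfe loop, not the variance-reduced variants proposed in the paper; the names `grad_batch` and `lmo`, the fixed step size, and the batch size are illustrative assumptions.

```python
import numpy as np

def stochastic_fw_nonconvex(grad_batch, lmo, x0, iters=200, batch=64, gamma=0.05):
    # Minibatch stochastic Frank-Wolfe with a fixed step size.  The quantity
    # tracked below, <g, x - s> with s = argmin_{s in C} <g, s>, is the
    # (stochastic) Frank-Wolfe gap: it is zero exactly at stationary points of
    # the constrained problem, so it replaces the gradient norm as the
    # convergence measure in the nonconvex analysis.
    x = x0.astype(float).copy()
    gaps = []
    for _ in range(iters):
        g = grad_batch(x, batch)              # minibatch gradient estimate
        s = lmo(g)                            # projection-free step direction
        gaps.append(float(np.dot(g, x - s)))  # stochastic Frank-Wolfe gap
        x = (1.0 - gamma) * x + gamma * s
    return x, gaps
```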

    An Asynchronous Parallel Randomized Kaczmarz Algorithm

    We describe an asynchronous parallel variant of the randomized Kaczmarz (RK) algorithm for solving the linear system $Ax=b$. The analysis shows linear convergence and indicates that nearly linear speedup can be expected if the number of processors is bounded by a multiple of the number of rows in $A$.
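    For reference, a minimal sequential sketch of the randomized Kaczmarz update follows; the asynchronous parallel variant in the paper runs essentially this same row-projection update from multiple processors without locking. The row-sampling scheme proportional to squared row norms and the iteration count are standard illustrative choices, not details taken from the paper.

```python
import numpy as np

def randomized_kaczmarz(A, b, iters=20000, seed=0):
    # Randomized Kaczmarz for Ax = b: pick a row i with probability
    # proportional to ||a_i||^2, then project the current iterate onto
    # the hyperplane {x : a_i^T x = b_i}.
    rng = np.random.default_rng(seed)
    m, n = A.shape
    row_norms_sq = np.einsum("ij,ij->i", A, A)
    probs = row_norms_sq / row_norms_sq.sum()
    x = np.zeros(n)
    for _ in range(iters):
        i = rng.choice(m, p=probs)
        a = A[i]
        x += ((b[i] - a @ x) / row_norms_sq[i]) * a  # hyperplane projection
    return x

# Usage on a small consistent system: the iterate converges to the true solution.
rng = np.random.default_rng(1)
A = rng.standard_normal((200, 50))
x_true = rng.standard_normal(50)
b = A @ x_true
x_hat = randomized_kaczmarz(A, b)
print(np.linalg.norm(x_hat - x_true))  # should be close to zero
```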