Search CORE

82 research outputs found

Unconstrained Online Linear Learning in Hilbert Spaces: Minimax Algorithms and Normal Approximations

Author: McMahan H. Brendan
Orabona Francesco
Publication venue
Publication date: 21/05/2014
Field of study

We study algorithms for online linear optimization in Hilbert spaces, focusing on the case where the player is unconstrained. We develop a novel characterization of a large class of minimax algorithms, recovering, and even improving, several previous results as immediate corollaries. Moreover, using our tools, we develop an algorithm that provides a regret bound of

\mathcal{O}\Big(U \sqrt{T \log(U \sqrt{T} \log^2 T +1)}\Big)

, where

U

is the

L_2

norm of an arbitrary comparator and both

T

and

U

are unknown to the player. This bound is optimal up to

\sqrt{\log \log T}

terms. When

T

is known, we derive an algorithm with an optimal regret bound (up to constant factors). For both the known and unknown

T

case, a Normal approximation to the conditional value of the game proves to be the key analysis tool.Comment: Proceedings of the 27th Annual Conference on Learning Theory (COLT 2014

arXiv.org e-Print Archive

CiteSeerX

Adaptive Bound Optimization for Online Convex Optimization

Author: McMahan H. Brendan
Streeter Matthew
Publication venue
Publication date: 01/01/2010
Field of study

We introduce a new online convex optimization algorithm that adaptively chooses its regularization function based on the loss functions observed so far. This is in contrast to previous algorithms that use a fixed regularization function such as L2-squared, and modify it only via a single time-dependent parameter. Our algorithm's regret bounds are worst-case optimal, and for certain realistic classes of loss functions they are much better than existing bounds. These bounds are problem-dependent, which means they can exploit the structure of the actual problem instance. Critically, however, our algorithm does not need to know this structure in advance. Rather, we prove competitive guarantees that show the algorithm provides a bound within a constant factor of the best possible bound (of a certain functional form) in hindsight.Comment: Updates to match final COLT versio

arXiv.org e-Print Archive

CiteSeerX

Graph Oracle Models, Lower Bounds, and Gaps for Parallel Stochastic Optimization

Author: McMahan Brendan
Smith Adam
Srebro Nathan
Wang Jialei
Woodworth Blake
Publication venue
Publication date: 01/12/2018
Field of study

We suggest a general oracle-based framework that captures different parallel stochastic optimization settings described by a dependency graph, and derive generic lower bounds in terms of this graph. We then use the framework and derive lower bounds for several specific parallel optimization settings, including delayed updates and parallel processing with intermittent communication. We highlight gaps between lower and upper bounds on the oracle complexity, and cases where the "natural" algorithms are not known to be optimal

arXiv.org e-Print Archive

Boston University Institutional Repository (OpenBU)