Search CORE

19,888 research outputs found

A closer look at adaptive regret

Author: Adamskiy Dmitry
Chernov Alexey
Koolen Wouter
Vovk Vladimir
Publication venue
Publication date: 13/01/2016
Field of study

For the prediction with expert advice setting, we consider methods to construct algorithms that have low adaptive regret. The adaptive regret of an algorithm on a time interval [t1,t2] is the loss of the algorithm minus the loss of the best expert over that interval. Adaptive regret measures how well the algorithm approximates the best expert locally, and so is different from, although closely related to, both the classical regret, measured over an initial time interval [1,t], and the tracking regret, where the algorithm is compared to a good sequence of experts over [1,t]. We investigate two existing intuitive methods for deriving algorithms with low adaptive regret, one based on specialist experts and the other based on restarts. Quite surprisingly, we show that both methods lead to the same algorithm, namely Fixed Share, which is known for its tracking regret. We provide a thorough analysis of the adaptive regret of Fixed Share. We obtain the exact worst-case adaptive regret for Fixed Share, from which the classical tracking bounds follow. We prove that Fixed Share is

CiteSeerX

CWI's Institutional Repository

Royal Holloway - Pure

University of Brighton Research Portal

A closer look at adaptive regret

Author: Adamskyi D. (Dmitri)
Chernov A. (Alexey)
Koolen-Wijkstra W.M. (Wouter)
Vovk V.
Publication venue
Publication date: 06/04/2016
Field of study

CWI's Institutional Repository

A Closer Look at Adaptive Regret

Author: A. Chernov
G.I. Shamir
M. Herbster
N. Littlestone
O. Bousquet
V. Vovk
V. Vovk
Y. Freund
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

Crossref

Linear Coupling: An Ultimate Unification of Gradient and Mirror Descent

Author: Allen-Zhu Zeyuan
Orecchia Lorenzo
Publication venue
Publication date: 07/11/2016
Field of study

First-order methods play a central role in large-scale machine learning. Even though many variations exist, each suited to a particular problem, almost all such methods fundamentally rely on two types of algorithmic steps: gradient descent, which yields primal progress, and mirror descent, which yields dual progress. We observe that the performances of gradient and mirror descent are complementary, so that faster algorithms can be designed by LINEARLY COUPLING the two. We show how to reconstruct Nesterov's accelerated gradient methods using linear coupling, which gives a cleaner interpretation than Nesterov's original proofs. We also discuss the power of linear coupling by extending it to many other settings that Nesterov's methods cannot apply to.Comment: A new section added; polished writin

arXiv.org e-Print Archive

Boston University Institutional Repository (OpenBU)

Dagstuhl Research Online Publication Server

Second-order Quantile Methods for Experts and Combinatorial Games

Author: Koolen Wouter M.
van Erven Tim
Publication venue
Publication date: 01/01/2015
Field of study

We aim to design strategies for sequential decision making that adjust to the difficulty of the learning problem. We study this question both in the setting of prediction with expert advice, and for more general combinatorial decision tasks. We are not satisfied with just guaranteeing minimax regret rates, but we want our algorithms to perform significantly better on easy data. Two popular ways to formalize such adaptivity are second-order regret bounds and quantile bounds. The underlying notions of 'easy data', which may be paraphrased as "the learning problem has small variance" and "multiple decisions are useful", are synergetic. But even though there are sophisticated algorithms that exploit one of the two, no existing algorithm is able to adapt to both. In this paper we outline a new method for obtaining such adaptive algorithms, based on a potential function that aggregates a range of learning rates (which are essential tuning parameters). By choosing the right prior we construct efficient algorithms and show that they reap both benefits by proving the first bounds that are both second-order and incorporate quantiles

arXiv.org e-Print Archive

CWI's Institutional Repository

Queensland University of Technology ePrints Archive

Shifting Regret, Mirror Descent, and Matrices

Author: Gyorgy A
Szepesvari C
Publication venue: Journal of Machine Learning Research
Publication date: 24/04/2016
Field of study

We consider the problem of online prediction in changing environments. In this framework the performance of a predictor is evaluated as the loss relative to an arbitrarily changing predictor, whose individual components come from a base class of predictors. Typical results in the literature consider different base classes (experts, linear predictors on the simplex, etc.) separately. Introducing an arbitrary mapping inside the mirror decent algorithm, we provide a framework that unifies and extends existing results. As an example, we prove new shifting regret bounds for matrix prediction problems

Spiral - Imperial College Digital Repository