Search CORE

11 research outputs found

When Data Compression and Statistics Disagree: Two Frequentist Challenges for the Minimum Description Length Principle

Author: Erven T.A.L. (Tim) van
Publication venue
Publication date: 23/11/2010
Field of study

CWI's Institutional Repository

The momentum problem in MDL and Bayesian prediction

Author: Erven T.A.L. (Tim) van
Publication venue: 'Universite Catholique de Louvain'
Publication date: 01/05/2006
Field of study

CWI's Institutional Repository

Learning the Switching Rate by Discretising Bernoulli Sources Online

Author: Erven T.A.L. (Tim) van
Rooij S. (Steven) de
Publication venue: Journal of Machine Learning Research
Publication date: 01/01/2009
Field of study

CWI's Institutional Repository

MetaGrad: multiple learning rates in online learning

Author: Erven T.A.L. (Tim) van
Koolen-Wijkstra W.M. (Wouter)
Publication venue
Publication date: 05/12/2016
Field of study

CWI's Institutional Repository

Lipschitz Adaptivity with Multiple Learning Rates in Online Learning

Author: Erven T.A.L. (Tim) van
Koolen-Wijkstra W.M. (Wouter)
Mhammedi Z. (Zakaria)
Publication venue
Publication date: 25/06/2019
Field of study

We aim to design adaptive online learning algorithms that take advantage of any special structure that might be present in the learning task at hand, with as little manual tuning by the user as possible. A fundamental obstacle that comes up in the design of such adaptive algorithms is to calibrate a so-called step-size or learning rate hyperparameter depending on variance, gradient norms, etc. A recent technique promises to overcome this difficulty by maintaining multiple learning rates in parallel. This technique has been applied in the MetaGrad algorithm for online convex optimization and the Squint algorithm for prediction with expert advice. However, in both cases the user still has to provide in advance a Lipschitz hyperparameter that bounds the norm of the gradients. Although this hyperparameter is typically not available in advance, tuning it correctly is crucial: if it is set too small, the methods may fail completely; but if it is taken too large, performance deteriorates significantly. In the present work we remove this Lipschitz hyperparameter by designing new versions of MetaGrad and Squint that adapt to its optimal value automatically. We achieve this by dynamically updating the set of active learning rates. For MetaGrad, we further improve the computational efficiency of handling constraints on the domain of prediction, and we remove the need to specify the number of rounds in advance

CWI's Institutional Repository

Catching Up Faster by Switching Sooner: A Predictive Approach to Adaptive Estimation with an application to the AIC-BIC Dilemma

Author: Erven T.A.L. (Tim) van
Grünwald P.D. (Peter)
Rooij S. (Steven) de
Publication venue: 'Wiley'
Publication date: 01/01/2012
Field of study

CWI's Institutional Repository

Open problem: Fast and optimal online portfolio selection

Author: Erven T.A.L. (Tim) van
Hoeven D. (Dirk) van der
Koolen-Wijkstra W.M. (Wouter)
Kotlowski W.T. (Wojciech)
Publication venue
Publication date: 09/07/2020
Field of study

Online portfolio selection has received much attention in the COLT community since its introduction by Cover, but all state-of-the-art methods fall short in at least one of the following ways: they are either i) computationally infeasible; or ii) they do not guarantee optimal regret; or iii) they assume the gradients are bounded, which is unnecessary and cannot be guaranteed. We are interested in a natural follow-the-regularized-leader (FTRL) approach based on the log barrier regularizer, which is computationally feasible. The open problem we put before the community is to formally prove whether this approach achieves the optimal regret. Resolving this question will likely lead to new techniques to analyse FTRL algorithms. There are also interesting technical connections to self-concordance, which has previously been used in the context of bandit convex optimization

CWI's Institutional Repository

Rényi Divergence and Majorization

Author: Erven T.A.L. (Tim) van
Harremoës P. (Peter)
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2010
Field of study

CWI's Institutional Repository

Tracking experts that learn by evolving past posteriors

Author: Erven T.A.L. (Tim) van
Koolen-Wijkstra W.M. (Wouter)
Publication venue: 'Cornell University Library'
Publication date: 01/02/2009
Field of study

CWI's Institutional Repository

Switching between hidden Markov models using Fixed Share

Author: Erven T.A.L. (Tim) van
Koolen-Wijkstra W.M. (Wouter)
Publication venue: 'Cornell University Library'
Publication date: 01/02/2010
Field of study

CWI's Institutional Repository