11 research outputs found

    An Identity for Kernel Ridge Regression

    This paper derives an identity connecting the square loss of ridge regression in on-line mode with the loss of the retrospectively best regressor. Some corollaries about the properties of the cumulative loss of on-line ridge regression are also obtained. Comment: 35 pages; extended version of ALT 2010 paper (Proceedings of ALT 2010, LNCS 6331, Springer, 2010).
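    The on-line protocol behind this identity is simple to state: at each round the learner outputs the kernel ridge regression prediction fitted to all previously seen examples, and then incurs the square loss on the newly revealed label. Below is a minimal sketch of that protocol only (not the identity itself or its proof); the RBF kernel, the regularization parameter `lam`, and the toy data are illustrative assumptions.

```python
import numpy as np

def rbf_kernel(X, Z, gamma=1.0):
    """Gaussian (RBF) kernel matrix between the rows of X and Z."""
    d2 = ((X[:, None, :] - Z[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

def online_kernel_ridge_loss(X, y, lam=1.0):
    """Run kernel ridge regression in on-line mode and return the cumulative
    square loss sum_t (y_t - yhat_t)^2, where yhat_t is the ridge prediction
    built from rounds 1..t-1 only."""
    cumulative_loss = 0.0
    for t in range(len(y)):
        if t == 0:
            yhat = 0.0                      # nothing seen yet: predict zero
        else:
            K = rbf_kernel(X[:t], X[:t])
            alpha = np.linalg.solve(K + lam * np.eye(t), y[:t])
            yhat = float(rbf_kernel(X[t:t + 1], X[:t]) @ alpha)
        cumulative_loss += (y[t] - yhat) ** 2
    return cumulative_loss

# Toy stream for illustration only.
rng = np.random.default_rng(0)
X = rng.normal(size=(50, 3))
y = np.sin(X[:, 0]) + 0.1 * rng.normal(size=50)
print(online_kernel_ridge_loss(X, y, lam=1.0))
```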

    Editors' Introduction to Algorithmic Learning Theory: 21st International Conference, ALT 2010, Canberra, Australia, October 6-8, 2010. Proceedings

    Learning theory is an active research area that incorporates ideas, problems, and techniques from a wide range of disciplines including statistics, artificial intelligence, information theory, pattern recognition, and theoretical computer science. The research reported at the 21st International Conference on Algorithmic Learning Theory (ALT 2010) ranges over areas such as query models, online learning, inductive inference, boosting, kernel methods, complexity and learning, reinforcement learning, unsupervised learning, grammatical inference, and algorithmic forecasting. In this introduction we give an overview of the five invited talks and the regular contributions of ALT 2010.

    Efficient second-order online kernel learning with adaptive embedding

    Online kernel learning (OKL) is a flexible framework for prediction problems, since the large approximation space provided by reproducing kernel Hilbert spaces can contain an accurate function for the problem. Nonetheless, optimizing over this space is computationally expensive. Not only do first-order methods accumulate $O(\sqrt{T})$ more loss than the optimal function, but the curse of kernelization results in an $O(t)$ per-step complexity. Second-order methods get closer to the optimum much faster, suffering only $O(\log T)$ regret, but second-order updates are even more expensive, with an $O(t^2)$ per-step cost. Existing approximate OKL methods try to reduce this complexity either by limiting the support vectors (SVs) introduced in the predictor, or by avoiding the kernelization process altogether using an embedding. Nonetheless, as long as the size of the approximation space or the number of SVs does not grow over time, an adversary can always exploit the approximation process. In this paper, we propose PROS-N-KONS, a method that combines Nyström sketching, which projects each input point into a small, accurate embedded space, with efficient second-order updates performed in that space. The embedded space is continuously updated to guarantee that the embedding remains accurate, and we show that the per-step cost grows only with the effective dimension of the problem and not with $T$. Moreover, the second-order updates allow us to achieve logarithmic regret. We empirically compare our algorithm on recent large-scale benchmarks and show that it performs favorably.
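    As a rough illustration of the two ingredients the abstract describes, the sketch below embeds each input through a Nyström dictionary and then applies a second-order (online-Newton-style) update with a Sherman-Morrison inverse. This is not the actual PROS-N-KONS algorithm: the dictionary is fixed here rather than continuously updated, and the RBF kernel, regularizer, and toy data are all assumptions made for the example.

```python
import numpy as np

def rbf_kernel(X, Z, gamma=1.0):
    """Gaussian (RBF) kernel matrix between the rows of X and Z."""
    d2 = ((X[:, None, :] - Z[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

class NystromSecondOrderLearner:
    """Fixed Nystrom embedding + online-Newton-style updates (illustration only)."""

    def __init__(self, dictionary, lam=1.0, gamma=1.0):
        self.D = dictionary                        # Nystrom anchor points, shape (m, d)
        self.gamma = gamma
        K_mm = rbf_kernel(self.D, self.D, gamma)
        # Embedding phi(x) = K_mm^{-1/2} k(D, x), via an eigendecomposition of K_mm.
        w_eig, V = np.linalg.eigh(K_mm + 1e-8 * np.eye(len(self.D)))
        self.K_inv_sqrt = V @ np.diag(w_eig ** -0.5) @ V.T
        m = len(self.D)
        self.w = np.zeros(m)                       # weight vector in the embedded space
        self.A_inv = np.eye(m) / lam               # inverse of the second-order matrix

    def embed(self, x):
        return self.K_inv_sqrt @ rbf_kernel(self.D, x[None, :], self.gamma)[:, 0]

    def predict(self, x):
        return float(self.w @ self.embed(x))

    def update(self, x, y):
        phi = self.embed(x)
        err = self.w @ phi - y                     # err * phi is proportional to the square-loss gradient
        A_phi = self.A_inv @ phi
        # Sherman-Morrison rank-one update of A^{-1}, then a Newton-like step.
        self.A_inv -= np.outer(A_phi, A_phi) / (1.0 + phi @ A_phi)
        self.w -= err * (self.A_inv @ phi)

# Toy stream: predict before each update and accumulate the square loss.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
y = np.sin(X[:, 0]) + 0.1 * rng.normal(size=200)
learner = NystromSecondOrderLearner(dictionary=X[:20], lam=1.0, gamma=1.0)
loss = 0.0
for x_t, y_t in zip(X, y):
    loss += (learner.predict(x_t) - y_t) ** 2
    learner.update(x_t, y_t)
print(loss)
```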

    Efficient online learning with kernels for adversarial large scale problems

    We are interested in a framework of online learning with kernels for low-dimensional but large-scale and potentially adversarial datasets. We study the computational and theoretical performance of online variations of kernel ridge regression. Despite its simplicity, the algorithm we study is the first to achieve the optimal regret for a wide range of kernels with a per-round complexity of order $n^\alpha$ with $\alpha < 2$. The algorithm we consider is based on approximating the kernel with the linear span of basis functions. Our contribution is two-fold: 1) For the Gaussian kernel, we propose to build the basis beforehand (independently of the data) through Taylor expansion. For $d$-dimensional inputs, we provide a (close to) optimal regret of order $O((\log n)^{d+1})$ with per-round time and space complexity $O((\log n)^{2d})$. This makes the algorithm a suitable choice as soon as $n \gg e^d$, which is likely to happen for small-dimensional, large-scale datasets; 2) For general kernels with low effective dimension, the basis functions are updated sequentially in a data-adaptive fashion by sampling Nyström points. In this case, our algorithm improves the computational trade-off known for online kernel regression.
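    The first contribution can be pictured as follows: expand the Gaussian kernel into an explicit, data-independent Taylor feature map truncated at some degree, and then run ordinary (linear) online ridge regression on those features. The sketch below is only an illustration under assumed choices of truncation degree, bandwidth `gamma`, and regularizer `lam`; the paper's exact basis and analysis may differ.

```python
from itertools import product
from math import exp, factorial, sqrt
import numpy as np

def taylor_feature_map(d, degree, gamma=1.0):
    """Feature map whose inner products approximate the Gaussian kernel
    exp(-gamma * ||x - z||^2):
    phi_alpha(x) = exp(-gamma ||x||^2) * sqrt((2 gamma)^{|alpha|} / alpha!) * x^alpha
    over all multi-indices alpha with |alpha| <= degree."""
    multi_indices = [a for a in product(range(degree + 1), repeat=d)
                     if sum(a) <= degree]
    coeffs = np.array([sqrt((2 * gamma) ** sum(a)
                            / np.prod([factorial(ai) for ai in a]))
                       for a in multi_indices])
    def phi(x):
        monomials = np.array([np.prod(x ** np.array(a)) for a in multi_indices])
        return exp(-gamma * float(x @ x)) * coeffs * monomials
    return phi

def online_ridge_on_features(X, y, phi, lam=1.0):
    """Online ridge regression in the explicit feature space; Sherman-Morrison
    updates keep the per-round cost quadratic in the number of basis functions."""
    m = len(phi(X[0]))
    A_inv = np.eye(m) / lam          # inverse of  lam*I + sum_t f_t f_t^T
    b = np.zeros(m)                  # sum_t y_t f_t
    cumulative_loss = 0.0
    for x_t, y_t in zip(X, y):
        f = phi(x_t)
        cumulative_loss += (y_t - f @ (A_inv @ b)) ** 2   # predict, then update
        A_f = A_inv @ f
        A_inv -= np.outer(A_f, A_f) / (1.0 + f @ A_f)
        b += y_t * f
    return cumulative_loss

# Toy stream for illustration only.
rng = np.random.default_rng(0)
X = rng.normal(size=(300, 2))
y = np.sin(X[:, 0] * X[:, 1]) + 0.1 * rng.normal(size=300)
print(online_ridge_on_features(X, y, taylor_feature_map(d=2, degree=4), lam=1.0))
```

    With truncation degree $M$ the basis has $\binom{M+d}{d} = O(M^d)$ functions, so quadratic-cost updates in this explicit space are consistent with the $O((\log n)^{2d})$ per-round complexity quoted above when $M$ grows like $\log n$.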