Efficient Linear Bandits through Matrix Sketching

I. Kuzborskij; L. Cella; N. Cesa-Bianchi

Efficient Linear Bandits through Matrix Sketching

Authors: I. Kuzborskij
L. Cella
N. Cesa-Bianchi
Publication date: 1 January 2019
Publisher: PMLR

Abstract

We prove that two popular linear contextual bandit algorithms, OFUL and Thompson Sampling, can be made efficient using Frequent Directions, a deterministic online sketching technique. More precisely, we show that a sketch of size m allows a O(md) update time for both algorithms, as opposed to \u2126(d 2 ) required by their non-sketched versions in general (where d is the dimension of context vectors). This computational speedup is accompanied by regret bounds of order (1 + \u3b5m) 3/2d 1a T for OFUL and of order (1 + \u3b5m)d 3/2 1a T for Thompson Sampling, where \u3b5m is bounded by the sum of the tail eigenvalues not covered by the sketch. In particular, when the selected contexts span a subspace of dimension at most m, our algorithms have a regret bound matching that of their slower, non-sketched counterparts. Experiments on real-world datasets corroborate our theoretical results

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

AIR Universita degli studi di Milano

oai:air.unimi.it:2434/640342

Last time updated on 30/04/2019