Efficient Change-Point Detection for Tackling Piecewise-Stationary Bandits

Besson, Lilian; Kaufmann, Emilie; Maillard, Odalric-Ambrym; Seznec, Julien

Efficient Change-Point Detection for Tackling Piecewise-Stationary Bandits

Authors: Lilian Besson
Emilie Kaufmann
Odalric-Ambrym Maillard
Julien Seznec
Publication date: 1 March 2022
Publisher: Microtome Publishing

Abstract

International audienceWe introduce GLR-klUCB, a novel algorithm for the piecewise iid non-stationary bandit problem with bounded rewards. This algorithm combines an efficient bandit algorithm, kl-UCB, with an efficient, parameter-free, changepoint detector, the Bernoulli Generalized Likelihood Ratio Test, for which we provide new theoretical guarantees of independent interest. Unlike previous non-stationary bandit algorithms using a change-point detector, GLR-klUCB does not need to be calibrated based on prior knowledge on the arms' means. We prove that this algorithm can attain a