Pure Exploration in Infinitely-Armed Bandit Models with Fixed-Confidence

Anderton, Jesse; Aslam, Javed; Aziz, Maryam; Kaufmann, Emilie

Authors: Jesse Anderton, Javed Aslam, Maryam Aziz, Emilie Kaufmann
Publication date: 7 April 2018
Publisher: HAL CCSD

Abstract

We consider the problem of near-optimal arm identification in the fixed-confidence setting of the infinitely-armed bandit problem when nothing is known about the arm reservoir distribution. We (1) introduce a PAC-like framework within which to derive and cast results; (2) derive a sample complexity lower bound for near-optimal arm identification; (3) propose an algorithm that identifies a nearly-optimal arm with high probability and derive an upper bound on its sample complexity that is within a log factor of our lower bound; and (4) discuss whether our log^2(1/delta) dependence is inescapable for "two-phase" (select arms first, identify the best later) algorithms in the infinite setting. This work permits the application of bandit models to a broader class of problems where fewer assumptions hold.

Available Versions

INRIA a CCSD electronic archive server

oai:HAL:hal-01729969v1

Last time updated on 06/05/2018