Pure Exploration with Multiple Correct Answers

Degenne, Rémy; Koolen, Wouter M.

Authors: Rémy Degenne, Wouter M. Koolen
Publication date: 1 January 2019

Abstract

We determine the sample complexity of pure exploration bandit problems with multiple good answers. We derive a lower bound using a new game equilibrium argument. We show how continuity and convexity properties of single-answer problems ensure that the Track-and-Stop algorithm has asymptotically optimal sample complexity. However, that convexity is lost when going to the multiple-answer setting. We present a new algorithm that extends Track-and-Stop to the multiple-answer case and has asymptotic sample complexity matching the lower bound.

Available Versions

NARCIS

Last time updated on 29/05/2021

CWI's Institutional Repository

oai:cwi.nl:29299

Last time updated on 18/04/2020