research

Stationary Multi Choice Bandit Problems

Abstract

This note shows that the optimal choice of k simultaneous experiments in a stationary multi-armed bandit problem can be characterized in terms of the Gittins index of each arm. The index characterization remains equally valid after the introduction of switching costs.multi-armed bandits, Gittins index, Stationary bandits, Job search

    Similar works