Asymptotic properties of bandit processes with geometric responses

Abstract

Asymptotic properties of optimal strategies for two-armed bandit processes with geometrically distributed survival times are derived. These results provide asymptotic boundary conditions and further extend the structural properties of optimal strategies for bandit processes with delayed responses.
