Strategic Experimentation with Poisson Bandits

Keller, Godfrey; Rady, Sven

research

oai:epub.ub.uni-muenchen.de:13292

Strategic Experimentation with Poisson Bandits

Authors: Godfrey Keller
Sven Rady
Publication date: 1 May 2009
Publisher
Doi

Abstract

We study a game of strategic experimentation with two-armed bandits where the risky arm distributes lump-sum payoffs according to a Poisson process. Its intensity is either high or low, and unknown to the players. We consider Markov perfect equilibria with beliefs as the state variable. As the belief process is piece-wise deterministic, payoff functions solve differential-difference equations. There is no equilibrium where all players use cut-off strategies, and all equilibria exhibit an ‘encouragement effect’ relative to the single-agent optimum. We construct asymmetric equilibria in which players have symmetric continuation values at sufficiently optimistic beliefs yet take turns playing the risky arm before all experimentation stops. Owing to the encouragement effect, these equilibria Pareto dominate the unique symmetric one for sufficiently frequent turns. Rewarding the last experimenter with a higher continuation value increases the range of beliefs where players experiment, but may reduce average payoffs at more optimistic beliefs. Some equilibria exhibit an ‘anticipation effect’: as beliefs become more pessimistic, the continuation value of a single experimenter increases over some range because a lower belief means a shorter wait until another player takes over

Similar works

Full text

Open in the Core reader

Download PDF

Open Access LMU

oai:epub.ub.uni-muenchen.de:13...

Last time updated on 19/07/2013

This paper was published in Open Access LMU.

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.