381,654 research outputs found

    Market Experimentation in a Dynamic Differentiated-Goods Duopoly

    Get PDF
    We study the evolution of prices in a symmetric duopoly where firms are uncertain about the degree of product differentiation. Customers sometimes perceive the products as close substitutes, sometimes as highly differentiated. Firms learn about their competitive environment from the quantities sold and a background signal. As the information of the market outcomes increases with the price differential, there is scope for active learning. In a setting with linear demand curves, we derive firms' pricing strategies as payoff-symmetric mixed or correlated Markov perfect equilibria of a stochastic differential game where the common posterior belief is the natural state variable. When information has low value, firms charge the same price as would be set by myopic players, and there is no price dispersion. When firms value information more highly, on the other hand, they actively learn by creating price dispersion. This market experimentation is transient, and most likely to be observed when the firms' environment changes sufficiently often, but not too frequently.Duopoly experimentation, Bayesian learning, stochastic differential game, Markov-perfect equilibrium, mixed strategies, correlated equilibrium.

    Strategic Learning in Teams

    Get PDF
    This paper analyzes a two-player game of strategic experimentation with three-armed exponential bandits in continuous time. Players face replica bandits, with one arm that is safe in that it generates a known payoff, whereas the likelihood of the risky arms’ yielding a positive payoff is initially unknown. It is common knowledge that the types of the two risky arms are perfectly negatively correlated. I show that the efficient policy is incentive-compatible if, and only if, the stakes are high enough. Moreover, learning will be complete in any Markov perfect equilibrium with continuous value functions if, and only if, the stakes exceed a certain threshold

    Market Experimentation in a Dynamic Differentiated-Goods Duopoly

    Get PDF
    We study the evolution of prices in a symmetric duopoly where firms are uncertain about the degree of product differentiation. Customers sometimes perceive the products as close substitutes, sometimes as highly differentiated. Firms learn about their competitive environment from the quantities sold and a background signal. As the informativeness of the market outcome increases with the price differential, there is scope for active learning. In a setting with linear demand curves, we derive firms' pricing strategies as payoff-symmetric mixed or correlated Markov perfect equilibria of a stochastic differential game where the common posterior belief is the natural state variable. When information has low value, firms charge the same price as would be set by myopic players, and there is no price dispersion. When firms value information more highly, on the other hand, they actively learn by creating price dispersion. This market experimentation is transient, and most likely to be observed when the firms' environment changes sufficiently often, but not too frequently.Duopoly Experimentation, Bayesian Learning, Stochastic Differential Game, Markov Perfect Equilibrium, Mixed Strategies, Correlated Equilibrium

    Breakdowns

    Get PDF
    We study a continuous-time game of strategic experimentation in which the players try to assess the failure rate of some new equipment or technology. Breakdowns occur at the jump times of a Poisson process whose unknown intensity is either high or low. In marked contrast to existing models, we find that the cooperative value function does not exhibit smooth pasting at the efficient cut-off belief. This finding extends to the boundaries between continuation and stopping regions in Markov perfect equilibria. We characterize the unique symmetric equilibrium, construct a class of asymmetric equilibria, and elucidate the impact of bad versus good Poisson news on equilibrium outcomes

    Strategic Experimentation with Exponential Bandits

    Get PDF
    This paper studies a game of strategic experimentation with two-armed bandits whose risky arm might yield a payoff only after some exponentially distributed random time. Because of free-riding, there is an inefficiently low level of experimentation in any equilibrium where the players use stationary Markovian strategies with posterior beliefs as the state variable. After characterizing the unique symmetric Markovian equilibrium of the game, which is in mixed strategies, we construct a variety of pure-strategy equilibria. There is no equilibrium where all players use simple cut-off strategies. Equilibria where players switch finitely often between the roles of experimenter and free-rider all lead to the same pattern of information acquisition; the efficiency of these equilibria depends on the way players share the burden of experimentation among them. In equilibria where players switch roles infinitely often, they can acquire an approximately efficient amount of information, but the rate at which it is acquired still remains inefficient; moreover, the expected payoff of an experimenter exhibits the novel feature that it rises as players become more pessimistic. Finally, over the range of beliefs where players use both arms a positive fraction of the time, the symmetric equilibrium is dominated by any asymmetric one in terms of aggregate payoffs

    Strategic Experimentation with Poisson Bandits

    Get PDF
    We study a game of strategic experimentation with two-armed bandits where the risky arm distributes lump-sum payoffs according to a Poisson process. Its intensity is either high or low, and unknown to the players. We consider Markov perfect equilibria with beliefs as the state variable. As the belief process is piecewise deterministic, payoff functions solve differential-difference equations. There is no equilibrium where all players use cut-off strategies, and all equilibria exhibit an `encouragement effect' relative to the single-agent optimum. We construct asymmetric equilibria in which players have symmetric continuation values at sufficiently optimistic beliefs yet take turns playing the risky arm before all experimentation stops. Owing to the encouragement effect, these equilibria Pareto dominate the unique symmetric one for sufficiently frequent turns. Rewarding the last experimenter with a higher continuation value increases the range of beliefs where players experiment, but may reduce average payoffs at more optimistic beliefs. Some equilibria exhibit an `anticipation effect': as beliefs become more pessimistic, the continuation value of a single experimenter increases over some range because a lower belief means a shorter wait until another player takes over

    Negatively Correlated Bandits

    Get PDF
    We analyze a two-player game of strategic experimentation with two-armed bandits. Each player has to decide in continuous time whether to use a safe arm with a known payoff or a risky arm whose likelihood of delivering payoffs is initially unknown. The quality of the risky arms is perfectly negatively correlated between players. In marked contrast to the case where both risky arms are of the same type, we find that learning will be complete in any Markov perfect equilibrium if the stakes exceed a certain threshold, and that all equilibria are in cutoff strategies. For low stakes, the equilibrium is unique, symmetric, and coincides with the planner's solution. For high stakes, the equilibrium is unique, symmetric, and tantamount to myopic behavior. For intermediate stakes, there is a continuum of equilibria
    • …
    corecore