736 research outputs found

    Nonzero-sum Stochastic Games

    This paper treats stochastic games. We focus on nonzero-sum games and provide a detailed survey of selected recent results. In Section 1, we consider stochastic Markov games. A correlation of the players' strategies, involving "public signals", is described, and a correlated equilibrium theorem proved recently by Nowak and Raghavan for discounted stochastic games with general state space is presented. We also report an extension of this result to a class of undiscounted stochastic games satisfying a uniform ergodicity condition. Stopping games are related to stochastic Markov games. In Section 2, we describe a version of Dynkin's game related to the observation of a Markov process with a random mechanism assigning states to the players. Some recent contributions of the second author in this area are reported. The paper also contains a brief overview of the theory of nonzero-sum stochastic games and stopping games, which is far from complete.
    Keywords: average payoff stochastic games, correlated stationary equilibria, nonzero-sum games, stopping time, stopping games

    Noisy Stochastic Games

    This paper establishes the existence of a stationary Markov perfect equilibrium in general stochastic games with noise, a component of the state that is nonatomically distributed and not directly affected by the previous period's state and actions. Noise may be simply a payoff-irrelevant public randomization device, delivering known results on the existence of correlated equilibrium as a special case. More generally, noise can take the form of shocks that enter into players' stage payoffs and the transition probability on states. The existence result is applied to a model of industry dynamics and to a model of dynamic partisan electoral competition.

    Nonapproximability Results for Partially Observable Markov Decision Processes

    We show that, for several variations of partially observable Markov decision processes, polynomial-time algorithms for finding control policies either are unlikely to have, or simply do not have, guarantees of finding policies within a constant factor or a constant summand of optimal. Here "unlikely" means "unless some complexity classes collapse," where the collapses considered are P=NP, P=PSPACE, or P=EXP. Until or unless these collapses are shown to hold, any control-policy designer must choose between such performance guarantees and efficient computation.