246 research outputs found

    Mean-field approximation of stochastic population processes in games

    Get PDF
    We here establish an upper bound on the probability for deviations of a Markov population process from its mean-field approximation.

    Consistency of vanishing smooth fictitious play

    Full text link
    We discuss consistency of Vanishing Smooth Fictitious Play, a strategy in the context of game theory, which can be regarded as a smooth fictitious play procedure, where the smoothing parameter is time-dependent and asymptotically vanishes. This answers a question initially raised by Drew Fudenberg and Satoru Takahashi.Comment: 17 page

    Convergence analysis of Adaptive Biasing Potential methods for diffusion processes

    Full text link
    This article is concerned with the mathematical analysis of a family of adaptive importance sampling algorithms applied to diffusion processes. These methods, referred to as Adaptive Biasing Potential methods, are designed to efficiently sample the invariant distribution of the diffusion process, thanks to the approximation of the associated free energy function (relative to a reaction coordinate). The bias which is introduced in the dynamics is computed adaptively; it depends on the past of the trajectory of the process through some time-averages. We give a detailed and general construction of such methods. We prove the consistency of the approach (almost sure convergence of well-chosen weighted empirical probability distribution). We justify the efficiency thanks to several qualitative and quantitative additional arguments. To prove these results , we revisit and extend tools from stochastic approximation applied to self-interacting diffusions, in an original context

    Ergodicity of inhomogeneous Markov chains through asymptotic pseudotrajectories

    Get PDF
    In this work, we consider an inhomogeneous (discrete time) Markov chain and are interested in its long time behavior. We provide sufficient conditions to ensure that some of its asymptotic properties can be related to the ones of a homogeneous (continuous time) Markov process. Renowned examples such as a bandit algorithms, weighted random walks or decreasing step Euler schemes are included in our framework. Our results are related to functional limit theorems, but the approach differs from the standard "Tightness/Identification" argument; our method is unified and based on the notion of pseudotrajectories on the space of probability measures

    Strongly Vertex-Reinforced-Random-Walk on the complete graph

    Full text link
    We study Vertex-Reinforced-Random-Walk on the complete graph with weights of the form w(n)=nαw(n)=n^\alpha, with α>1\alpha>1. Unlike for the Edge-Reinforced-Random-Walk, which in this case localizes a.s. on 2 sites, here we observe various phase transitions, and in particular localization on arbitrary large sets is possible, provided α\alpha is close enough to 1. Our proof relies on stochastic approximation techniques. At the end of the paper, we also prove a general result ensuring that any strongly reinforced VRRW on any bounded degree graph localizes a.s. on a finite subgraph.Comment: 19 p

    Smale Strategies for Network Prisoner's Dilemma Games

    Full text link
    Smale's approach \cite{Smale80} to the classical two-players repeated Prisoner's Dilemma game is revisited here for NN-players and Network games in the framework of Blackwell's approachability, stochastic approximations and differential inclusions

    Stochastic Approximations and Differential Inclusions

    Get PDF
    L'approche en termes de systèmes dynamiques de l'approximation stochastique est étendue au cas ou l'équation différentielle moyenne est remplacée par une inclusion différentielle. Le théorème de Benaim et Hirsch sur l'ensemble limite est étendu a ce cas. On étudie en détail les ensembles ICT et les attracteurs. On donne des applications a des questions de théorie des jeux, en particulier concernant le théorème d'approchabilite de Blackwell et la convergence de "fictitious play".Approximation stochastique;Système dynamique multivalue
    corecore