2,879 research outputs found

    No-regret Dynamics and Fictitious Play

    Full text link
    Potential based no-regret dynamics are shown to be related to fictitious play. Roughly, these are epsilon-best reply dynamics where epsilon is the maximal regret, which vanishes with time. This allows for alternative and sometimes much shorter proofs of known results on convergence of no-regret dynamics to the set of Nash equilibria

    Axiomatic Approach to Solutions of Games

    Full text link
    We consider solutions of normal form games that are invariant under strategic equivalence. We consider additional properties that can be expected (or be desired) from a solution of a game, and we observe the following: - Even the weakest notion of individual rationality restricts the set of solutions to be equilibria. This observation holds for all types of solutions: in pure-strategies, in mixed strategies, and in correlated strategies where the corresponding notions of equilibria are pure-Nash, Nash and coarse-correlated. An action profile is (strict) simultaneous maximizer if it simultaneously globally (strictly) maximizes the payoffs of all players. - If we require that a simultaneous maximizer (if it exists) will be a solution, then the solution contains the set of pure Nash equilibria. - There is no solution for which a strict simultaneous maximizer (if it exists) is the unique solution

    Cycles in adversarial regularized learning

    Get PDF
    Regularized learning is a fundamental technique in online optimization, machine learning and many other fields of computer science. A natural question that arises in these settings is how regularized learning algorithms behave when faced against each other. We study a natural formulation of this problem by coupling regularized learning dynamics in zero-sum games. We show that the system's behavior is Poincar\'e recurrent, implying that almost every trajectory revisits any (arbitrarily small) neighborhood of its starting point infinitely often. This cycling behavior is robust to the agents' choice of regularization mechanism (each agent could be using a different regularizer), to positive-affine transformations of the agents' utilities, and it also persists in the case of networked competition, i.e., for zero-sum polymatrix games.Comment: 22 pages, 4 figure

    Learning Equilibria with Partial Information in Decentralized Wireless Networks

    Full text link
    In this article, a survey of several important equilibrium concepts for decentralized networks is presented. The term decentralized is used here to refer to scenarios where decisions (e.g., choosing a power allocation policy) are taken autonomously by devices interacting with each other (e.g., through mutual interference). The iterative long-term interaction is characterized by stable points of the wireless network called equilibria. The interest in these equilibria stems from the relevance of network stability and the fact that they can be achieved by letting radio devices to repeatedly interact over time. To achieve these equilibria, several learning techniques, namely, the best response dynamics, fictitious play, smoothed fictitious play, reinforcement learning algorithms, and regret matching, are discussed in terms of information requirements and convergence properties. Most of the notions introduced here, for both equilibria and learning schemes, are illustrated by a simple case study, namely, an interference channel with two transmitter-receiver pairs.Comment: 16 pages, 5 figures, 1 table. To appear in IEEE Communication Magazine, special Issue on Game Theor

    Iterated Strict Dominance in General Games

    Get PDF
    We offer a definition of iterated elimination of strictly dominated strategies (IESDS) for games with (in)finite players, (non)compact strategy sets, and (dis)continuous payoff functions. IESDS is always a well-defined order independent procedure that can be used to solve Nash equilibrium in dominance-solvable games. We characterize IESDS by means of a "stability" criterion, and offer a sufficient and necessary epistemic condition for IESDS. We show by an example that IESDS may generate spurious Nash equilibria in the class of Reny's better-reply secure games. We provide sufficient/necessary conditions under which IESDS preserves the set of Nash equilibria. Nous donnons une définition de l’élimination itérative des stratégies qui sont strictement donimées (EISSD) pour les jeux avec un nombre fini (ou infini) de joueurs , des ensembles de stratégies compactes (ou non-compactes), et des fonctions de gains continues (ou non-continues). Le processus EISSD est bien défini et indépendant de l’ordre d’élimination. Nous donnons une caractérisation du processus EISSD en utilisant un critère de stabilité et offrons une condition épistémologique. Nous démontrons que le processus EISSD peut produire des équilibres faux dans la classe des jeux de meilleures réponses sécuritaires de Reny. Nous donnons des conditions nécessaires et suffisantes pour que le processus EISSD conserve l’ensemble des équilibre de Nash.game theory, strict dominance, iterated elimination, Nash equilibrium, Reny's better-reply secure games., théorie des jeux, dominance stricte, élimination itérative, équilibre de Nash, jeux de meilleures réponses sécuritaires de Reny
    • …
    corecore