Search CORE

2,879 research outputs found

No-regret Dynamics and Fictitious Play

Author: Andriy Zapechelnyuk
Basu
Benaïm
Benaïm
Berger
Berger
Blackwell
Brown
Cesa-Bianchi
Cesa-Bianchi
Clemen
Faure
Foster
Foster
Freund
Fudenberg
Gaunersdorfer
Gilboa
Hannan
Hart
Hart
Hart
Hofbauer
Hofbauer
Hofbauer
Hofbauer
Irani
Krishna
Larrick
Lehrer
Littlestone
Mansour
Matsui
Monderer
Moulin
Selten
Shapley
Sparrow
Timmermann
Tirole
Yannick Viossat
Yanovskaya
Young
Young
Publication venue: 'Elsevier BV'
Publication date: 01/01/2012
Field of study

Potential based no-regret dynamics are shown to be related to fictitious play. Roughly, these are epsilon-best reply dynamics where epsilon is the maximal regret, which vanishes with time. This allows for alternative and sometimes much shorter proofs of known results on convergence of no-regret dynamics to the set of Nash equilibria

arXiv.org e-Print Archive

CiteSeerX

Base de publications de l'université Paris-Dauphine

Crossref

Enlighten

University of St. Andrews - Pure

Rationalizability, Learning, and Equilibrium in Games with Strategic Complements

Author: John Roberts
Paul Milgrom
Publication venue
Publication date
Field of study

Research Papers in Economics

Axiomatic Approach to Solutions of Games

Author: Babichenko Yakov
Publication venue
Publication date: 20/02/2014
Field of study

We consider solutions of normal form games that are invariant under strategic equivalence. We consider additional properties that can be expected (or be desired) from a solution of a game, and we observe the following: - Even the weakest notion of individual rationality restricts the set of solutions to be equilibria. This observation holds for all types of solutions: in pure-strategies, in mixed strategies, and in correlated strategies where the corresponding notions of equilibria are pure-Nash, Nash and coarse-correlated. An action profile is (strict) simultaneous maximizer if it simultaneously globally (strictly) maximizes the payoffs of all players. - If we require that a simultaneous maximizer (if it exists) will be a solution, then the solution contains the set of pure Nash equilibria. - There is no solution for which a strict simultaneous maximizer (if it exists) is the unique solution

arXiv.org e-Print Archive

CiteSeerX

Cycles in adversarial regularized learning

Author: Mertikopoulos Panayotis
Papadimitriou Christos
Piliouras Georgios
Publication venue
Publication date: 08/09/2017
Field of study

Regularized learning is a fundamental technique in online optimization, machine learning and many other fields of computer science. A natural question that arises in these settings is how regularized learning algorithms behave when faced against each other. We study a natural formulation of this problem by coupling regularized learning dynamics in zero-sum games. We show that the system's behavior is Poincar\'e recurrent, implying that almost every trajectory revisits any (arbitrarily small) neighborhood of its starting point infinitely often. This cycling behavior is robust to the agents' choice of regularization mechanism (each agent could be using a different regularizer), to positive-affine transformations of the agents' utilities, and it also persists in the case of networked competition, i.e., for zero-sum polymatrix games.Comment: 22 pages, 4 figure

arXiv.org e-Print Archive

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

Learning Equilibria with Partial Information in Decentralized Wireless Networks

Author: Debbah Mérouane
Lasaulce Samson
Perlaza Samir M.
Rose Luca
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 14/06/2011
Field of study

In this article, a survey of several important equilibrium concepts for decentralized networks is presented. The term decentralized is used here to refer to scenarios where decisions (e.g., choosing a power allocation policy) are taken autonomously by devices interacting with each other (e.g., through mutual interference). The iterative long-term interaction is characterized by stable points of the wireless network called equilibria. The interest in these equilibria stems from the relevance of network stability and the fact that they can be achieved by letting radio devices to repeatedly interact over time. To achieve these equilibria, several learning techniques, namely, the best response dynamics, fictitious play, smoothed fictitious play, reinforcement learning algorithms, and regret matching, are discussed in terms of information requirements and convergence properties. Most of the notions introduced here, for both equilibria and learning schemes, are illustrated by a simple case study, namely, an interference channel with two transmitter-receiver pairs.Comment: 16 pages, 5 figures, 1 table. To appear in IEEE Communication Magazine, special Issue on Game Theor

arXiv.org e-Print Archive

Iterated Strict Dominance in General Games

Author: Ngo Van Long
Xiao Luo
Yi-Chun Chen
Publication venue
Publication date
Field of study

We offer a definition of iterated elimination of strictly dominated strategies (IESDS) for games with (in)finite players, (non)compact strategy sets, and (dis)continuous payoff functions. IESDS is always a well-defined order independent procedure that can be used to solve Nash equilibrium in dominance-solvable games. We characterize IESDS by means of a "stability" criterion, and offer a sufficient and necessary epistemic condition for IESDS. We show by an example that IESDS may generate spurious Nash equilibria in the class of Reny's better-reply secure games. We provide sufficient/necessary conditions under which IESDS preserves the set of Nash equilibria. Nous donnons une définition de l’élimination itérative des stratégies qui sont strictement donimées (EISSD) pour les jeux avec un nombre fini (ou infini) de joueurs , des ensembles de stratégies compactes (ou non-compactes), et des fonctions de gains continues (ou non-continues). Le processus EISSD est bien défini et indépendant de l’ordre d’élimination. Nous donnons une caractérisation du processus EISSD en utilisant un critère de stabilité et offrons une condition épistémologique. Nous démontrons que le processus EISSD peut produire des équilibres faux dans la classe des jeux de meilleures réponses sécuritaires de Reny. Nous donnons des conditions nécessaires et suffisantes pour que le processus EISSD conserve l’ensemble des équilibre de Nash.game theory, strict dominance, iterated elimination, Nash equilibrium, Reny's better-reply secure games., théorie des jeux, dominance stricte, élimination itérative, équilibre de Nash, jeux de meilleures réponses sécuritaires de Reny

Research Papers in Economics