2,879 research outputs found
No-regret Dynamics and Fictitious Play
Potential based no-regret dynamics are shown to be related to fictitious
play. Roughly, these are epsilon-best reply dynamics where epsilon is the
maximal regret, which vanishes with time. This allows for alternative and
sometimes much shorter proofs of known results on convergence of no-regret
dynamics to the set of Nash equilibria
Axiomatic Approach to Solutions of Games
We consider solutions of normal form games that are invariant under strategic
equivalence. We consider additional properties that can be expected (or be
desired) from a solution of a game, and we observe the following:
- Even the weakest notion of individual rationality restricts the set of
solutions to be equilibria. This observation holds for all types of solutions:
in pure-strategies, in mixed strategies, and in correlated strategies where the
corresponding notions of equilibria are pure-Nash, Nash and coarse-correlated.
An action profile is (strict) simultaneous maximizer if it simultaneously
globally (strictly) maximizes the payoffs of all players.
- If we require that a simultaneous maximizer (if it exists) will be a
solution, then the solution contains the set of pure Nash equilibria.
- There is no solution for which a strict simultaneous maximizer (if it
exists) is the unique solution
Cycles in adversarial regularized learning
Regularized learning is a fundamental technique in online optimization,
machine learning and many other fields of computer science. A natural question
that arises in these settings is how regularized learning algorithms behave
when faced against each other. We study a natural formulation of this problem
by coupling regularized learning dynamics in zero-sum games. We show that the
system's behavior is Poincar\'e recurrent, implying that almost every
trajectory revisits any (arbitrarily small) neighborhood of its starting point
infinitely often. This cycling behavior is robust to the agents' choice of
regularization mechanism (each agent could be using a different regularizer),
to positive-affine transformations of the agents' utilities, and it also
persists in the case of networked competition, i.e., for zero-sum polymatrix
games.Comment: 22 pages, 4 figure
Learning Equilibria with Partial Information in Decentralized Wireless Networks
In this article, a survey of several important equilibrium concepts for
decentralized networks is presented. The term decentralized is used here to
refer to scenarios where decisions (e.g., choosing a power allocation policy)
are taken autonomously by devices interacting with each other (e.g., through
mutual interference). The iterative long-term interaction is characterized by
stable points of the wireless network called equilibria. The interest in these
equilibria stems from the relevance of network stability and the fact that they
can be achieved by letting radio devices to repeatedly interact over time. To
achieve these equilibria, several learning techniques, namely, the best
response dynamics, fictitious play, smoothed fictitious play, reinforcement
learning algorithms, and regret matching, are discussed in terms of information
requirements and convergence properties. Most of the notions introduced here,
for both equilibria and learning schemes, are illustrated by a simple case
study, namely, an interference channel with two transmitter-receiver pairs.Comment: 16 pages, 5 figures, 1 table. To appear in IEEE Communication
Magazine, special Issue on Game Theor
Iterated Strict Dominance in General Games
We offer a definition of iterated elimination of strictly dominated strategies (IESDS) for games with (in)finite players, (non)compact strategy sets, and (dis)continuous payoff functions. IESDS is always a well-defined order independent procedure that can be used to solve Nash equilibrium in dominance-solvable games. We characterize IESDS by means of a "stability" criterion, and offer a sufficient and necessary epistemic condition for IESDS. We show by an example that IESDS may generate spurious Nash equilibria in the class of Reny's better-reply secure games. We provide sufficient/necessary conditions under which IESDS preserves the set of Nash equilibria. Nous donnons une définition de l’élimination itérative des stratégies qui sont strictement donimées (EISSD) pour les jeux avec un nombre fini (ou infini) de joueurs , des ensembles de stratégies compactes (ou non-compactes), et des fonctions de gains continues (ou non-continues). Le processus EISSD est bien défini et indépendant de l’ordre d’élimination. Nous donnons une caractérisation du processus EISSD en utilisant un critère de stabilité et offrons une condition épistémologique. Nous démontrons que le processus EISSD peut produire des équilibres faux dans la classe des jeux de meilleures réponses sécuritaires de Reny. Nous donnons des conditions nécessaires et suffisantes pour que le processus EISSD conserve l’ensemble des équilibre de Nash.game theory, strict dominance, iterated elimination, Nash equilibrium, Reny's better-reply secure games., théorie des jeux, dominance stricte, élimination itérative, équilibre de Nash, jeux de meilleures réponses sécuritaires de Reny
- …