Search CORE

679 research outputs found

Approximating n-player behavioural strategy nash equilibria using coevolution

Author: Lucas Simon
Samothrakis Spyridon
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2011
Field of study

Coevolutionary algorithms are plagued with a set of problems related to intransitivity that make it questionable what the end product of a coevolutionary run can achieve. With the introduction of solution concepts into coevolution, part of the issue was alleviated, however efficiently representing and achieving game theoretic solution concepts is still not a trivial task. In this paper we propose a coevolutionary algorithm that approximates behavioural strategy Nash equilibria in n-player zero sum games, by exploiting the minimax solution concept. In order to support our case we provide a set of experiments in both games of known and unknown equilibria. In the case of known equilibria, we can confirm our algorithm converges to the known solution, while in the case of unknown equilibria we can see a steady progress towards Nash. Copyright 2011 ACM

University of Essex Research Repository

Crossref

Can game theory be saved?

Author: Flavio Menezes
John Quiggin
Publication venue
Publication date
Field of study

Game-theoretic analysis is a well-established part of the toolkit of economic analysis. In crucial respects, however, game theory has failed to deliver on its original promise of generating sharp predictions of behavior in situations where neoclassical microeconomics has little to say. Experience has shown that in most situations, it is possible to tell a game-theoretic story to fit almost any possible outcome. We argue that, in general, any individually rational outcome of an economic interaction may be supported as the Nash equilibrium of an appropriately chosen game, and that a wide range of these outcomes will have an economically reasonable interpretation. We consider possible attempts to salvage the original objectives of the game-theoretic research program. In at least some cases, information on institutional structures and observations of interactions between agents can be used to limit the set of strategies that may be considered reasonable.game theory, equilibrium

Research Papers in Economics

A Generalised Method for Empirical Game Theoretic Analysis

Author: Graepel Thore
Lanctot Marc
Leibo Joel Z
Perolat Julien
Tuyls Karl
Publication venue
Publication date: 16/03/2018
Field of study

This paper provides theoretical bounds for empirical game theoretical analysis of complex multi-agent interactions. We provide insights in the empirical meta game showing that a Nash equilibrium of the meta-game is an approximate Nash equilibrium of the true underlying game. We investigate and show how many data samples are required to obtain a close enough approximation of the underlying game. Additionally, we extend the meta-game analysis methodology to asymmetric games. The state-of-the-art has only considered empirical games in which agents have access to the same strategy sets and the payoff structure is symmetric, implying that agents are interchangeable. Finally, we carry out an empirical illustration of the generalised method in several domains, illustrating the theory and evolutionary dynamics of several versions of the AlphaGo algorithm (symmetric), the dynamics of the Colonel Blotto game played by human players on Facebook (symmetric), and an example of a meta-game in Leduc Poker (asymmetric), generated by the PSRO multi-agent learning algorithm.Comment: will appear at AAMAS'1

arXiv.org e-Print Archive

UCL Discovery

Bounds and dynamics for empirical game theoretic analysis

Author: Everett R
Graepel T
Hughes E
Lanctot M
Leibo JZ
Pérolat J
Szepesvári C
Tuyls K
Publication venue
Publication date: 04/12/2019
Field of study

This paper provides several theoretical results for empirical game theory. Specifically, we introduce bounds for empirical game theoretical analysis of complex multi-agent interactions. In doing so we provide insights in the empirical meta game showing that a Nash equilibrium of the estimated meta-game is an approximate Nash equilibrium of the true underlying meta-game. We investigate and show how many data samples are required to obtain a close enough approximation of the underlying game. Additionally, we extend the evolutionary dynamics analysis of meta-games using heuristic payoff tables (HPTs) to asymmetric games. The state-of-the-art has only considered evolutionary dynamics of symmetric HPTs in which agents have access to the same strategy sets and the payoff structure is symmetric, implying that agents are interchangeable. Finally, we carry out an empirical illustration of the generalised method in several domains, illustrating the theory and evolutionary dynamics of several versions of the AlphaGo algorithm (symmetric), the dynamics of the Colonel Blotto game played by human players on Facebook (symmetric), the dynamics of several teams of players in the capture the flag game (symmetric), and an example of a meta-game in Leduc Poker (asymmetric), generated by the policy-space response oracle multi-agent learning algorithm

UCL Discovery

A comparative study of game theoretic and evolutionary models for software agents

Author: Fatima S.
Jennings N.R.
Wooldridge M.
Publication venue
Publication date: 01/04/2004
Field of study

Most of the existing work in the study of bargaining behaviour uses techniques from game theory. Game theoretic models for bargaining assume that players are perfectly rational and that this rationality in common knowledge. However, the perfect rationality assumption does not hold for real-life bargaining scenarios with humans as players, since results from experimental economics show that humans find their way to the best strategy through trial and error, and not typically by means of rational deliberation. Such players are said to be boundedly rational. In playing a game against an opponent with bounded rationality, the most effective strategy of a player is not the equilibrium strategy but the one that is the best reply to the opponent's strategy. The evolutionary model provides a means for studying the bargaining behaviour of boundedly rational players. This paper provides a comprehensive comparison of the game theoretic and evolutionary approaches to bargaining by examining their assumptions, goals, and limitations. We then study the implications of these differences from the perspective of the software agent developer

CiteSeerX

Southampton (e-Prints Soton)

Spiral - Imperial College Digital Repository

A Generalized Training Approach for Multiagent Learning

Author: Graepel Thore
Heess Nicolas
Hennes Daniel
Hughes Edward
Lanctot Marc
Lever Guy
Liu Siqi
Marris Luke
Muller Paul
Munos Remi
Omidshafiei Shayegan
Perolat Julien
Rowland Mark
Tuyls Karl
Wang Zhe
Publication venue
Publication date: 01/01/2020
Field of study

This paper investigates a population-based training regime based on game-theoretic principles called Policy-Spaced Response Oracles (PSRO). PSRO is general in the sense that it (1) encompasses well-known algorithms such as fictitious play and double oracle as special cases, and (2) in principle applies to general-sum, many-player games. Despite this, prior studies of PSRO have been focused on two-player zero-sum games, a regime wherein Nash equilibria are tractably computable. In moving from two-player zero-sum games to more general settings, computation of Nash equilibria quickly becomes infeasible. Here, we extend the theoretical underpinnings of PSRO by considering an alternative solution concept,

\alpha

-Rank, which is unique (thus faces no equilibrium selection issues, unlike Nash) and applies readily to general-sum, many-player settings. We establish convergence guarantees in several games classes, and identify links between Nash equilibria and

\alpha

-Rank. We demonstrate the competitive performance of

\alpha

-Rank-based PSRO against an exact Nash solver-based PSRO in 2-player Kuhn and Leduc Poker. We then go beyond the reach of prior PSRO applications by considering 3- to 5-player poker games, yielding instances where

\alpha

-Rank achieves faster convergence than approximate Nash solvers, thus establishing it as a favorable general games solver. We also carry out an initial empirical validation in MuJoCo soccer, illustrating the feasibility of the proposed approach in another complex domain

arXiv.org e-Print Archive

UCL Discovery