679 research outputs found
Approximating n-player behavioural strategy nash equilibria using coevolution
Coevolutionary algorithms are plagued with a set of problems related to intransitivity that make it questionable what the end product of a coevolutionary run can achieve. With the introduction of solution concepts into coevolution, part of the issue was alleviated, however efficiently representing and achieving game theoretic solution concepts is still not a trivial task. In this paper we propose a coevolutionary algorithm that approximates behavioural strategy Nash equilibria in n-player zero sum games, by exploiting the minimax solution concept. In order to support our case we provide a set of experiments in both games of known and unknown equilibria. In the case of known equilibria, we can confirm our algorithm converges to the known solution, while in the case of unknown equilibria we can see a steady progress towards Nash. Copyright 2011 ACM
Can game theory be saved?
Game-theoretic analysis is a well-established part of the toolkit of economic analysis. In crucial respects, however, game theory has failed to deliver on its original promise of generating sharp predictions of behavior in situations where neoclassical microeconomics has little to say. Experience has shown that in most situations, it is possible to tell a game-theoretic story to fit almost any possible outcome. We argue that, in general, any individually rational outcome of an economic interaction may be supported as the Nash equilibrium of an appropriately chosen game, and that a wide range of these outcomes will have an economically reasonable interpretation. We consider possible attempts to salvage the original objectives of the game-theoretic research program. In at least some cases, information on institutional structures and observations of interactions between agents can be used to limit the set of strategies that may be considered reasonable.game theory, equilibrium
A Generalised Method for Empirical Game Theoretic Analysis
This paper provides theoretical bounds for empirical game theoretical
analysis of complex multi-agent interactions. We provide insights in the
empirical meta game showing that a Nash equilibrium of the meta-game is an
approximate Nash equilibrium of the true underlying game. We investigate and
show how many data samples are required to obtain a close enough approximation
of the underlying game. Additionally, we extend the meta-game analysis
methodology to asymmetric games. The state-of-the-art has only considered
empirical games in which agents have access to the same strategy sets and the
payoff structure is symmetric, implying that agents are interchangeable.
Finally, we carry out an empirical illustration of the generalised method in
several domains, illustrating the theory and evolutionary dynamics of several
versions of the AlphaGo algorithm (symmetric), the dynamics of the Colonel
Blotto game played by human players on Facebook (symmetric), and an example of
a meta-game in Leduc Poker (asymmetric), generated by the PSRO multi-agent
learning algorithm.Comment: will appear at AAMAS'1
Bounds and dynamics for empirical game theoretic analysis
This paper provides several theoretical results for empirical game theory. Specifically, we introduce bounds for empirical game theoretical analysis of complex multi-agent interactions. In doing so we provide insights in the empirical meta game showing that a Nash equilibrium of the estimated meta-game is an approximate Nash equilibrium of the true underlying meta-game. We investigate and show how many data samples are required to obtain a close enough approximation of the underlying game. Additionally, we extend the evolutionary dynamics analysis of meta-games using heuristic payoff tables (HPTs) to asymmetric games. The state-of-the-art has only considered evolutionary dynamics of symmetric HPTs in which agents have access to the same strategy sets and the payoff structure is symmetric, implying that agents are interchangeable. Finally, we carry out an empirical illustration of the generalised method in several domains, illustrating the theory and evolutionary dynamics of several versions of the AlphaGo algorithm (symmetric), the dynamics of the Colonel Blotto game played by human players on Facebook (symmetric), the dynamics of several teams of players in the capture the flag game (symmetric), and an example of a meta-game in Leduc Poker (asymmetric), generated by the policy-space response oracle multi-agent learning algorithm
A comparative study of game theoretic and evolutionary models for software agents
Most of the existing work in the study of bargaining behaviour uses techniques from game theory. Game theoretic models for bargaining assume that players are perfectly rational and that this rationality in common knowledge. However, the perfect rationality assumption does not hold for real-life bargaining scenarios with humans as players, since results from experimental economics show that humans find their way to the best strategy through trial and error, and not typically by means of rational deliberation. Such players are said to be boundedly rational. In playing a game against an opponent with bounded rationality, the most effective strategy of a player is not the equilibrium strategy but the one that is the best reply to the opponent's strategy. The evolutionary model provides a means for studying the bargaining behaviour of boundedly rational players. This paper provides a comprehensive comparison of the game theoretic and evolutionary approaches to bargaining by examining their assumptions, goals, and limitations. We then study the implications of these differences from the perspective of the software agent developer
A Generalized Training Approach for Multiagent Learning
This paper investigates a population-based training regime based on
game-theoretic principles called Policy-Spaced Response Oracles (PSRO). PSRO is
general in the sense that it (1) encompasses well-known algorithms such as
fictitious play and double oracle as special cases, and (2) in principle
applies to general-sum, many-player games. Despite this, prior studies of PSRO
have been focused on two-player zero-sum games, a regime wherein Nash
equilibria are tractably computable. In moving from two-player zero-sum games
to more general settings, computation of Nash equilibria quickly becomes
infeasible. Here, we extend the theoretical underpinnings of PSRO by
considering an alternative solution concept, -Rank, which is unique
(thus faces no equilibrium selection issues, unlike Nash) and applies readily
to general-sum, many-player settings. We establish convergence guarantees in
several games classes, and identify links between Nash equilibria and
-Rank. We demonstrate the competitive performance of
-Rank-based PSRO against an exact Nash solver-based PSRO in 2-player
Kuhn and Leduc Poker. We then go beyond the reach of prior PSRO applications by
considering 3- to 5-player poker games, yielding instances where -Rank
achieves faster convergence than approximate Nash solvers, thus establishing it
as a favorable general games solver. We also carry out an initial empirical
validation in MuJoCo soccer, illustrating the feasibility of the proposed
approach in another complex domain
- …