Multiagent Learning in Large Anonymous Games
In large systems, it is important for agents to learn to act effectively, but
sophisticated multi-agent learning algorithms generally do not scale. An
alternative approach is to find restricted classes of games where simple,
efficient algorithms converge. It is shown that stage learning efficiently
converges to Nash equilibria in large anonymous games if best-reply dynamics
converge. Two features are identified that improve convergence. First, rather
than making learning more difficult, more agents are actually beneficial in
many settings. Second, providing agents with statistical information about the
behavior of others can significantly reduce the number of observations needed.
Comment: 8 pages, 2 figures. To appear in Proceedings of the Eighth International Conference on Autonomous Agents and Multiagent Systems.
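To make the setting concrete, here is a minimal sketch of best-reply dynamics in an anonymous game, where each agent's payoff depends only on its own action and the fraction of agents taking each action. The congestion-style payoff and the inertia constant are illustrative assumptions, not the paper's stage-learning algorithm; the sketch only shows why aggregate statistics suffice and why a larger population smooths the dynamics.

```python
import random
from collections import Counter

# Minimal sketch of best-reply dynamics in an anonymous game (illustrative
# assumptions: a congestion-style payoff and an inertia constant; this is
# NOT the paper's stage-learning algorithm).

N_AGENTS = 1000      # larger populations give smoother empirical fractions
ACTIONS = (0, 1, 2)  # three anonymous actions, e.g. routes
INERTIA = 0.9        # probability of keeping the current action each round

def payoff(action, fractions):
    """Anonymity: payoff depends only on own action and the fraction of
    agents taking each action, never on who takes it."""
    return -fractions[action]  # less-crowded actions pay more

profile = [random.choice(ACTIONS) for _ in range(N_AGENTS)]
for _ in range(100):
    counts = Counter(profile)
    fractions = {a: counts[a] / N_AGENTS for a in ACTIONS}
    # Agents only need these aggregate statistics about others' behavior,
    # the kind of information the abstract says cuts observations needed.
    br = max(ACTIONS, key=lambda a: payoff(a, fractions))
    profile = [a if random.random() < INERTIA else br for a in profile]

print({a: round(profile.count(a) / N_AGENTS, 2) for a in ACTIONS})
```

With these payoffs the best reply is always the least-crowded action, so the population fractions settle near the 1/3-1/3-1/3 equilibrium; the inertia term avoids the oscillation that purely synchronous best replies would cause.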
Arena: A General Evaluation Platform and Building Toolkit for Multi-Agent Intelligence
Building learning agents that can not only take tests but also innovate is
becoming a hot topic in AI. One of the most promising paths
towards this vision is multi-agent learning, where agents act as the
environment for each other, and improving each agent means proposing new
problems for others. However, existing evaluation platforms are either not
compatible with multi-agent settings, or limited to a specific game. That is,
there is not yet a general evaluation platform for research on multi-agent
intelligence. To this end, we introduce Arena, a general evaluation platform
for multi-agent intelligence with 35 games of diverse logics and
representations. Furthermore, multi-agent intelligence is still at the stage
where many problems remain unexplored. Therefore, we provide a building toolkit
for researchers to easily invent and build novel multi-agent problems from the
provided game set based on a GUI-configurable social tree and five basic
multi-agent reward schemes. Finally, we provide Python implementations of five
state-of-the-art deep multi-agent reinforcement learning baselines. Along with
the baseline implementations, we release a set of 100 best agents/teams,
trained with different training schemes for each game, as a basis for
evaluating agents by population performance. As such, the research community
can perform comparisons under a stable and uniform standard. All the
implementations and accompanying tutorials have been open-sourced for the
community at https://sites.google.com/view/arena-unity/
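As a rough illustration of how per-agent rewards could be composed over a two-level social tree (teams of agents), here is a hypothetical sketch in the spirit of the basic reward schemes mentioned above. The scheme names and formulas are assumptions made for illustration, not Arena's actual API.

```python
import numpy as np

# Hypothetical sketch of composing per-agent rewards over a two-level
# "social tree" (teams of agents). Scheme names and formulas below are
# illustrative assumptions, not Arena's actual reward schemes.

def assign_rewards(raw, teams, scheme):
    """raw:    array of per-agent raw rewards from the game engine
       teams:  list of index arrays, one per team in the social tree
       scheme: how credit is shared within and across teams"""
    out = np.zeros_like(raw, dtype=float)
    if scheme == "isolated":          # each agent keeps its own reward
        out[:] = raw
    elif scheme == "collaborative":   # teammates share their team's mean
        for t in teams:
            out[t] = raw[t].mean()
    elif scheme == "competitive":     # team mean minus the rivals' mean
        team_means = [raw[t].mean() for t in teams]
        total = sum(team_means)
        for m, t in zip(team_means, teams):
            out[t] = m - (total - m) / (len(teams) - 1)
    return out

raw = np.array([1.0, 0.0, 2.0, 3.0])
teams = [np.array([0, 1]), np.array([2, 3])]
print(assign_rewards(raw, teams, "competitive"))
```

Under the competitive scheme the team-mean rewards are zero-sum across teams, while the collaborative scheme gives every teammate identical shared credit; switching schemes changes the incentive structure without touching the underlying game.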
Payoff-Based Dynamics for Multiplayer Weakly Acyclic Games
We consider repeated multiplayer games in which players simultaneously choose strategies from a finite set at each stage, according to some strategy adjustment process. We focus on the specific class of weakly acyclic games, which is particularly relevant for multiagent cooperative control problems. A strategy adjustment process determines how players select their strategies at any stage as a function of the information gathered over previous stages. Of particular interest are “payoff-based” processes in which, at any stage, players know only their own actions and (noise-corrupted) payoffs from previous stages. In particular, players do not know the actions taken by other players and do not know the structural form of the payoff functions. We introduce three different payoff-based processes for increasingly general scenarios and prove that, after a sufficiently large number of stages, player actions constitute a Nash equilibrium at any stage with arbitrarily high probability. We also show how to modify player utility functions through tolls and incentives in so-called congestion games, a special class of weakly acyclic games, to guarantee that a centralized objective can be realized as a Nash equilibrium. We illustrate the methods with a simulation of distributed routing over a network.
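To give a feel for what “payoff-based” means, here is a minimal sketch of one simple experimentation-style rule applied to a toy congestion game: each player sees only its own noisy payoff, occasionally tries a random action, and adopts it when it beats the remembered baseline payoff. The rule, the game, and the constants are illustrative assumptions, not the three processes of the paper.

```python
import random

N, ROADS, EPS, NOISE = 20, 3, 0.1, 0.05  # players, actions, exploration, noise

def noisy_payoff(road, counts):
    """Congestion payoff: falls with the number of players on a road, plus
    noise. Players receive only this number; `counts` is never shown to them."""
    return -counts[road] / N + random.gauss(0, NOISE)

baseline = [random.randrange(ROADS) for _ in range(N)]  # current baseline actions
base_pay = [float("-inf")] * N                          # remembered baseline payoffs

for _ in range(5000):
    # With probability EPS a player experiments; otherwise it replays its baseline.
    actions = [random.randrange(ROADS) if random.random() < EPS else baseline[i]
               for i in range(N)]
    counts = [actions.count(r) for r in range(ROADS)]
    for i in range(N):
        pay = noisy_payoff(actions[i], counts)          # own payoff is all a player sees
        if actions[i] == baseline[i]:
            base_pay[i] = pay                           # refresh the baseline estimate
        elif pay > base_pay[i]:
            baseline[i], base_pay[i] = actions[i], pay  # adopt the better action

print([baseline.count(r) for r in range(ROADS)])        # roughly N/ROADS per road
```

In this toy game the rule spreads players roughly evenly across roads, close to a Nash equilibrium of the congestion game; the paper's processes add the machinery needed to make such convergence provable for general weakly acyclic games despite payoff noise.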