
    Multiagent Deep Reinforcement Learning: Challenges and Directions Towards Human-Like Approaches

    This paper surveys the field of multiagent deep reinforcement learning. The combination of deep neural networks with reinforcement learning has gained increased traction in recent years and is slowly shifting the focus from single-agent to multiagent environments. Dealing with multiple agents is inherently more complex as (a) the future rewards depend on the joint actions of multiple players and (b) the computational complexity of functions increases. We present the most common multiagent problem representations and their main challenges, and identify five research areas that address one or more of these challenges: centralised training and decentralised execution, opponent modelling, communication, efficient coordination, and reward shaping. We find that many computational studies rely on unrealistic assumptions or are not generalisable to other settings; they struggle to overcome the curse of dimensionality or nonstationarity. Approaches from psychology and sociology capture promising relevant behaviours such as communication and coordination. We suggest that, for multiagent reinforcement learning to be successful, future research should address these challenges with an interdisciplinary approach to open up new possibilities for more human-oriented solutions in multiagent reinforcement learning.
    Comment: 37 pages, 6 figures
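    To make the joint-action dependence and nonstationarity described above concrete, here is a minimal sketch (not taken from the surveyed paper) of two independent Q-learners in a repeated two-action coordination game: each agent's reward depends on the joint action, so each learner faces an environment that effectively drifts as the other agent's policy changes. The payoff matrix, learning rate, and exploration parameters are illustrative assumptions.

```python
# Illustrative sketch only: two independent Q-learners in a repeated 2x2
# coordination game. The payoff depends on the JOINT action, so from each
# agent's point of view the environment shifts as the other agent learns
# (the nonstationarity problem mentioned in the abstract).
import random

# Hypothetical joint-action payoffs: agents are rewarded only when they match.
PAYOFF = {(0, 0): 1.0, (1, 1): 1.0, (0, 1): 0.0, (1, 0): 0.0}

ALPHA, EPSILON, EPISODES = 0.1, 0.1, 5000
q = [[0.0, 0.0], [0.0, 0.0]]  # q[agent][action]; stateless, single-state game


def act(agent):
    # Epsilon-greedy selection over the agent's own (local) Q-values.
    if random.random() < EPSILON:
        return random.randrange(2)
    return max(range(2), key=lambda a: q[agent][a])


for _ in range(EPISODES):
    a0, a1 = act(0), act(1)
    r = PAYOFF[(a0, a1)]  # reward is a function of the joint action
    q[0][a0] += ALPHA * (r - q[0][a0])
    q[1][a1] += ALPHA * (r - q[1][a1])

print("Agent 0 Q-values:", q[0])
print("Agent 1 Q-values:", q[1])
```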

    Multi-Agent Credit Assignment in Stochastic Resource Management Games

    Multi-Agent Systems (MAS) are a form of distributed intelligence, where multiple autonomous agents act in a common environment. Numerous complex, real-world systems have been successfully optimised using Multi-Agent Reinforcement Learning (MARL) in conjunction with the MAS framework. In MARL, agents learn by maximising a scalar reward signal from the environment, and thus the design of the reward function directly affects the policies learned. In this work, we address the issue of appropriate multi-agent credit assignment in stochastic resource management games. We propose two new Stochastic Games to serve as testbeds for MARL research into resource management problems: the Tragic Commons Domain and the Shepherd Problem Domain. Our empirical work evaluates the performance of two commonly used reward shaping techniques: Potential-Based Reward Shaping and difference rewards. Experimental results demonstrate that systems using appropriate reward shaping techniques for multi-agent credit assignment can achieve near-optimal performance in stochastic resource management games, outperforming systems that learn using unshaped local or global evaluations. We also present the first empirical investigations into the effect of expressing the same heuristic knowledge in state- or action-based formats, thereby developing insights into the design of multi-agent potential functions that will inform future work.
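    As an illustration of the two credit-assignment signals compared above, the following is a minimal sketch, under assumed definitions, of a potential-based shaped reward and a difference reward in a toy common-resource setting. The global utility, the capacity value, and the potential function are hypothetical and are not the paper's Tragic Commons or Shepherd Problem formulations.

```python
# Illustrative sketch only: potential-based reward shaping and difference
# rewards in a toy common-resource setting. G, CAPACITY and potential() are
# assumptions for demonstration, not the paper's domain definitions.
GAMMA = 0.95
CAPACITY = 10.0


def global_reward(consumptions):
    # Hypothetical global utility: total consumption, penalised for overuse.
    total = sum(consumptions)
    return total if total <= CAPACITY else 2 * CAPACITY - total


def difference_reward(consumptions, i, default=0.0):
    # D_i = G(z) - G(z_{-i}): the global reward minus what it would have been
    # had agent i taken a default (null) action, isolating i's contribution.
    counterfactual = list(consumptions)
    counterfactual[i] = default
    return global_reward(consumptions) - global_reward(counterfactual)


def potential(state):
    # Hypothetical state-based potential encoding the heuristic
    # "joint consumption near capacity is good".
    return -abs(CAPACITY - sum(state))


def shaped_reward(r, state, next_state):
    # Potential-based shaping: r + F(s, s') with F = gamma * phi(s') - phi(s).
    return r + GAMMA * potential(next_state) - potential(state)


# Example: three agents consume 2, 5 and 6 units (total 13 > capacity 10).
z = [2.0, 5.0, 6.0]
print("Global reward G(z):", global_reward(z))
print("Difference reward for agent 2:", difference_reward(z, 2))
print("Shaped reward from the empty state to z:",
      shaped_reward(global_reward(z), [0.0, 0.0, 0.0], z))
```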