4,932 research outputs found

    Multi-Agent Credit Assignment in Stochastic Resource Management Games

    Get PDF
    Multi-Agent Systems (MAS) are a form of distributed intelligence, where multiple autonomous agents act in a common environment. Numerous complex, real world systems have been successfully optimised using Multi-Agent Reinforcement Learning (MARL) in conjunction with the MAS framework. In MARL agents learn by maximising a scalar reward signal from the environment, and thus the design of the reward function directly affects the policies learned. In this work, we address the issue of appropriate multi-agent credit assignment in stochastic resource management games. We propose two new Stochastic Games to serve as testbeds for MARL research into resource management problems: the Tragic Commons Domain and the Shepherd Problem Domain. Our empirical work evaluates the performance of two commonly used reward shaping techniques: Potential-Based Reward Shaping and difference rewards. Experimental results demonstrate that systems using appropriate reward shaping techniques for multi-agent credit assignment can achieve near optimal performance in stochastic resource management games, outperforming systems learning using unshaped local or global evaluations. We also present the first empirical investigations into the effect of expressing the same heuristic knowledge in state- or action-based formats, therefore developing insights into the design of multi-agent potential functions that will inform future work

    Multiagent Deep Reinforcement Learning: Challenges and Directions Towards Human-Like Approaches

    Full text link
    This paper surveys the field of multiagent deep reinforcement learning. The combination of deep neural networks with reinforcement learning has gained increased traction in recent years and is slowly shifting the focus from single-agent to multiagent environments. Dealing with multiple agents is inherently more complex as (a) the future rewards depend on the joint actions of multiple players and (b) the computational complexity of functions increases. We present the most common multiagent problem representations and their main challenges, and identify five research areas that address one or more of these challenges: centralised training and decentralised execution, opponent modelling, communication, efficient coordination, and reward shaping. We find that many computational studies rely on unrealistic assumptions or are not generalisable to other settings; they struggle to overcome the curse of dimensionality or nonstationarity. Approaches from psychology and sociology capture promising relevant behaviours such as communication and coordination. We suggest that, for multiagent reinforcement learning to be successful, future research addresses these challenges with an interdisciplinary approach to open up new possibilities for more human-oriented solutions in multiagent reinforcement learning.Comment: 37 pages, 6 figure

    Metadata Schema x-econ Repository

    Get PDF
    Since May 2017, the x-hub project partners OVGU Magdeburg, University of Vienna, and GESIS dispose of a new repository, called x-econ (https://x-econ.org). The service is dedicated to all experimental economics research projects to disseminate user-friendly archiving and provision of experimental economics research data. The repository x-econ contains all necessary core functionalities of a modern repository and is in a continuous optimization process aiming at functionality enhancement and improvement. x-econ is also one pillar of the multidisciplinary repository x-science (https://x-science.org). The present documentation, which is primarily based on the GESIS Technical Reports on datorium 2014|03 and da|ra 4.0, lists and explains the metadata elements, used to describe research information

    Towards Cooperative MARL in Industrial Domains

    Get PDF

    Preface to the special issue: adaptive and learning agents

    Get PDF

    Hierarchical multiagent reinforcement learning for maritime traffic management

    Get PDF
    Agency for Science, Technology and Research, Fujitsu Limited; National Research Foundation Singapor
    corecore