Hysteretic Q-Learning: an algorithm for decentralized reinforcement learning in cooperative multi-agent teams

Laurent, Guillaume; Le Fort-Piat, Nadine; Matignon, Laëtitia

Authors: Guillaume Laurent
Nadine Le Fort-Piat
Laëtitia Matignon
Publication date: 29 October 2007
Publisher: HAL CCSD

Abstract

Multi-agent systems (MAS) are a field of study of growing interest in a variety of domains such as robotics and distributed control. The article focuses on decentralized reinforcement learning (RL) in cooperative MAS, where a team of independent learning robots (ILs) try to coordinate their individual behaviors to reach a coherent joint behavior. We assume that each robot has no information about its teammates' actions. To date, RL approaches for such ILs have not guaranteed convergence to the optimal joint policy in scenarios where coordination is difficult. We report an investigation of existing algorithms for learning coordination in cooperative MAS, and suggest a Q-Learning extension for ILs, called Hysteretic Q-Learning. This algorithm does not require any additional communication between robots. Its advantages are demonstrated and compared with other methods on various applications: bimatrix games, a collaborative ball balancing task, and the pursuit domain.
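The abstract does not state the update rule itself. As a minimal sketch, assuming the commonly described form of Hysteretic Q-Learning (two learning rates, a larger one for positive temporal-difference errors and a smaller one for negative errors), a single tabular update for one independent learner might look like the following. The function name, the dictionary-based table, and the parameter values are illustrative assumptions, not taken from the paper.

```python
from collections import defaultdict


def hysteretic_update(q, state, action, reward, next_state, actions,
                      alpha=0.1, beta=0.01, gamma=0.9):
    """Apply one hysteretic Q-learning update in place; return the TD error.

    Assumed convention: alpha > beta, so the learner raises its estimate
    quickly after a good outcome and lowers it slowly after a bad one
    (bad outcomes may be caused by teammates' exploration).
    """
    best_next = max(q[(next_state, a)] for a in actions)
    delta = reward + gamma * best_next - q[(state, action)]
    # Positive errors use the larger rate alpha, negative errors the smaller beta.
    rate = alpha if delta >= 0 else beta
    q[(state, action)] += rate * delta
    return delta


# Hypothetical two-action example: one rewarding step, then one penalized step.
q = defaultdict(float)
actions = ["a", "b"]
hysteretic_update(q, "s", "a", reward=10.0, next_state="s", actions=actions)
after_good = q[("s", "a")]
hysteretic_update(q, "s", "a", reward=-10.0, next_state="s", actions=actions)
after_bad = q[("s", "a")]
```

Each robot would keep its own table indexed only by its own action, which is consistent with the stated assumption that no robot observes its teammates' actions. In this sketch, setting beta equal to alpha gives the ordinary Q-learning update.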

Available Versions

HAL - Université de Franche-Comté

oai:HAL:hal-00187279v1

Last time updated on 12/11/2016

Crossref

Last time updated on 01/04/2019