Multiagent Reinforcement Learning with Regret Matching for Robot Soccer

Jiachen Ma; Qiang Liu; Wei Xie

Publication date: 1 January 2013
Publisher: Hindawi Limited

Abstract

This paper proposes a novel multiagent reinforcement learning (MARL) algorithm, Nash-Q learning with regret matching, in which regret matching is used to speed up the well-known MARL algorithm Nash-Q learning. Choosing a suitable action-selection strategy that balances exploration and exploitation is critical to the online learning ability of Nash-Q learning. In a Markov game, the joint action of agents that adopt the regret matching algorithm converges to a set of no-regret points, which can be viewed as the set of coarse correlated equilibria and which in essence includes the Nash equilibria. It can therefore be inferred that regret matching guides exploration of the state-action space and so increases the convergence rate of Nash-Q learning. Simulation results on robot soccer show that, compared with the original Nash-Q learning algorithm, using regret matching during the learning phase yields excellent online learning ability and significant improvements in scores, average reward, and policy convergence.
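As a rough illustration of the regret matching rule mentioned in the abstract, the sketch below selects actions with probability proportional to positive cumulative regret and falls back to uniform exploration when no action has positive regret. It is a minimal sketch of generic regret matching, not the paper's implementation. The function names and the `action_values` argument are assumptions made for illustration; in a Nash-Q setting these values might be the agent's current value estimates for each of its actions, but the abstract does not say how they are obtained.

```python
import random


def regret_matching_policy(cum_regret):
    """Return action probabilities proportional to positive cumulative regret."""
    positive = [max(r, 0.0) for r in cum_regret]
    total = sum(positive)
    n = len(cum_regret)
    if total <= 0.0:
        # No action has positive regret: explore uniformly.
        return [1.0 / n] * n
    return [p / total for p in positive]


def update_regret(cum_regret, action_values, played):
    """Accumulate how much better each action would have done than the one played.

    action_values is a hypothetical per-action payoff estimate (an assumption,
    not something specified in the abstract).
    """
    baseline = action_values[played]
    return [r + (v - baseline) for r, v in zip(cum_regret, action_values)]


def select_action(cum_regret, rng=random):
    """Sample an action index from the regret matching distribution."""
    probs = regret_matching_policy(cum_regret)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]
```

In a learning loop, an agent would call `select_action` in place of a fixed exploration rule such as epsilon-greedy, then call `update_regret` once the payoffs for the step are known.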
