Search CORE

1 research outputs found

Policy Transfer Methods in RoboCup Keep-Away

Author: Ammar H.
Didi S.
Doncleux S.
Moshaiov A.
Stone P.
Taylor M.
Taylor M.
Taylor M.
Verbancsics P.
Whiteson S.
Publication venue
Publication date: 01/01/2018
Field of study

This study investigates multi-agent policy transfer coupled with behavior adaptation by objective and non-objective search variants of HyperNEAT in RoboCup keep-away. For comparison, evolved behaviors were compared to those adapted by RL methods: SARSA and Q-Learning, coupled with policy transfer. Keepaway was selected as it is an established multi-agent experimental platform. Similarly, the SARSA and Q-Learning methods were selected as both have been demonstrated for boosting behavior quality with policy transfer. Keep-away behaviors were gauged in terms of effectiveness and efficiency. Effectiveness was average task performance given policy transfer, where task performance was average ball control time by the keeper team. Efficiency was average number of evaluations taken to reach a minimum task performance threshold given policy transfer

Crossref

UCT Computer Science Research Document Archive