Auto-Encoding Adversarial Imitation Learning

Gao, Yang; Zhang, Kaifeng; Zhang, Ziming; Zhao, Rui

Authors: Yang Gao, Kaifeng Zhang, Ziming Zhang, Rui Zhao
Publication date: 8 August 2023

Abstract

Reinforcement learning (RL) provides a powerful framework for decision-making, but its application in practice often requires a carefully designed reward function. Adversarial Imitation Learning (AIL) sheds light on automatic policy acquisition without access to the reward signal from the environment. In this work, we propose Auto-Encoding Adversarial Imitation Learning (AEAIL), a robust and scalable AIL framework. To induce expert policies from demonstrations, AEAIL uses the reconstruction error of an auto-encoder as a reward signal, which provides more information for optimizing policies than the rewards of prior discriminator-based methods. We then use the derived objective functions to train the auto-encoder and the agent policy. Experiments show that AEAIL outperforms state-of-the-art methods in both state-based and image-based environments. More importantly, AEAIL is much more robust when the expert demonstrations are noisy.

Comment: 15 pages
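The core idea in the abstract, scoring agent behavior by how well an auto-encoder reconstructs it, can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a linear auto-encoder fitted by SVD (equivalent to PCA) in place of a neural network, it omits the adversarial training of the auto-encoder and the policy update entirely, and the mapping `1 / (1 + error)` from reconstruction error to reward is a hypothetical choice made here for illustration. All function names and the synthetic data are likewise assumptions.

```python
import numpy as np


def fit_linear_autoencoder(expert_x, latent_dim):
    """Fit a linear auto-encoder (equivalent to PCA) on expert samples."""
    mean = expert_x.mean(axis=0)
    # Rows of vt are the principal directions of the centered expert data.
    _, _, vt = np.linalg.svd(expert_x - mean, full_matrices=False)
    return mean, vt[:latent_dim]


def reconstruction_error(x, mean, components):
    """Per-sample mean squared error after encoding and decoding."""
    centered = x - mean
    reconstructed = centered @ components.T @ components
    return np.mean((centered - reconstructed) ** 2, axis=1)


def reward(x, mean, components):
    # Hypothetical mapping (an assumption, not taken from the paper):
    # low reconstruction error gives a reward close to 1.
    return 1.0 / (1.0 + reconstruction_error(x, mean, components))


rng = np.random.default_rng(0)

# Synthetic "expert" state-action vectors near a 2-D subspace of a 6-D space.
basis = rng.normal(size=(2, 6))
expert = rng.normal(size=(500, 2)) @ basis + 0.01 * rng.normal(size=(500, 6))

# Synthetic "agent" vectors that do not follow the expert structure.
agent = rng.normal(size=(500, 6))

mean, components = fit_linear_autoencoder(expert, latent_dim=2)
expert_reward = reward(expert, mean, components).mean()
agent_reward = reward(agent, mean, components).mean()

print("mean reward, expert-like samples:", expert_reward)
print("mean reward, agent samples:", agent_reward)
```

In this toy setting, samples that resemble the demonstrations reconstruct well and receive a higher reward than samples that do not, which is the signal a policy optimizer would then try to increase.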

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2206.11004

Last time updated on 12/08/2023