Hierarchical Reinforcement Learning with Abductive Planning

Authors: Onishi Takashi, Tsuruoka Yoshimasa, Yamamoto Kazeto
Publication date: 28/06/2018

One of the key challenges in applying reinforcement learning to real-life problems is that the amount of trial-and-error required to learn a good policy increases drastically as the task becomes complex. One potential solution to this problem is to combine reinforcement learning with automated symbolic planning and utilize prior knowledge of the domain. However, existing methods have limitations in their applicability and expressiveness. In this paper we propose a hierarchical reinforcement learning method based on abductive symbolic planning. The planner can deal with user-defined evaluation functions and is not based on the Herbrand theorem. Therefore it can utilize prior knowledge of the rewards and can work in a domain where the state space is unknown. We demonstrate empirically that our architecture significantly improves learning efficiency with respect to the number of training examples on the evaluation domain, in which the state space is unknown and there exist multiple goals.

Comment: 7 pages, 6 figures, ICML/IJCAI/AAMAS 2018 Workshop on Planning and Learning (PAL-18)

arXiv.org e-Print Archive