29 research outputs found
Learning STRIPS Action Models with Classical Planning
This paper presents a novel approach for learning STRIPS action models from
examples that compiles this inductive learning task into a classical planning
task. Interestingly, the compilation approach is flexible to different amounts
of available input knowledge; the learning examples can range from a set of
plans (with their corresponding initial and final states) to just a pair of
initial and final states (no intermediate action or state is given). Moreover,
the compilation accepts partially specified action models and it can be used to
validate whether the observation of a plan execution follows a given STRIPS
action model, even if this model is not fully specified.Comment: 8+1 pages, 4 figures, 6 table
Hierarchical Reinforcement Learning based on Planning Operators
Long-horizon manipulation tasks such as stacking represent a longstanding
challenge in the field of robotic manipulation, particularly when using
reinforcement learning (RL) methods which often struggle to learn the correct
sequence of actions for achieving these complex goals. To learn this sequence,
symbolic planning methods offer a good solution based on high-level reasoning,
however, planners often fall short in addressing the low-level control
specificity needed for precise execution. This paper introduces a novel
framework that integrates symbolic planning with hierarchical RL through the
cooperation of high-level operators and low-level policies. Our contribution
integrates planning operators (e.g. preconditions and effects) as part of the
hierarchical RL algorithm based on the Scheduled Auxiliary Control (SAC-X)
method. We developed a dual-purpose high-level operator, which can be used both
in holistic planning and as independent, reusable policies. Our approach offers
a flexible solution for long-horizon tasks, e.g., stacking a cube. The
experimental results show that our proposed method obtained an average of 97.2%
success rate for learning and executing the whole stack sequence, and the
success rate for learning independent policies, e.g. reach (98.9%), lift
(99.7%), stack (85%), etc. The training time is also reduced by 68% when using
our proposed approach
Learning Generalized Relational Heuristic Networks for Model-Agnostic Planning
Computing goal-directed behavior (sequential decision-making, or planning) is
essential to designing efficient AI systems. Due to the computational
complexity of planning, current approaches rely primarily upon hand-coded
symbolic domain models and hand-coded heuristic-function generators for
efficiency. Learned heuristics for such problems have been of limited utility
as they are difficult to apply to problems with objects and object quantities
that are significantly different from those in the training data. This paper
develops a new approach for learning generalized heuristics in the absence of
symbolic domain models using deep neural networks that utilize an input
predicate vocabulary but are agnostic to object names and quantities. It uses
an abstract state representation to facilitate data efficient, generalizable
learning. Empirical evaluation on a range of benchmark domains show that in
contrast to prior approaches, generalized heuristics computed by this method
can be transferred easily to problems with different objects and with object
quantities much larger than those in the training data.Comment: Submitted to NIPS 2020, 11 pages, 3 figure
YUMA â An AI Planning Agent for Composing IT Services from Infrastructure-as-Code Specifications
Infrastructure-as-code enables cloud architects to automate IT service delivery by specifying IT services through machine-readable definition files. To allow for a reusability of the infrastructure-as-code specifications, cloud architects specify IT services as compositions of sub-processes. As the AI planning agents for automated IT service composition proposed by prior research fall short in the infrastructure-as-code context, we design a search-based problem-solving agent named YUMA according to a design science research process to fill this research gap. YUMA holds a search tree reflecting the state space and transition model. It includes an algorithm for building the search tree and two algorithms for determining the minimum composition plan. The underlying IT service composition problem is explicated for the infrastructure-as-code context and formulated as a search problem. The results of the demonstration and evaluation show that YUMA fulfills the requirements necessary to solve this problem and digitizes an important task of cloud architects