29 research outputs found

    Learning STRIPS Action Models with Classical Planning

    Full text link
    This paper presents a novel approach for learning STRIPS action models from examples that compiles this inductive learning task into a classical planning task. Interestingly, the compilation approach is flexible to different amounts of available input knowledge; the learning examples can range from a set of plans (with their corresponding initial and final states) to just a pair of initial and final states (no intermediate action or state is given). Moreover, the compilation accepts partially specified action models and it can be used to validate whether the observation of a plan execution follows a given STRIPS action model, even if this model is not fully specified.Comment: 8+1 pages, 4 figures, 6 table

    Hierarchical Reinforcement Learning based on Planning Operators

    Full text link
    Long-horizon manipulation tasks such as stacking represent a longstanding challenge in the field of robotic manipulation, particularly when using reinforcement learning (RL) methods which often struggle to learn the correct sequence of actions for achieving these complex goals. To learn this sequence, symbolic planning methods offer a good solution based on high-level reasoning, however, planners often fall short in addressing the low-level control specificity needed for precise execution. This paper introduces a novel framework that integrates symbolic planning with hierarchical RL through the cooperation of high-level operators and low-level policies. Our contribution integrates planning operators (e.g. preconditions and effects) as part of the hierarchical RL algorithm based on the Scheduled Auxiliary Control (SAC-X) method. We developed a dual-purpose high-level operator, which can be used both in holistic planning and as independent, reusable policies. Our approach offers a flexible solution for long-horizon tasks, e.g., stacking a cube. The experimental results show that our proposed method obtained an average of 97.2% success rate for learning and executing the whole stack sequence, and the success rate for learning independent policies, e.g. reach (98.9%), lift (99.7%), stack (85%), etc. The training time is also reduced by 68% when using our proposed approach

    Learning Generalized Relational Heuristic Networks for Model-Agnostic Planning

    Full text link
    Computing goal-directed behavior (sequential decision-making, or planning) is essential to designing efficient AI systems. Due to the computational complexity of planning, current approaches rely primarily upon hand-coded symbolic domain models and hand-coded heuristic-function generators for efficiency. Learned heuristics for such problems have been of limited utility as they are difficult to apply to problems with objects and object quantities that are significantly different from those in the training data. This paper develops a new approach for learning generalized heuristics in the absence of symbolic domain models using deep neural networks that utilize an input predicate vocabulary but are agnostic to object names and quantities. It uses an abstract state representation to facilitate data efficient, generalizable learning. Empirical evaluation on a range of benchmark domains show that in contrast to prior approaches, generalized heuristics computed by this method can be transferred easily to problems with different objects and with object quantities much larger than those in the training data.Comment: Submitted to NIPS 2020, 11 pages, 3 figure

    YUMA – An AI Planning Agent for Composing IT Services from Infrastructure-as-Code Specifications

    Get PDF
    Infrastructure-as-code enables cloud architects to automate IT service delivery by specifying IT services through machine-readable definition files. To allow for a reusability of the infrastructure-as-code specifications, cloud architects specify IT services as compositions of sub-processes. As the AI planning agents for automated IT service composition proposed by prior research fall short in the infrastructure-as-code context, we design a search-based problem-solving agent named YUMA according to a design science research process to fill this research gap. YUMA holds a search tree reflecting the state space and transition model. It includes an algorithm for building the search tree and two algorithms for determining the minimum composition plan. The underlying IT service composition problem is explicated for the infrastructure-as-code context and formulated as a search problem. The results of the demonstration and evaluation show that YUMA fulfills the requirements necessary to solve this problem and digitizes an important task of cloud architects
    corecore