Search CORE

21,535 research outputs found

Learning STRIPS Action Models with Classical Planning

Author: Aineto Diego
Jiménez Sergio
Onaindia Eva
Publication venue
Publication date: 15/06/2018
Field of study

This paper presents a novel approach for learning STRIPS action models from examples that compiles this inductive learning task into a classical planning task. Interestingly, the compilation approach is flexible to different amounts of available input knowledge; the learning examples can range from a set of plans (with their corresponding initial and final states) to just a pair of initial and final states (no intermediate action or state is given). Moreover, the compilation accepts partially specified action models and it can be used to validate whether the observation of a plan execution follows a given STRIPS action model, even if this model is not fully specified.Comment: 8+1 pages, 4 figures, 6 table

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Learning Hierarchical Task Networks Using Semantic Word Embeddings

Author: Gopalakrishnan Sriram
Publication venue: Lehigh Preserve
Publication date
Field of study

This thesis describes WORD2HTN, which is a novel and semantic approach for learning hierarchical task networks (HTN) and semantic division of goals from input plan traces. The semantic relationships are learned using machine learning to get the vector representations of the components of the plan trace. The semantic relationships are used to learn hierarchical landmarks, which in turn are used to make semantically divided HTNs. These learned HTNs can then be used for subsequent new problems in the domain that have a similar structure with the problems in the input plan traces. This work also improves the learning algorithm to include arithmetic conditions and effects. WORD2HTN was tested on 3 deterministic domains. These are Logistics or Transportation domain, Abstract Graph domain, and the Malmo interface for the Minecraft game. We show that WORD2HTN learns semantically divided HTNs. We also experimentally demonstrate that HTN planners using this have an exponential speedup in information-dense domains over the state of the art classical planner. Finally, we show that the HTNs learned in Minecraft can be used to achieve tasks faster with a cooperative agent controlled by the HTN planner’s output

Lehigh University: Lehigh Preserve

STRIPS Action Discovery

Author: Alenyà Guillem
Segovia-Aguas Javier
Suárez-Hernández Alejandro
Torras Carme
Publication venue
Publication date: 01/01/2020
Field of study

The problem of specifying high-level knowledge bases for planning becomes a hard task in realistic environments. This knowledge is usually handcrafted and is hard to keep updated, even for system experts. Recent approaches have shown the success of classical planning at synthesizing action models even when all intermediate states are missing. These approaches can synthesize action schemas in Planning Domain Definition Language (PDDL) from a set of execution traces each consisting, at least, of an initial and final state. In this paper, we propose a new algorithm to unsupervisedly synthesize STRIPS action models with a classical planner when action signatures are unknown. In addition, we contribute with a compilation to classical planning that mitigates the problem of learning static predicates in the action model preconditions, exploits the capabilities of SAT planners with parallel encodings to compute action schemas and validate all instances. Our system is flexible in that it supports the inclusion of partial input information that may speed up the search. We show through several experiments how learned action models generalize over unseen planning instances.Comment: Presented to Genplan 2020 workshop, held in the AAAI 2020 conference (https://sites.google.com/view/genplan20) (2021/03/05: included missing acknowledgments

arXiv.org e-Print Archive

UPCommons. Portal del coneixement obert de la UPC

Digital.CSIC

HTN planning: Overview, comparison, and beyond

Author: Aiello M.
Georgievski I.
Publication venue
Publication date: 01/05/2015
Field of study

Hierarchies are one of the most common structures used to understand and conceptualise the world. Within the field of Artificial Intelligence (AI) planning, which deals with the automation of world-relevant problems, Hierarchical Task Network (HTN) planning is the branch that represents and handles hierarchies. In particular, the requirement for rich domain knowledge to characterise the world enables HTN planning to be very useful, and also to perform well. However, the history of almost 40 years obfuscates the current understanding of HTN planning in terms of accomplishments, planning models, similarities and differences among hierarchical planners, and its current and objective image. On top of these issues, the ability of hierarchical planning to truly cope with the requirements of real-world applications has been often questioned. As a remedy, we propose a framework-based approach where we first provide a basis for defining different formal models of hierarchical planning, and define two models that comprise a large portion of HTN planners. Second, we provide a set of concepts that helps in interpreting HTN planners from the aspect of their search space. Then, we analyse and compare the planners based on a variety of properties organised in five segments, namely domain authoring, expressiveness, competence, computation and applicability. Furthermore, we select Web service composition as a real-world and current application, and classify and compare the approaches that employ HTN planning to solve the problem of service composition. Finally, we conclude with our findings and present directions for future work. In summary, we provide a novel and comprehensive viewpoint on a core AI planning technique.<br/

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

Value Propagation Networks

Author: Kohli Pushmeet
Lin Zeming
Nardelli Nantas
Synnaeve Gabriel
Torr Philip H. S.
Usunier Nicolas
Publication venue
Publication date: 01/01/2019
Field of study

We present Value Propagation (VProp), a set of parameter-efficient differentiable planning modules built on Value Iteration which can successfully be trained using reinforcement learning to solve unseen tasks, has the capability to generalize to larger map sizes, and can learn to navigate in dynamic environments. We show that the modules enable learning to plan when the environment also includes stochastic elements, providing a cost-efficient learning system to build low-level size-invariant planners for a variety of interactive navigation problems. We evaluate on static and dynamic configurations of MazeBase grid-worlds, with randomly generated environments of several different sizes, and on a StarCraft navigation scenario, with more complex dynamics, and pixels as input.Comment: Updated to match ICLR 2019 OpenReview's versio

arXiv.org e-Print Archive

Oxford University Research Archive