Search CORE

56 research outputs found

Learning Domain-Independent Planning Heuristics with Hypergraph Networks

Author: Shen William
Thiébaux Sylvie
Trevizan Felipe
Publication venue
Publication date: 29/11/2019
Field of study

We present the first approach capable of learning domain-independent planning heuristics entirely from scratch. The heuristics we learn map the hypergraph representation of the delete-relaxation of the planning problem at hand, to a cost estimate that approximates that of the least-cost path from the current state to the goal through the hypergraph. We generalise Graph Networks to obtain a new framework for learning over hypergraphs, which we specialise to learn planning heuristics by training over state/value pairs obtained from optimal cost plans. Our experiments show that the resulting architecture, STRIPS-HGNs, is capable of learning heuristics that are competitive with existing delete-relaxation heuristics including LM-cut. We show that the heuristics we learn are able to generalise across different problems and domains, including to domains that were not seen during training

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Exploiting First-Order Regression in Inductive Policy Selection

Author: Charles Gretton
Sylvie Thiébaux
Publication venue
Publication date: 01/01/2004
Field of study

We consider the problem of computing optimal generalised policies for relational Markov decision processes. We describe an approach combining some of the benefits of purely inductive techniques with those of symbolic dynamic programming methods. The latter reason about the optimal value function using first-order decisiontheoretic regression and formula rewriting, while the former, when provided with a suitable hypotheses language, are capable of generalising value functions or policies for small instances. Our idea is to use reasoning and in particular classical first-order regression to automatically generate a hypotheses language dedicated to the domain at hand, which is then used as input by an inductive solver. This approach avoids the more complex reasoning of symbolic dynamic programming while focusing the inductive solver’s attention on concepts that are specifically relevant to the optimal value function for the domain considered.

CiteSeerX

Operations Planning

Author: Buffet Olivier
Thiébaux Sylvie
Publication venue: ISTE Ltd and John Wiley & Sons Inc
Publication date: 01/01/2010
Field of study

INRIA a CCSD electronic archive server