183,198 research outputs found

    On the Convergence of Techniques that Improve Value Iteration

    Get PDF
    Prioritisation of Bellman backups or updating only a small subset of actions represent important techniques for speeding up planning in MDPs. The recent literature showed new efficient approaches which exploit these directions. Backward value iteration and backing up only the best actions were shown to lead to a significant reduction of the planning time. This paper conducts a theoretical and empirical analysis of these techniques and shows new important proofs. In particular, (1) it identifies weaker requirements for the convergence of backups based on best actions only, (2) a new method for evaluation of the Bellman error is shown for the update that updates one best action once, (3) it presents the theoretical proof of backward value iteration and establishes required initialisation, (4) and shows that the default state ordering of backups in standard value iteration can significantly influence its performance. Additionally, (5) the existing literature did not compare these methods, either empirically or analytically, against policy iteration. The rigorous empirical and novel theoretical parts of the paper reveal important associations and allow drawing guidelines on which type of value or policy iteration is suitable for a given domain. Finally, our chief message is that standard value iteration can be made far more efficient by simple modifications shown in the paper

    High level task planning with inference for the TIAGo robot

    Get PDF
    The need to combine task planning and motion planning in robotics is well understood. The task planner generates a plan to solve the problem while the motion planner executes the actions of the problem. The previous framework is applied in many state machines that solve complex problems. But in this project we want to present an interface that communicates the task planner layer and the motion planner layer, and updates the geometric information of the environment to inform the task planner. This framework allows to solve complex tasks with basic information of the goal, and replan whenever the motion could not be executed. All the information of the problems is modelled as logical predicates. The objective of this project is to generate a generic model of the environment, with a set of feasible motions of the robot, and use this interface to solve many different planning problems involving those actions, by just giving simple goals. The result is to make the robot more autonomous and allow that any user could use it by giving simple orders. Moreover this project presents the different frameworks and algorithms used to simulate those actions in the robot such as: Sequential Quadratic Programming optimization, Rapidly Random Exploring Tree (RRT) or SBPL global planning. It also shows an introduction to PDDL language used to model the problem and the actions, and the Fast-Froward (FF) solver that is the responsible to translate the problem as a graph and solve it. Finally we test it on different experiments in simulation, by using the TIAGo platform of PAL robotics. The results are promising and allow to dream in service robots solving complex tasks simply computing and modelling basic actions

    Use-cases on evolution

    Get PDF
    This report presents a set of use cases for evolution and reactivity for data in the Web and Semantic Web. This set is organized around three different case study scenarios, each of them is related to one of the three different areas of application within Rewerse. Namely, the scenarios are: “The Rewerse Information System and Portal”, closely related to the work of A3 – Personalised Information Systems; “Organizing Travels”, that may be related to the work of A1 – Events, Time, and Locations; “Updates and evolution in bioinformatics data sources” related to the work of A2 – Towards a Bioinformatics Web
    • …
    corecore