Search CORE

118,571 research outputs found

Robust Control of Uncertain Markov Decision Processes with Temporal Logic Specifications

Author: Murray Richard M.
Topcu Ufuk
Wolff Eric M.
Publication venue: 'California Institute of Technology Library'
Publication date: 25/09/2011
Field of study

We present a method for designing robust controllers for dynamical systems with linear temporal logic specifications. We abstract the original system by a finite Markov Decision Process (MDP) that has transition probabilities in a specified uncertainty set. A robust control policy for the MDP is generated that maximizes the worst-case probability of satisfying the specification over all transition probabilities in the uncertainty set. To do this, we use a procedure from probabilistic model checking to combine the system model with an automaton representing the specification. This new MDP is then transformed into an equivalent form that satisfies assumptions for stochastic shortest path dynamic programming. A robust version of dynamic programming allows us to solve for a

\epsilon

-suboptimal robust control policy with time complexity

O(\log 1/\epsilon)

times that for the non-robust case. We then implement this control policy on the original dynamical system

CiteSeerX

Caltech Authors

Neural Task Programming: Learning to Generalize Across Hierarchical Tasks

Author: Fei-Fei Li
Gao Julian
Garg Animesh
Nair Suraj
Savarese Silvio
Xu Danfei
Zhu Yuke
Publication venue
Publication date: 14/03/2018
Field of study

In this work, we propose a novel robot learning framework called Neural Task Programming (NTP), which bridges the idea of few-shot learning from demonstration and neural program induction. NTP takes as input a task specification (e.g., video demonstration of a task) and recursively decomposes it into finer sub-task specifications. These specifications are fed to a hierarchical neural program, where bottom-level programs are callable subroutines that interact with the environment. We validate our method in three robot manipulation tasks. NTP achieves strong generalization across sequential tasks that exhibit hierarchal and compositional structures. The experimental results show that NTP learns to generalize well to- wards unseen tasks with increasing lengths, variable topologies, and changing objectives.Comment: ICRA 201

arXiv.org e-Print Archive

Caltech Authors