2,138 research outputs found
Learning to Represent Haptic Feedback for Partially-Observable Tasks
The sense of touch, being the earliest sensory system to develop in a human
body [1], plays a critical part of our daily interaction with the environment.
In order to successfully complete a task, many manipulation interactions
require incorporating haptic feedback. However, manually designing a feedback
mechanism can be extremely challenging. In this work, we consider manipulation
tasks that need to incorporate tactile sensor feedback in order to modify a
provided nominal plan. To incorporate partial observation, we present a new
framework that models the task as a partially observable Markov decision
process (POMDP) and learns an appropriate representation of haptic feedback
which can serve as the state for a POMDP model. The model, that is parametrized
by deep recurrent neural networks, utilizes variational Bayes methods to
optimize the approximate posterior. Finally, we build on deep Q-learning to be
able to select the optimal action in each state without access to a simulator.
We test our model on a PR2 robot for multiple tasks of turning a knob until it
clicks.Comment: IEEE International Conference on Robotics and Automation (ICRA), 201
Decision-Theoretic Planning with Person Trajectory Prediction for Social Navigation
Robots navigating in a social way should reason about people intentions
when acting. For instance, in applications like robot guidance or meeting with a
person, the robot has to consider the goals of the people. Intentions are inherently nonobservable,
and thus we propose Partially Observable Markov Decision Processes
(POMDPs) as a decision-making tool for these applications. One of the issues with
POMDPs is that the prediction models are usually handcrafted. In this paper, we use
machine learning techniques to build prediction models from observations. A novel
technique is employed to discover points of interest (goals) in the environment, and a
variant of Growing Hidden Markov Models (GHMMs) is used to learn the transition
probabilities of the POMDP. The approach is applied to an autonomous telepresence
robot
- …