77 research outputs found
Predictive and Robust Robot Assistance for Sequential Manipulation
This paper presents a novel concept to support physically impaired humans in
daily object manipulation tasks with a robot. Given a user's manipulation
sequence, we propose a predictive model that uniquely casts the user's
sequential behavior as well as a robot support intervention into a hierarchical
multi-objective optimization problem. A major contribution is the prediction
formulation, which allows to consider several different future paths
concurrently. The second contribution is the encoding of a general notion of
constancy constraints, which allows to consider dependencies between
consecutive or far apart keyframes (in time or space) of a sequential task. We
perform numerical studies, simulations and robot experiments to analyse and
evaluate the proposed method in several table top tasks where a robot supports
impaired users by predicting their posture and proactively re-arranging
objects
Assessing Transferability from Simulation to Reality for Reinforcement Learning
Learning robot control policies from physics simulations is of great interest
to the robotics community as it may render the learning process faster,
cheaper, and safer by alleviating the need for expensive real-world
experiments. However, the direct transfer of learned behavior from simulation
to reality is a major challenge. Optimizing a policy on a slightly faulty
simulator can easily lead to the maximization of the `Simulation Optimization
Bias` (SOB). In this case, the optimizer exploits modeling errors of the
simulator such that the resulting behavior can potentially damage the robot. We
tackle this challenge by applying domain randomization, i.e., randomizing the
parameters of the physics simulations during learning. We propose an algorithm
called Simulation-based Policy Optimization with Transferability Assessment
(SPOTA) which uses an estimator of the SOB to formulate a stopping criterion
for training. The introduced estimator quantifies the over-fitting to the set
of domains experienced while training. Our experimental results on two
different second order nonlinear systems show that the new simulation-based
policy search algorithm is able to learn a control policy exclusively from a
randomized simulator, which can be applied directly to real systems without any
additional training
Learning from Few Demonstrations with Frame-Weighted Motion Generation
Learning from Demonstration (LfD) enables robots to acquire versatile skills
by learning motion policies from human demonstrations. It endows users with an
intuitive interface to transfer new skills to robots without the need for
time-consuming robot programming and inefficient solution exploration. During
task executions, the robot motion is usually influenced by constraints imposed
by environments. In light of this, task-parameterized LfD (TP-LfD) encodes
relevant contextual information into reference frames, enabling better skill
generalization to new situations. However, most TP-LfD algorithms typically
require multiple demonstrations across various environmental conditions to
ensure sufficient statistics for a meaningful model. It is not a trivial task
for robot users to create different situations and perform demonstrations under
all of them. Therefore, this paper presents a novel algorithm to learn skills
from few demonstrations. By leveraging the reference frame weights that capture
the frame importance or relevance during task executions, our method
demonstrates excellent skill acquisition performance, which is validated in
real robotic environments.Comment: Accepted by ISER. For the experiment video, see
https://youtu.be/JpGjk4eKC3
Learning Utility Surfaces for Movement Selection
Humanoid robots are highly redundant systems with respect to the tasks they are asked to perform. This redundancy manifests itself in the number of degrees of freedom of the robot exceeding the dimensionality of the task. Traditionally this redundancy has been utilised through optimal control in the null-space. Some cost function is defined that encodes secondary movement goals and movements are optimised with respect to this functio
Multi-mode Trajectory Optimization for Impact-aware Manipulation
The transition from free motion to contact is a challenging problem in
robotics, in part due to its hybrid nature. Additionally, disregarding the
effects of impacts at the motion planning level often results in intractable
impulsive contact forces. In this paper, we introduce an impact-aware
multi-mode trajectory optimization (TO) method that combines hybrid dynamics
and hybrid control in a coherent fashion. A key concept is the incorporation of
an explicit contact force transmission model in the TO method. This allows the
simultaneous optimization of the contact forces, contact timings, continuous
motion trajectories and compliance, while satisfying task constraints. We
compare our method against standard compliance control and an impact-agnostic
TO method in physical simulations. Further, we experimentally validate the
proposed method with a robot manipulator on the task of halting a
large-momentum object
Set-based State Estimation with Probabilistic Consistency Guarantee under Epistemic Uncertainty
Consistent state estimation is challenging, especially under the epistemic
uncertainties arising from learned (nonlinear) dynamic and observation models.
In this work, we propose a set-based estimation algorithm, named Gaussian
Process-Zonotopic Kalman Filter (GP-ZKF), that produces zonotopic state
estimates while respecting both the epistemic uncertainties in the learned
models and aleatoric uncertainties. Our method guarantees probabilistic
consistency, in the sense that the true states are bounded by sets (zonotopes)
across all time steps, with high probability. We formally relate GP-ZKF with
the corresponding stochastic approach, GP-EKF, in the case of learned
(nonlinear) models. In particular, when linearization errors and aleatoric
uncertainties are omitted and epistemic uncertainties are simplified, GP-ZKF
reduces to GP-EKF. We empirically demonstrate our method's efficacy in both a
simulated pendulum domain and a real-world robot-assisted dressing domain,
where GP-ZKF produced more consistent and less conservative set-based estimates
than all baseline stochastic methods.Comment: Published at IEEE Robotics and Automation Letters, 2022. Video:
https://www.youtube.com/watch?v=CvIPJlALaFU Copyright: 2022 IEEE. Personal
use of this material is permitted. Permission from IEEE must be obtained for
all other uses, in any media, including reprinting/republishing for any
purposes, creating new works, for resale or redistribution, or reuse of any
copyrighted component of this wor
- …