14 research outputs found

Learning from Outside the Viability Kernel: Why we Should Build Robots that can Fall with Grace

Author: Heim Steve
Spröwitz Alexander
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2018

Despite impressive results using reinforcement learning to solve complex problems from scratch, in robotics this has still been largely limited to model-based learning with very informative reward functions. One of the major challenges is that the reward landscape often has large patches with no gradient, making it difficult to sample gradients effectively. We show here that the robot state-initialization can have a more important effect on the reward landscape than is generally expected. In particular, we show the counter-intuitive benefit of including initializations that are unviable, in other words, initializing in states that are doomed to fail.

Comment: Proceedings of the 2018 IEEE International Conference on Simulation, Modeling and Programming for Autonomous Robots (SIMPAR), Brisbane, Australia, 16-19 201

arXiv.org e-Print Archive

Crossref

MPG.PuRe

Beyond Basins of Attraction: Quantifying Robustness of Natural Dynamics

Author: Heim Steve
Spröwitz Alexander
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/08/2019

Properly designing a system to exhibit favorable natural dynamics can greatly simplify designing or learning the control policy. However, it is still unclear what constitutes favorable natural dynamics and how to quantify its effect. Most studies of simple walking and running models have focused on the basins of attraction of passive limit-cycles and the notion of self-stability. We instead emphasize the importance of stepping beyond basins of attraction. We show an approach based on viability theory to quantify robust sets in state-action space. These sets are valid for the family of all robust control policies, which allows us to quantify the robustness inherent to the natural dynamics before designing the control policy or specifying a control objective. We illustrate our formulation using spring-mass models, simple low-dimensional models of running systems. We then show an example application by optimizing robustness of a simulated planar monoped, using a gradient-free optimization scheme. Both case studies result in a nonlinear effective stiffness providing more robustness.

Comment: 15 pages. This work has been accepted to IEEE Transactions on Robotics (2019

arXiv.org e-Print Archive

MPG.PuRe

Optimization-based Full Body Control for the DARPA Robotics Challenge

Author: Escande
Hutter
Jacobson
Kanoun
Khatib
Lasa
Liu
Nakamura
Righetti
Saab
Sentis
Stephens
Vukobratović
Wampler
Whitman
Wu
Publication venue: 'Wiley'

Crossref