5,073 research outputs found
Deep Reinforcement Learning for Tensegrity Robot Locomotion
Tensegrity robots, composed of rigid rods connected by elastic cables, have a
number of unique properties that make them appealing for use as planetary
exploration rovers. However, control of tensegrity robots remains a difficult
problem due to their unusual structures and complex dynamics. In this work, we
show how locomotion gaits can be learned automatically using a novel extension
of mirror descent guided policy search (MDGPS) applied to periodic locomotion
movements, and we demonstrate the effectiveness of our approach on tensegrity
robot locomotion. We evaluate our method with real-world and simulated
experiments on the SUPERball tensegrity robot, showing that the learned
policies generalize to changes in system parameters, unreliable sensor
measurements, and variation in environmental conditions, including varied
terrains and a range of different gravities. Our experiments demonstrate that
our method not only learns fast, power-efficient feedback policies for rolling
gaits, but that these policies can succeed with only the limited onboard
sensing provided by SUPERball's accelerometers. We compare the learned feedback
policies to learned open-loop policies and hand-engineered controllers, and
demonstrate that the learned policy enables the first continuous, reliable
locomotion gait for the real SUPERball robot. Our code and other supplementary
materials are available from http://rll.berkeley.edu/drl_tensegrityComment: International Conference on Robotics and Automation (ICRA), 2017.
Project website link is http://rll.berkeley.edu/drl_tensegrit
Efficient Humanoid Contact Planning using Learned Centroidal Dynamics Prediction
Humanoid robots dynamically navigate an environment by interacting with it
via contact wrenches exerted at intermittent contact poses. Therefore, it is
important to consider dynamics when planning a contact sequence. Traditional
contact planning approaches assume a quasi-static balance criterion to reduce
the computational challenges of selecting a contact sequence over a rough
terrain. This however limits the applicability of the approach when dynamic
motions are required, such as when walking down a steep slope or crossing a
wide gap. Recent methods overcome this limitation with the help of efficient
mixed integer convex programming solvers capable of synthesizing dynamic
contact sequences. Nevertheless, its exponential-time complexity limits its
applicability to short time horizon contact sequences within small
environments. In this paper, we go beyond current approaches by learning a
prediction of the dynamic evolution of the robot centroidal momenta, which can
then be used for quickly generating dynamically robust contact sequences for
robots with arms and legs using a search-based contact planner. We demonstrate
the efficiency and quality of the results of the proposed approach in a set of
dynamically challenging scenarios
Deep Kernels for Optimizing Locomotion Controllers
Sample efficiency is important when optimizing parameters of locomotion
controllers, since hardware experiments are time consuming and expensive.
Bayesian Optimization, a sample-efficient optimization framework, has recently
been widely applied to address this problem, but further improvements in sample
efficiency are needed for practical applicability to real-world robots and
high-dimensional controllers. To address this, prior work has proposed using
domain expertise for constructing custom distance metrics for locomotion. In
this work we show how to learn such a distance metric automatically. We use a
neural network to learn an informed distance metric from data obtained in
high-fidelity simulations. We conduct experiments on two different controllers
and robot architectures. First, we demonstrate improvement in sample efficiency
when optimizing a 5-dimensional controller on the ATRIAS robot hardware. We
then conduct simulation experiments to optimize a 16-dimensional controller for
a 7-link robot model and obtain significant improvements even when optimizing
in perturbed environments. This demonstrates that our approach is able to
enhance sample efficiency for two different controllers, hence is a fitting
candidate for further experiments on hardware in the future.Comment: (Rika Antonova and Akshara Rai contributed equally
Feedback Control of an Exoskeleton for Paraplegics: Toward Robustly Stable Hands-free Dynamic Walking
This manuscript presents control of a high-DOF fully actuated lower-limb
exoskeleton for paraplegic individuals. The key novelty is the ability for the
user to walk without the use of crutches or other external means of
stabilization. We harness the power of modern optimization techniques and
supervised machine learning to develop a smooth feedback control policy that
provides robust velocity regulation and perturbation rejection. Preliminary
evaluation of the stability and robustness of the proposed approach is
demonstrated through the Gazebo simulation environment. In addition,
preliminary experimental results with (complete) paraplegic individuals are
included for the previous version of the controller.Comment: Submitted to IEEE Control System Magazine. This version addresses
reviewers' concerns about the robustness of the algorithm and the motivation
for using such exoskeleton
Tuning Legged Locomotion Controllers via Safe Bayesian Optimization
This paper presents a data-driven strategy to streamline the deployment of
model-based controllers in legged robotic hardware platforms. Our approach
leverages a model-free safe learning algorithm to automate the tuning of
control gains, addressing the mismatch between the simplified model used in the
control formulation and the real system. This method substantially mitigates
the risk of hazardous interactions with the robot by sample-efficiently
optimizing parameters within a probably safe region. Additionally, we extend
the applicability of our approach to incorporate the different gait parameters
as contexts, leading to a safe, sample-efficient exploration algorithm capable
of tuning a motion controller for diverse gait patterns. We validate our method
through simulation and hardware experiments, where we demonstrate that the
algorithm obtains superior performance on tuning a model-based motion
controller for multiple gaits safely.Comment: This paper has been accepted to the 2023 Conference on Robot Learning
(CoRL 2023.) The first two authors contributed equally. The supplementary
video is available at https://youtu.be/zDBouUgegrU and the code
implementation is available at https://github.com/lasgroup/gosafeop
- …