17,283 research outputs found
Trajectory Optimization Through Contacts and Automatic Gait Discovery for Quadrupeds
In this work we present a trajectory Optimization framework for whole-body
motion planning through contacts. We demonstrate how the proposed approach can
be applied to automatically discover different gaits and dynamic motions on a
quadruped robot. In contrast to most previous methods, we do not pre-specify
contact switches, timings, points or gait patterns, but they are a direct
outcome of the optimization. Furthermore, we optimize over the entire dynamics
of the robot, which enables the optimizer to fully leverage the capabilities of
the robot. To illustrate the spectrum of achievable motions, here we show eight
different tasks, which would require very different control structures when
solved with state-of-the-art methods. Using our trajectory Optimization
approach, we are solving each task with a simple, high level cost function and
without any changes in the control structure. Furthermore, we fully integrated
our approach with the robot's control and estimation framework such that
optimization can be run online. By demonstrating a rough manipulation task with
multiple dynamic contact switches, we exemplarily show how optimized
trajectories and control inputs can be directly applied to hardware.Comment: Video: https://youtu.be/sILuqJBsyK
Learning a Unified Control Policy for Safe Falling
Being able to fall safely is a necessary motor skill for humanoids performing
highly dynamic tasks, such as running and jumping. We propose a new method to
learn a policy that minimizes the maximal impulse during the fall. The
optimization solves for both a discrete contact planning problem and a
continuous optimal control problem. Once trained, the policy can compute the
optimal next contacting body part (e.g. left foot, right foot, or hands),
contact location and timing, and the required joint actuation. We represent the
policy as a mixture of actor-critic neural network, which consists of n control
policies and the corresponding value functions. Each pair of actor-critic is
associated with one of the n possible contacting body parts. During execution,
the policy corresponding to the highest value function will be executed while
the associated body part will be the next contact with the ground. With this
mixture of actor-critic architecture, the discrete contact sequence planning is
solved through the selection of the best critics while the continuous control
problem is solved by the optimization of actors. We show that our policy can
achieve comparable, sometimes even higher, rewards than a recursive search of
the action space using dynamic programming, while enjoying 50 to 400 times of
speed gain during online execution
Learning Contact-Rich Manipulation Skills with Guided Policy Search
Autonomous learning of object manipulation skills can enable robots to
acquire rich behavioral repertoires that scale to the variety of objects found
in the real world. However, current motion skill learning methods typically
restrict the behavior to a compact, low-dimensional representation, limiting
its expressiveness and generality. In this paper, we extend a recently
developed policy search method \cite{la-lnnpg-14} and use it to learn a range
of dynamic manipulation behaviors with highly general policy representations,
without using known models or example demonstrations. Our approach learns a set
of trajectories for the desired motion skill by using iteratively refitted
time-varying linear models, and then unifies these trajectories into a single
control policy that can generalize to new situations. To enable this method to
run on a real robot, we introduce several improvements that reduce the sample
count and automate parameter selection. We show that our method can acquire
fast, fluent behaviors after only minutes of interaction time, and can learn
robust controllers for complex tasks, including putting together a toy
airplane, stacking tight-fitting lego blocks, placing wooden rings onto
tight-fitting pegs, inserting a shoe tree into a shoe, and screwing bottle caps
onto bottles
- …