58 research outputs found
OPT-Mimic: Imitation of Optimized Trajectories for Dynamic Quadruped Behaviors
Reinforcement Learning (RL) has seen many recent successes for quadruped
robot control. The imitation of reference motions provides a simple and
powerful prior for guiding solutions towards desired solutions without the need
for meticulous reward design. While much work uses motion capture data or
hand-crafted trajectories as the reference motion, relatively little work has
explored the use of reference motions coming from model-based trajectory
optimization. In this work, we investigate several design considerations that
arise with such a framework, as demonstrated through four dynamic behaviours:
trot, front hop, 180 backflip, and biped stepping. These are trained in
simulation and transferred to a physical Solo 8 quadruped robot without further
adaptation. In particular, we explore the space of feed-forward designs
afforded by the trajectory optimizer to understand its impact on RL learning
efficiency and sim-to-real transfer. These findings contribute to the long
standing goal of producing robot controllers that combine the interpretability
and precision of model-based optimization with the robustness that model-free
RL-based controllers offer
Hierarchical Planning and Control for Box Loco-Manipulation
Humans perform everyday tasks using a combination of locomotion and
manipulation skills. Building a system that can handle both skills is essential
to creating virtual humans. We present a physically-simulated human capable of
solving box rearrangement tasks, which requires a combination of both skills.
We propose a hierarchical control architecture, where each level solves the
task at a different level of abstraction, and the result is a physics-based
simulated virtual human capable of rearranging boxes in a cluttered
environment. The control architecture integrates a planner, diffusion models,
and physics-based motion imitation of sparse motion clips using deep
reinforcement learning. Boxes can vary in size, weight, shape, and placement
height. Code and trained control policies are provided
Recommended from our members
Joint analysis of three genome-wide association studies of esophageal squamous cell carcinoma in Chinese populations
We conducted a joint (pooled) analysis of three genome-wide association studies (GWAS) 1-3 of esophageal squamous cell carcinoma (ESCC) in ethnic Chinese (5,337 ESCC cases and 5,787 controls) with 9,654 ESCC cases and 10,058 controls for follow-up. In a logistic regression model adjusted for age, sex, study, and two eigenvectors, two new loci achieved genome-wide significance, marked by rs7447927 at 5q31.2 (per-allele odds ratio (OR) = 0.85, 95% CI 0.82-0.88; P=7.72x10−20) and rs1642764 at 17p13.1 (per-allele OR= 0.88, 95% CI 0.85-0.91; P=3.10x10−13). rs7447927 is a synonymous single nucleotide polymorphism (SNP) in TMEM173 and rs1642764 is an intronic SNP in ATP1B2, near TP53. Furthermore, a locus in the HLA class II region at 6p21.32 (rs35597309) achieved genome-wide significance in the two populations at highest risk for ESSC (OR=1.33, 95% CI 1.22-1.46; P=1.99x10−10). Our joint analysis identified new ESCC susceptibility loci overall as well as a new locus unique to the ESCC high risk Taihang Mountain region
- …