Search CORE

284 research outputs found

Incorporating prior knowledge into deep neural network controllers of legged robots

Author: Degrave Jonas
Publication venue: Universiteit Gent. Faculteit Ingenieurswetenschappen en Architectuur
Publication date: 01/01/2018
Field of study

RSG: Fast Learning Adaptive Skills for Quadruped Robots by Skill Graph

Author: Gai Sibo
Lyu Shangke
Shi Diyuan
Wang Donglin
Wei Zhenyu
Zhang Hongyin
Zhao Feng
Zhao Han
Zhuang Zifeng
Publication venue
Publication date: 10/11/2023
Field of study

Developing robotic intelligent systems that can adapt quickly to unseen wild situations is one of the critical challenges in pursuing autonomous robotics. Although some impressive progress has been made in walking stability and skill learning in the field of legged robots, their ability to fast adaptation is still inferior to that of animals in nature. Animals are born with massive skills needed to survive, and can quickly acquire new ones, by composing fundamental skills with limited experience. Inspired by this, we propose a novel framework, named Robot Skill Graph (RSG) for organizing massive fundamental skills of robots and dexterously reusing them for fast adaptation. Bearing a structure similar to the Knowledge Graph (KG), RSG is composed of massive dynamic behavioral skills instead of static knowledge in KG and enables discovering implicit relations that exist in be-tween of learning context and acquired skills of robots, serving as a starting point for understanding subtle patterns existing in robots' skill learning. Extensive experimental results demonstrate that RSG can provide rational skill inference upon new tasks and environments and enable quadruped robots to adapt to new scenarios and learn new skills rapidly

arXiv.org e-Print Archive

Grow Your Limits: Continuous Improvement with Real-World RL for Robotic Locomotion

Author: Cao Yunhao
Levine Sergey
Smith Laura
Publication venue
Publication date: 26/10/2023
Field of study

Deep reinforcement learning (RL) can enable robots to autonomously acquire complex behaviors, such as legged locomotion. However, RL in the real world is complicated by constraints on efficiency, safety, and overall training stability, which limits its practical applicability. We present APRL, a policy regularization framework that modulates the robot's exploration over the course of training, striking a balance between flexible improvement potential and focused, efficient exploration. APRL enables a quadrupedal robot to efficiently learn to walk entirely in the real world within minutes and continue to improve with more training where prior work saturates in performance. We demonstrate that continued training with APRL results in a policy that is substantially more capable of navigating challenging situations and is able to adapt to changes in dynamics with continued training.Comment: First two authors contributed equally. Project website: https://sites.google.com/berkeley.edu/apr

arXiv.org e-Print Archive