Search CORE

1,115 research outputs found

Biped dynamic walking using reinforcement learning

Author: Benbrahim Hamid
Publication venue: University of New Hampshire Scholars\u27 Repository
Publication date: 01/01/1996
Field of study

This thesis presents a study of biped dynamic walking using reinforcement learning. A hardware biped robot was built. It uses low gear ratio DC motors in order to provide free leg movements. The Self Scaling Reinforcement learning algorithm was developed in order to deal with the problem of reinforcement learning in continuous action domains. A new learning architecture was designed to solve complex control problems. It uses different modules that consist of simple controllers and small neural networks. The architecture allows for easy incorporation of modules that represent new knowledge, or new requirements for the desired task. Control experiments were carried out using a simulator and the physical biped. The biped learned dynamic walking on flat surfaces without any previous knowledge about its dynamic model

UNH Scholars' Repository

Stable Walking Pattern Generation for a Biped Robot Using Reinforcement Learning

Author: Jun Ho Oh
Jungho Lee
Publication venue: 'IntechOpen'
Publication date: 01/01/2009
Field of study

IntechOpen

Bayesian Optimization Using Domain Knowledge on the ATRIAS Biped

Author: Antonova Rika
Atkeson Christopher G.
Geyer Hartmut
Martin William
Rai Akshara
Song Seungmoon
Publication venue
Publication date: 18/09/2017
Field of study

Controllers in robotics often consist of expert-designed heuristics, which can be hard to tune in higher dimensions. It is typical to use simulation to learn these parameters, but controllers learned in simulation often don't transfer to hardware. This necessitates optimization directly on hardware. However, collecting data on hardware can be expensive. This has led to a recent interest in adapting data-efficient learning techniques to robotics. One popular method is Bayesian Optimization (BO), a sample-efficient black-box optimization scheme, but its performance typically degrades in higher dimensions. We aim to overcome this problem by incorporating domain knowledge to reduce dimensionality in a meaningful way, with a focus on bipedal locomotion. In previous work, we proposed a transformation based on knowledge of human walking that projected a 16-dimensional controller to a 1-dimensional space. In simulation, this showed enhanced sample efficiency when optimizing human-inspired neuromuscular walking controllers on a humanoid model. In this paper, we present a generalized feature transform applicable to non-humanoid robot morphologies and evaluate it on the ATRIAS bipedal robot -- in simulation and on hardware. We present three different walking controllers; two are evaluated on the real robot. Our results show that this feature transform captures important aspects of walking and accelerates learning on hardware and simulation, as compared to traditional BO.Comment: 8 pages, submitted to IEEE International Conference on Robotics and Automation 201

arXiv.org e-Print Archive

Crossref

Reinforcement Learning Algorithms in Humanoid Robotics

Author: Dusko Katic
Miomir Vukobratovic
Publication venue: 'IntechOpen'
Publication date: 01/06/2007
Field of study

IntechOpen

Crossref