8 research outputs found

    Bipedal Walking Energy Minimization by Reinforcement Learning with Evolving Policy Parameterization

    We present a learning-based approach for minimizing the electric energy consumption during walking of a passively compliant bipedal robot. The energy consumption is reduced by learning a varying-height center-of-mass trajectory that efficiently exploits the robot's passive compliance. To do this, we propose a reinforcement learning method that evolves the policy parameterization dynamically during the learning process and thus finds better policies faster than with a fixed parameterization. The method is first tested on a function approximation task and then applied to the humanoid robot COMAN, where it achieves a significant energy reduction. © 2011 IEEE
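    The core idea of an evolving policy parameterization can be illustrated with a minimal sketch: encode the learned trajectory with Gaussian radial basis functions and, when the representation needs refinement, double the number of basis functions while preserving the trajectory encoded so far. The function names, basis widths, and least-squares re-fit below are illustrative assumptions, not the paper's implementation:

    ```python
    import numpy as np

    def rbf_features(t, centers, width):
        # Gaussian basis functions evaluated at times t (shape: len(t) x len(centers))
        return np.exp(-((t[:, None] - centers[None, :]) ** 2) / (2 * width ** 2))

    def evolve_parameterization(weights, centers, width):
        """Double the number of basis functions while preserving the
        trajectory currently encoded by (weights, centers, width)."""
        t = np.linspace(0.0, 1.0, 200)
        old_traj = rbf_features(t, centers, width) @ weights
        new_centers = np.linspace(0.0, 1.0, 2 * len(centers))
        new_width = width / 2
        # Re-fit the finer parameterization to reproduce the old trajectory
        Phi = rbf_features(t, new_centers, new_width)
        new_weights, *_ = np.linalg.lstsq(Phi, old_traj, rcond=None)
        return new_weights, new_centers, new_width
    ```

    Learning can then continue in the finer parameter space, which can represent trajectories the coarse one could not, without discarding what was already learned.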

    Synergy-based policy improvement with path integrals for anthropomorphic hands

    In this work, a synergy-based reinforcement learning algorithm has been developed to confer autonomous grasping capabilities on anthropomorphic hands. In the presence of many degrees of freedom, classical machine learning techniques require a number of iterations that grows with the size of the problem, so convergence of the solution is not ensured. The use of postural synergies reduces the dimensionality of the search space and makes recent learning techniques, such as Policy Improvement with Path Integrals, easily applicable. A key point is the adoption of a suitable reward function that represents the goal of the task and ensures one-step performance evaluation. The force-closure quality of the grasp in the synergy subspace has been chosen as the cost function for performance evaluation. Experiments conducted on the SCHUNK 5-Finger Hand demonstrate the effectiveness of the algorithm, showing skills comparable to human capabilities in learning new grasps and in performing a wide variety of grasps, from power grasps to high-precision grasps of very small objects.
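    The parameter update at the heart of Policy Improvement with Path Integrals can be sketched in a few lines: perturb the current parameters (here standing in for synergy coefficients) with Gaussian exploration noise, evaluate the cost of each rollout, and average the perturbations weighted by an exponential of the negative cost. The toy quadratic cost in the test and the hyperparameter values below are assumptions for illustration; the paper uses a force-closure grasp-quality cost:

    ```python
    import numpy as np

    def pi2_update(theta, cost_fn, n_rollouts=20, sigma=0.1, h=10.0, rng=None):
        """One PI^2-style update: explore with Gaussian noise, weight the
        samples by exp(-h * cost), and return the cost-weighted average."""
        if rng is None:
            rng = np.random.default_rng(0)
        eps = rng.normal(0.0, sigma, size=(n_rollouts, len(theta)))
        costs = np.array([cost_fn(theta + e) for e in eps])
        # Normalize costs to [0, 1] so the softmax temperature h is well scaled
        c = (costs - costs.min()) / (np.ptp(costs) + 1e-12)
        w = np.exp(-h * c)
        w /= w.sum()
        return theta + w @ eps
    ```

    Because the update is a weighted average of sampled perturbations, it needs no gradient of the cost — one reason the method pairs well with a black-box grasp-quality measure.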

    Lamarckian Evolution of Simulated Modular Robots

    We study evolutionary robot systems in which not only the robot brains but also the robot bodies are evolvable. Such systems need to include a learning period right after 'birth' to acquire a controller that fits the newly created body. In this paper we investigate the possibility of bootstrapping infant robot learning through Lamarckian inheritance of parental controllers. In our system, controllers are encoded by a combination of a morphology-dependent component, a Central Pattern Generator (CPG), and a morphology-independent part, a Compositional Pattern Producing Network (CPPN). This makes it possible to transfer the CPPN part of a controller between different morphologies and thus to create a Lamarckian system. We conduct experiments with simulated modular robots whose fitness is determined by the speed of locomotion, establish the benefits of inheriting optimized parental controllers, shed light on the conditions that influence these benefits, and observe that changing the way controllers are evolved also impacts the evolved morphologies.
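    The morphology-dependent CPG component is commonly modeled as a network of coupled phase oscillators, one per robot module, whose coupling pulls the modules into a coordinated gait. A minimal sketch of such a CPG step is below; the Kuramoto-style coupling term and the parameter values are generic assumptions, not the specific CPG formulation used in the paper:

    ```python
    import numpy as np

    def cpg_step(phases, freqs, coupling, dt=0.01):
        """One Euler step of a coupled phase-oscillator CPG.
        Each module's actuator signal would be e.g. sin(phase)."""
        n = len(phases)
        dphi = 2 * np.pi * freqs  # intrinsic oscillation rates
        for i in range(n):
            for j in range(n):
                # Coupling pulls oscillator i toward oscillator j's phase
                dphi = dphi.copy()
                dphi[i] += coupling[i, j] * np.sin(phases[j] - phases[i])
        return phases + dt * dphi
    ```

    With sufficiently strong symmetric coupling, two oscillators with equal intrinsic frequency lock to a common phase — the synchronization that makes the modules move as one gait.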

    Dynamic Walking: Toward Agile and Efficient Bipedal Robots

    Dynamic walking on bipedal robots has evolved from an idea in science fiction to a practical reality. This is due to continued progress in three key areas: a mathematical understanding of locomotion, the computational ability to encode this mathematics through optimization, and the hardware capable of realizing this understanding in practice. In this context, this review article outlines the end-to-end process of methods that have proven effective in the literature for achieving dynamic walking on bipedal robots. We begin by introducing mathematical models of locomotion, from reduced-order models that capture essential walking behaviors to hybrid dynamical systems that encode the full-order continuous dynamics along with discrete footstrike dynamics. These models form the basis for gait generation via (nonlinear) optimization problems. Finally, models and their generated gaits merge in the context of real-time control, wherein walking behaviors are translated to hardware. The concepts presented are illustrated throughout in simulation, and experimental instantiations on multiple walking platforms are highlighted to demonstrate the ability to realize dynamic walking on bipedal robots that is agile and efficient.
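    The canonical reduced-order model referenced in this line of work is the Linear Inverted Pendulum (LIP): the center of mass moves at constant height above the stance foot, giving linear dynamics and a closed-form "capture point" for foot placement. A minimal sketch under those standard assumptions (the specific numbers below are illustrative):

    ```python
    import numpy as np

    def lip_step(x, xdot, z0=0.9, g=9.81, dt=0.001):
        """Euler step of the Linear Inverted Pendulum: CoM at constant
        height z0, stance foot (pivot) at the origin."""
        xddot = (g / z0) * x  # gravity tips the CoM away from the pivot
        return x + dt * xdot, xdot + dt * xddot

    def capture_point(x, xdot, z0=0.9, g=9.81):
        """Instantaneous capture point: where to place the foot so the
        divergent component of motion vanishes and the CoM comes to rest."""
        omega = np.sqrt(g / z0)
        return x + xdot / omega
    ```

    Placing the foot at the capture point zeroes the unstable mode of the LIP, which is why this quantity anchors many of the footstep-planning optimizations the review surveys.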

    Reinforcement Learning Framework for the Self-Learning Suppression of Clutch Judder in Automotive Drive Trains

    In electromechanically actuated clutches, active damping of vibrations by means of clamping-force control allows the use of high-performance materials in the friction pairing, which makes the clutch design more energy- and cost-efficient. In this work, a reinforcement learning framework for controlling the clamping force to actively suppress judder vibrations is proposed and developed.
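    The structure of such a framework can be sketched with a deliberately tiny Q-learning loop. Everything below is a toy stand-in, not the paper's framework: the judder state is reduced to the sign of the slip-speed oscillation, the action is whether to lower or raise the clamping force, and the reward simply favors modulating the force against the oscillation (active damping):

    ```python
    import numpy as np

    # Toy stand-in for the drivetrain: state 0 = slip speed falling,
    # state 1 = slip speed rising; action 0 = lower clamping force,
    # action 1 = raise it.
    STATES, ACTIONS = 2, 2

    def reward(state, action):
        # Damping is rewarded when the force modulation opposes the oscillation
        return 1.0 if action != state else -1.0

    def q_learning(episodes=500, alpha=0.1, eps=0.2, rng=None):
        """Epsilon-greedy one-step Q-learning on the toy damping task."""
        if rng is None:
            rng = np.random.default_rng(0)
        Q = np.zeros((STATES, ACTIONS))
        for _ in range(episodes):
            s = int(rng.integers(STATES))
            if rng.random() < eps:
                a = int(rng.integers(ACTIONS))   # explore
            else:
                a = int(np.argmax(Q[s]))          # exploit
            Q[s, a] += alpha * (reward(s, a) - Q[s, a])
        return Q
    ```

    The learned greedy policy opposes the oscillation in both states; a real drivetrain controller would of course use a continuous state (slip speed, its derivative) and a dynamics simulation or test bench in place of the one-line reward.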