Search CORE

8,372 research outputs found

Universal Learning of Repeated Matrix Games

Author: Hutter Marcus
Poland Jan
Publication venue
Publication date: 01/01/2005
Field of study

We study and compare the learning dynamics of two universal learning algorithms, one based on Bayesian learning and the other on prediction with expert advice. Both approaches have strong asymptotic performance guarantees. When confronted with the task of finding good long-term strategies in repeated 2x2 matrix games, they behave quite differently.Comment: 16 LaTeX pages, 8 eps figure

arXiv.org e-Print Archive

CiteSeerX

The Australian National University

Body randomization reduces the sim-to-real gap for compliant quadruped locomotion

Author: Dambre Joni
Mahmud Hossain
Urbain Gabriel
Vandesompele Alexander
wyffels Francis
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2019
Field of study

Designing controllers for compliant, underactuated robots is challenging and usually requires a learning procedure. Learning robotic control in simulated environments can speed up the process whilst lowering risk of physical damage. Since perfect simulations are unfeasible, several techniques are used to improve transfer to the real world. Here, we investigate the impact of randomizing body parameters during learning of CPG controllers in simulation. The controllers are evaluated on our physical quadruped robot. We find that body randomization in simulation increases chances of finding gaits that function well on the real robot

Ghent University Academic Bibliography

Directory of Open Access Journals

Human Like Adaptation of Force and Impedance in Stable and Unstable Tasks

Author: Albu-Schaeffer A
Burdet E
Ganesh G
Haddadin S
Parusel , S
Yang C
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2011
Field of study

Abstract—This paper presents a novel human-like learning con-troller to interact with unknown environments. Strictly derived from the minimization of instability, motion error, and effort, the controller compensates for the disturbance in the environment in interaction tasks by adapting feedforward force and impedance. In contrast with conventional learning controllers, the new controller can deal with unstable situations that are typical of tool use and gradually acquire a desired stability margin. Simulations show that this controller is a good model of human motor adaptation. Robotic implementations further demonstrate its capabilities to optimally adapt interaction with dynamic environments and humans in joint torque controlled robots and variable impedance actuators, with-out requiring interaction force sensing. Index Terms—Feedforward force, human motor control, impedance, robotic control. I

Institute of Transport Research:Publications

CiteSeerX

Crossref

Plymouth Electronic Archive and Research Library