Search CORE

6 research outputs found

Policy search for motor primitives in robotics

Author: A. El-Fakdi
A. J. Ijspeert
A. J. Ijspeert
A. P. Dempster
A. Y. Ng
A. Y. Ng
C. Andrieu
C. G. Atkeson
C. Sumners
D. E. Kirk
D. H. Park
E. A. Theodorou
F. Guenter
F. Sehnke
G. J. McLachan
G. Lawrence
G. Wulf
H. Attias
H. J. A. Martín
H. Miyamoto
I. Fantoni
I. Kwee
J. Bagnell
J. Bagnell
J. Binder
J. Kober
J. Kober
J. Kober
J. Peters
J. Peters
J. Peters
J. Peters
Jan Peters
Jens Kober
K. Takenaka
M. E. Taylor
M. Hoffman
M. Strens
M. Toussaint
N. Vlassis
P. Dayan
P. Kormushev
R. J. Williams
R. S. Sutton
R. S. Sutton
R. Sutton
R. Tedrake
S. Chiappa
S. Sato
S. Schaal
S. Schaal
S. Schaal
T. Jaakkola
T. Rückstieß
V. Gullapalli
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Reply

Author: Liu F
Rückstieß T
Publication venue: 'Wiley'
Publication date
Field of study

Crossref

Learning a humanoid kick with controlled distance

Author: E Theodorou
JM Wang
M Depinet
N Hansen
R Ferreira
T Rückstieß
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

We investigate the learning of a flexible humanoid robot kick controller, i.e., the controller should be applicable for multiple contexts, such as different kick distances, initial robot position with respect to the ball or both. Current approaches typically tune or optimise the parameters of the biped kick controller for a single context, such as a kick with longest distance or a kick with a specific distance. Hence our research question is that, how can we obtain a flexible kick controller that controls the robot (near) optimally for a continuous range of kick distances? The goal is to find a parametric function that given a desired kick distance, outputs the (near) optimal controller parameters. We achieve the desired flexibility of the controller by applying a contextual policy search method. With such a contextual policy search algorithm, we can generalize the robot kick controller for different distances, where the desired distance is described by a real-valued vector. We will also show that the optimal parameters of the kick controller is a non-linear function of the desired distances and a linear function will fail to properly generalize the kick controller over desired kick distances.FCT - Fundação para a Ciência e a Tecnologia(PEst-OE/EEI/UI0027/2013)info:eu-repo/semantics/publishedVersio

Universidade do Minho: RepositoriUM

Crossref

Baseline-Free Sampling in Parameter Exploring Policy Gradients: Super Symmetric PGPE

Author: E. Greensmith
F. Sehnke
F. Sehnke
G.S. Fishman
J. Schmidhuber
M. Grüttner
R.J. Williams
R.S. Sutton
T. Rückstieß
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

Crossref

A Natural Evolution Strategy for Multi-Objective Optimization

Author: A.C. Coello Coello
C. Igel
D. Wierstra
E. Zitzler
E. Zitzler
J. Knowles
K. Deb
N. Hansen
S. Kern
T. Rückstieß
Publication venue
Publication date: 01/01/2010
Field of study

Abstract. The recently introduced family of natural evolution strategies (NES), a novel stochastic descent method employing the natural gradient, is providing a more principled alternative to the well-known covariance matrix adaptation evolution strategy (CMA-ES). Until now, NES could only be used for single-objective optimization. This paper extends the approach to the multi-objective case, by first deriving a (1+1) hillclimber version of NES which is then used as the core component of a multi-objective optimization algorithm. We empirically evaluate the approach on a battery of benchmark functions and find it to be competitive with the state-of-the-art.

CiteSeerX

Crossref

Reinforcement Learning in Robotics: A Survey

Author: A. Coates
A.G. Barto
B.D. Argall
C. Atkeson
C. Touzet
D.E. Kirk
F. Guenter
G. Endo
H. Benbrahim
H. Miyamoto
J. Morimoto
J. Nakanishi
J. Peters
J. Peters
J. Peters
J.T. Betts
J.Y. Donnart
K.J. Åström
L. Buşoniu
L. Paletta
L.P. Kaelbling
M. Asada
M. Riedmiller
M.-A. Sato
M.J. Mataric
M.M. Svinin
M.S. Erden
N. Vlassis
O. Kroemer
P. Dayan
R. Sutton
R. Tedrake
R.E. Bellman
R.E. Bellman
R.E. Bellman
R.J. Williams
S. Mahadevan
S. Schaal
S. Schaal
S. Schaal
S. Thrun
T. Latzke
T. Rückstieß
T. Tamei
T. Yasuda
V. Gullapalli
Y. Duan
Y. Duan
Z. Kalmár
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

Crossref