
    Multi-Task Policy Search for Robotics

    No full text
    © 2014 IEEE. Learning policies that generalize across multiple tasks is an important and challenging research topic in reinforcement learning and robotics. Training individual policies for every single potential task is often impractical, especially for continuous task variations, calling for more principled approaches to sharing and transferring knowledge among similar tasks. We present a novel approach for learning a nonlinear feedback policy that generalizes across multiple tasks. The key idea is to define a parametrized policy as a function of both the state and the task, which allows learning a single policy that generalizes across multiple known and unknown tasks. Applications of our novel approach to reinforcement and imitation learning in real-robot experiments are shown.
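    The key idea above is that the policy takes a task descriptor as an extra input alongside the state, so one set of parameters covers a continuum of tasks. The sketch below illustrates that idea only; the RBF features, dimensions, and weights are illustrative assumptions, not the authors' exact parametrization.

```python
import numpy as np

def rbf_features(x, centers, lengthscale=1.0):
    """Radial-basis features over the joint (state, task) input."""
    d2 = ((x[None, :] - centers) ** 2).sum(axis=1)
    return np.exp(-0.5 * d2 / lengthscale**2)

class MultiTaskPolicy:
    """pi(s, tau) = W @ phi([s; tau]): one policy conditioned on the task."""

    def __init__(self, state_dim, task_dim, action_dim, n_features=50, seed=0):
        rng = np.random.default_rng(seed)
        self.centers = rng.normal(size=(n_features, state_dim + task_dim))
        self.W = rng.normal(scale=0.1, size=(action_dim, n_features))

    def act(self, state, task):
        # Conditioning on the task vector lets a single weight matrix W
        # generalize across tasks, including unseen task parameters.
        x = np.concatenate([state, task])
        return self.W @ rbf_features(x, self.centers)

policy = MultiTaskPolicy(state_dim=4, task_dim=2, action_dim=1)
u = policy.act(np.zeros(4), np.array([0.3, -0.1]))  # action for task (0.3, -0.1)
```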

    Combining Model-Based and Model-Free Updates for Trajectory-Centric Reinforcement Learning

    Full text link
    Reinforcement learning (RL) algorithms for real-world robotic applications need a data-efficient learning process and the ability to handle complex, unknown dynamical systems. These requirements are handled well by model-based and model-free RL approaches, respectively. In this work, we aim to combine the advantages of these two types of methods in a principled manner. By focusing on time-varying linear-Gaussian policies, we enable a model-based algorithm based on the linear quadratic regulator (LQR) that can be integrated into the model-free framework of path integral policy improvement (PI2). We can further combine our method with guided policy search (GPS) to train arbitrary parameterized policies such as deep neural networks. Our simulation and real-world experiments demonstrate that this method can solve challenging manipulation tasks with comparable or better performance than model-free methods while maintaining the sample efficiency of model-based methods. A video presenting our results is available at https://sites.google.com/site/icml17pilqr
    Comment: Paper accepted to the International Conference on Machine Learning (ICML) 2017
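    The abstract pairs a model-based LQR update with a model-free PI2-style update. Below is a toy scalar sketch of those two ingredients only, not the paper's PILQR algorithm: an LQR gain from a (possibly inaccurate) model, plus a PI2-style exponentiated-cost correction from sampled rollouts. The system parameters and stand-in costs are assumptions for illustration.

```python
import numpy as np

def lqr_gain(a, b, q, r, iters=100):
    """Discrete-time scalar LQR gain via Riccati iteration."""
    p = q
    for _ in range(iters):
        k = (b * p * a) / (r + b * p * b)
        p = q + a * p * a - a * p * b * k
    return k

def pi2_update(u_mean, costs, noises, temperature=1.0):
    """PI2-style update: reweight sampled perturbations by exp(-cost)."""
    w = np.exp(-(costs - costs.min()) / temperature)
    w /= w.sum()
    return u_mean + w @ noises

# Model-based step: u = -k * x from the (possibly inaccurate) model.
k = lqr_gain(a=1.1, b=0.5, q=1.0, r=0.1)

# Model-free correction around the LQR action, using sampled perturbations.
rng = np.random.default_rng(0)
x = 1.0
u_mb = -k * x
noises = rng.normal(scale=0.2, size=16)
costs = np.array([x**2 + 0.1 * (u_mb + e)**2 for e in noises])  # stand-in rollout costs
u = pi2_update(u_mb, costs, noises)
```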

    Automatic LQR Tuning Based on Gaussian Process Global Optimization

    Full text link
    This paper proposes an automatic controller tuning framework based on linear optimal control combined with Bayesian optimization. With this framework, an initial set of controller gains is automatically improved according to a pre-defined performance objective evaluated from experimental data. The underlying Bayesian optimization algorithm is Entropy Search, which represents the latent objective as a Gaussian process and constructs an explicit belief over the location of the objective minimum. This is used to maximize the information gain from each experimental evaluation. Thus, this framework should yield improved controllers with fewer evaluations than alternative approaches. A seven-degree-of-freedom robot arm balancing an inverted pole is used as the experimental demonstrator. Results of two- and four-dimensional tuning problems highlight the method's potential for automatic controller tuning on robotic platforms.
    Comment: 8 pages, 5 figures, to appear in IEEE 2016 International Conference on Robotics and Automation. Video demonstration of the experiments available at https://am.is.tuebingen.mpg.de/publications/marco_icra_201
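    To make the tuning loop concrete, here is a generic Bayesian-optimization skeleton for gain tuning. The paper's acquisition function is Entropy Search; expected improvement is used below as a simpler stand-in, and evaluate_controller() is a hypothetical placeholder for running the robot and measuring a cost for gains theta.

```python
import numpy as np
from scipy.stats import norm
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF

rng = np.random.default_rng(1)

def evaluate_controller(theta):
    """Hypothetical stand-in for an experiment: noisy cost of gains theta."""
    t = float(theta[0])
    return (t - 0.7) ** 2 + 0.01 * rng.normal()

def expected_improvement(mu, sigma, best):
    """EI acquisition for minimization (stand-in for Entropy Search)."""
    z = (best - mu) / np.maximum(sigma, 1e-9)
    return (best - mu) * norm.cdf(z) + sigma * norm.pdf(z)

candidates = np.linspace(0.0, 2.0, 200).reshape(-1, 1)
X = [np.array([0.2]), np.array([1.8])]             # initial gains
y = [evaluate_controller(x) for x in X]

for _ in range(10):                                # experimental budget
    gp = GaussianProcessRegressor(kernel=RBF(0.3), alpha=1e-4).fit(np.array(X), y)
    mu, sigma = gp.predict(candidates, return_std=True)
    theta_next = candidates[np.argmax(expected_improvement(mu, sigma, min(y)))]
    X.append(theta_next)
    y.append(evaluate_controller(theta_next))

print("best gains:", X[int(np.argmin(y))], "cost:", min(y))
```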

    Learning Contact-Rich Manipulation Skills with Guided Policy Search

    Full text link
    Autonomous learning of object manipulation skills can enable robots to acquire rich behavioral repertoires that scale to the variety of objects found in the real world. However, current motion skill learning methods typically restrict the behavior to a compact, low-dimensional representation, limiting its expressiveness and generality. In this paper, we extend a recently developed policy search method [la-lnnpg-14] and use it to learn a range of dynamic manipulation behaviors with highly general policy representations, without using known models or example demonstrations. Our approach learns a set of trajectories for the desired motion skill by using iteratively refitted time-varying linear models, and then unifies these trajectories into a single control policy that can generalize to new situations. To enable this method to run on a real robot, we introduce several improvements that reduce the sample count and automate parameter selection. We show that our method can acquire fast, fluent behaviors after only minutes of interaction time, and can learn robust controllers for complex tasks, including putting together a toy airplane, stacking tight-fitting Lego blocks, placing wooden rings onto tight-fitting pegs, inserting a shoe tree into a shoe, and screwing bottle caps onto bottles.
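    One ingredient the abstract names is refitting time-varying linear models from rollouts. The sketch below shows only that step, fitting x_{t+1} ≈ A_t x_t + B_t u_t + c_t by per-timestep least squares; the alternation with LQR-style trajectory updates and the supervised policy-unification step are omitted, and the toy system is an assumption.

```python
import numpy as np

def fit_tv_linear_dynamics(X, U):
    """X: (N, T+1, dx) states, U: (N, T, du) actions from N rollouts."""
    N, T, du = U.shape
    dx = X.shape[2]
    dynamics = []
    for t in range(T):
        # Regressors [x_t, u_t, 1]; targets x_{t+1}, solved jointly over rollouts.
        Z = np.hstack([X[:, t], U[:, t], np.ones((N, 1))])
        W, *_ = np.linalg.lstsq(Z, X[:, t + 1], rcond=None)
        A, B, c = W[:dx].T, W[dx:dx + du].T, W[-1]
        dynamics.append((A, B, c))
    return dynamics

# Toy usage: random rollouts of a noisy linear system.
rng = np.random.default_rng(0)
A_true, B_true = np.array([[1.0, 0.1], [0.0, 1.0]]), np.array([[0.0], [0.1]])
X = np.zeros((20, 11, 2)); U = rng.normal(size=(20, 10, 1))
for t in range(10):
    X[:, t + 1] = X[:, t] @ A_true.T + U[:, t] @ B_true.T + 0.01 * rng.normal(size=(20, 2))
dyn = fit_tv_linear_dynamics(X, U)  # list of (A_t, B_t, c_t) per timestep
```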

    Real Time Animation of Virtual Humans: A Trade-off Between Naturalness and Control

    Get PDF
    Virtual humans are employed in many interactive applications using 3D virtual environments, including (serious) games. The motion of such virtual humans should look realistic (or ‘natural’) and allow interaction with the surroundings and other (virtual) humans. Current animation techniques differ in the trade-off they offer between motion naturalness and the control that can be exerted over the motion. We show mechanisms to parametrize, combine (on different body parts) and concatenate motions generated by different animation techniques. We discuss several aspects of motion naturalness and show how it can be evaluated. We conclude by showing the promise of combinations of different animation paradigms to enhance both naturalness and control.
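    Two of the mechanisms mentioned, combining motions on different body parts and concatenating clips, can be illustrated with a minimal sketch. Joint rotations are simplified here to per-joint Euler angles and linear blending; production systems would typically blend rotations with quaternion slerp. All shapes and clips below are illustrative assumptions.

```python
import numpy as np

def combine_by_body_part(pose_a, pose_b, mask):
    """Take pose_a for joints where mask is True, pose_b elsewhere."""
    return np.where(mask[:, None], pose_a, pose_b)

def crossfade(clip_a, clip_b, blend_frames):
    """Concatenate two clips, linearly blending the overlapping frames."""
    w = np.linspace(0.0, 1.0, blend_frames)[:, None, None]
    overlap = (1 - w) * clip_a[-blend_frames:] + w * clip_b[:blend_frames]
    return np.concatenate([clip_a[:-blend_frames], overlap, clip_b[blend_frames:]])

# Toy usage: 20 joints, 3 Euler angles each; upper body waves while lower body walks.
walk = np.random.rand(60, 20, 3); wave = np.random.rand(60, 20, 3)
upper_body = np.zeros(20, dtype=bool); upper_body[10:] = True
pose = combine_by_body_part(wave[0], walk[0], upper_body)
seq = crossfade(walk, wave, blend_frames=10)
```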

    System Level Synthesis

    Get PDF
    This article surveys the System Level Synthesis (SLS) framework, which presents a novel perspective on constrained robust and optimal controller synthesis for linear systems. We show how SLS shifts the controller synthesis task from the design of a controller to the design of the entire closed-loop system, and highlight the benefits of this approach in terms of scalability and transparency. We emphasize two particular applications of SLS, namely large-scale distributed optimal control and robust control. In the case of distributed control, we show how SLS allows localized controllers to be computed, extending robust and optimal control methods to large-scale systems under practical and realistic assumptions. In the case of robust control, we show how SLS enables novel design methodologies that, for the first time, quantify the degradation in performance of a robust controller due to model uncertainty; such transparency is key in allowing robust control methods to interact, in a principled way, with modern techniques from machine learning and statistical inference. Throughout, we emphasize practical and efficient computational solutions, and demonstrate our methods on easy-to-understand case studies.
    Comment: To appear in Annual Reviews in Control
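    The central shift the abstract describes, optimizing over the closed-loop system rather than the controller, can be sketched as a finite-horizon convex program: decision variables are the closed-loop response maps Phi_x[t], Phi_u[t] from an initial disturbance to states and inputs, constrained by the linear achievability conditions Phi_x[0] = I and Phi_x[t+1] = A Phi_x[t] + B Phi_u[t]. The system matrices, horizon, and quadratic cost below are illustrative assumptions, not an example from the article.

```python
import cvxpy as cp
import numpy as np

A = np.array([[1.0, 0.2], [0.0, 1.0]])   # assumed double-integrator-like plant
B = np.array([[0.0], [0.2]])
n, m, T = 2, 1, 15

# Decision variables: closed-loop responses, not a controller.
Phi_x = [cp.Variable((n, n)) for _ in range(T + 1)]
Phi_u = [cp.Variable((m, n)) for _ in range(T)]

constraints = [Phi_x[0] == np.eye(n)]
for t in range(T):
    # Achievability: the responses must be consistent with the dynamics.
    constraints.append(Phi_x[t + 1] == A @ Phi_x[t] + B @ Phi_u[t])
constraints.append(Phi_x[T] == 0)  # finite-horizon: response dies out

# H2-like quadratic cost on the closed-loop maps; cost and constraints
# are convex in Phi, so the synthesis is a convex program.
cost = sum(cp.sum_squares(Phi_x[t]) for t in range(T + 1)) \
     + sum(cp.sum_squares(Phi_u[t]) for t in range(T))
cp.Problem(cp.Minimize(cost), constraints).solve()
```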