8 research outputs found
Review of the techniques used in motor-cognitive human-robot skill transfer
Abstract Conventional robot programming methods severely limit the reusability of skills: engineers programme a robot in a targeted manner to realise predefined skills. The low reusability of general-purpose robot skills is mainly reflected in their inability to cope with novel and complex scenarios. Skill transfer aims to transfer human skills to general-purpose manipulators or mobile robots so that they can replicate human-like behaviours. Commonly used skill transfer methods, such as learning from demonstration (LfD) or imitation learning, endow the robot with the expert's low-level motor and high-level decision-making abilities, so that skills can be reproduced and generalised according to the perceived context. Improving robot cognition usually means improving the autonomous high-level decision-making ability. Based on the idea of establishing a generic or specialised robot skill library, robots are expected to reason autonomously about when skills are needed and to plan compound movements according to sensory input. In recent years, many successful studies in this area have demonstrated their effectiveness. Herein, a detailed review is provided of skill transfer techniques, applications, advancements, and limitations, especially in LfD. Future research directions are also suggested.
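The simplest instance of the LfD idea described above is behavioural cloning: fit a policy by regressing the expert's demonstrated actions on the states in which they were taken. The sketch below is purely illustrative (the data, the linear policy class, and the variable names are all assumptions, not taken from the review):

```python
import numpy as np

# Hypothetical demonstration data: states and the expert's actions.
# Behavioural cloning, one simple form of LfD, fits a policy by
# regressing actions on states; here a linear policy via least squares.
rng = np.random.default_rng(0)
states = rng.uniform(-1, 1, size=(100, 2))     # 100 demos of a 2-D state
true_gain = np.array([[1.5], [-0.5]])          # expert's (unknown) feedback gains
actions = states @ true_gain                   # demonstrated actions

# Fit policy parameters from the demonstrations alone.
gain_hat, *_ = np.linalg.lstsq(states, actions, rcond=None)

# The learned policy generalises to a state not seen in the demos.
new_state = np.array([[0.2, -0.4]])
print(new_state @ gain_hat)  # close to new_state @ true_gain
```

With noiseless linear demonstrations the least-squares fit recovers the expert's gains exactly; real LfD pipelines use richer policy classes and noisy data, but the structure is the same.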
Inverse reinforcement learning from failure
Inverse reinforcement learning (IRL) allows autonomous agents to learn to solve complex tasks from successful demonstrations. However, in many settings, e.g., when a human learns the task by trial and error, failed demonstrations are also readily available. In addition, in some tasks, purposely generating failed demonstrations may be easier than generating successful ones. Since existing IRL methods cannot make use of failed demonstrations, in this paper we propose inverse reinforcement learning from failure (IRLF), which exploits both successful and failed demonstrations. Starting from the state-of-the-art maximum causal entropy IRL method, we propose a new constrained optimisation formulation that accommodates both types of demonstrations while remaining convex. We then derive update rules for learning reward functions and policies. Experiments on both simulated and real-robot data demonstrate that IRLF converges faster and generalises better than maximum causal entropy IRL, especially when few successful demonstrations are available.
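The flavour of feature-matching IRL with failed demonstrations can be sketched in a few lines. This is a schematic gradient step, not the paper's exact IRLF formulation: the reward is assumed linear in features, successful demonstrations pull the reward weights towards their feature expectations, and failed ones (with an assumed weight `beta`) push the weights away:

```python
import numpy as np

# Schematic feature-matching update (an illustration, not IRLF itself).
# Reward is assumed linear in features, r(s) = w . phi(s). Successful
# demos attract the policy's feature expectations; failed demos repel them.
def irl_from_failure_step(w, mu_success, mu_failure, mu_policy, lr=0.1, beta=0.5):
    grad = (mu_success - mu_policy) - beta * (mu_failure - mu_policy)
    return w + lr * grad

w = np.zeros(3)
mu_success = np.array([1.0, 0.0, 0.5])  # feature counts from successes
mu_failure = np.array([0.0, 1.0, 0.5])  # feature counts from failures
mu_policy  = np.array([0.5, 0.5, 0.5])  # current policy's expectations
w = irl_from_failure_step(w, mu_success, mu_failure, mu_policy)
print(w)  # weight rises on the success feature, falls on the failure feature
```

Features that only the failures exhibit end up with negative reward weight, which is the intuition the abstract formalises via a convex constrained optimisation.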
Rapidly exploring learning trees
Inverse Reinforcement Learning (IRL) for path planning enables robots to learn cost functions for difficult tasks from demonstration, instead of hard-coding them. However, IRL methods face practical limitations that stem from the need to repeat costly planning procedures. In this paper, we propose Rapidly Exploring Learning Trees (RLT*), which learns the cost functions of Optimal Rapidly Exploring Random Trees (RRT*) from demonstration, thereby making inverse learning methods applicable to more complex tasks. Our approach extends Maximum Margin Planning to work with RRT* cost functions. Furthermore, we propose a caching scheme that greatly reduces the computational cost of this approach. Experimental results on simulated and real-robot data from a social navigation scenario show that RLT* achieves better performance at lower computational cost than existing methods. We also successfully deploy control policies learned with RLT* on a real telepresence robot.
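The place where a learned cost function enters RRT* is parent selection: a new node is wired to whichever candidate parent minimises cost-to-come. The snippet below is an illustrative sketch (not the paper's implementation); the weighted length-plus-social-proximity edge cost stands in for whatever cost function the learner would supply:

```python
import math

# Illustrative RRT*-style parent selection with a hypothetical learned
# edge cost: a weighted sum of path length and a "social" proximity
# penalty for passing near a person, as in a social-navigation setting.
def edge_cost(a, b, w_length=1.0, w_social=2.0, person=(0.0, 0.0)):
    length = math.dist(a, b)
    mid = ((a[0] + b[0]) / 2, (a[1] + b[1]) / 2)
    social = 1.0 / (0.1 + math.dist(mid, person))  # higher when near the person
    return w_length * length + w_social * social

def choose_parent(new_node, candidates, cost_to_come):
    # Wire new_node to the candidate minimising total cost-to-come.
    return min(candidates, key=lambda n: cost_to_come[n] + edge_cost(n, new_node))

nodes = [(0.0, 1.0), (1.0, 1.0)]
c2c = {nodes[0]: 0.0, nodes[1]: 1.0}
parent = choose_parent((2.0, 1.0), nodes, c2c)
print(parent)
```

Here the longer edge from (0.0, 1.0) passes closer to the person for longer, so the planner prefers the parent at (1.0, 1.0) despite its nonzero cost-to-come. Learning the weights `w_length` and `w_social` from demonstrations is exactly the inverse problem the abstract addresses.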
VariBAD: a very good method for Bayes-adaptive deep RL via meta-learning
Trading off exploration and exploitation in an unknown environment is key to maximising expected return during learning. A Bayes-optimal policy, which does so optimally, conditions its actions not only on the environment state but on the agent's uncertainty about the environment. Computing a Bayes-optimal policy is however intractable for all but the smallest tasks. In this paper, we introduce variational Bayes-Adaptive Deep RL (variBAD), a way to meta-learn to perform approximate inference in an unknown environment, and incorporate task uncertainty directly during action selection. In a grid-world domain, we illustrate how variBAD performs structured online exploration as a function of task uncertainty. We further evaluate variBAD on MuJoCo domains widely used in meta-RL and show that it achieves higher online return than existing methods.
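The Bayes-adaptive idea that variBAD approximates with learned variational inference can be shown exactly in a toy setting: the agent maintains a belief over the unknown task and conditions its behaviour on that belief, updating it by Bayes' rule after each outcome. The two-armed task, success probabilities, and names below are assumptions for illustration only:

```python
import numpy as np

# Toy Bayes-adaptive belief update: the unknown "task" is which of two
# arms pays off (the good arm succeeds with prob 0.9, the other 0.1).
# A Bayes-optimal agent acts on the pair (state, belief), not state alone.
def update_belief(belief, arm, reward, p_success=(0.9, 0.1)):
    # belief[k] = P(arm k is the good one); compute outcome likelihoods
    probs = p_success if arm == 0 else p_success[::-1]
    like = np.array([p if reward else 1 - p for p in probs])
    post = belief * like
    return post / post.sum()

belief = np.array([0.5, 0.5])               # uniform prior over tasks
belief = update_belief(belief, arm=0, reward=1)
print(belief)  # posterior mass shifts towards "arm 0 is good"
```

In variBAD this exact posterior is replaced by an approximate one produced by a meta-learned inference network, and the policy receives its parameters alongside the state, so exploration automatically scales with remaining task uncertainty.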
TERESA: A Socially Intelligent Semi-autonomous Telepresence System
TERESA is a socially intelligent semi-autonomous telepresence system that is currently being developed as part of an FP7-STREP project funded by the European Union. The ultimate goal of the project is to deploy this system in an elderly day centre to allow elderly people to participate in social events even when they are unable to travel to the centre. In this paper, we present an overview of our progress on TERESA. We discuss the most significant scientific and technical challenges including: understanding and automatically recognizing social behaviour; defining social norms for the interaction between a telepresence robot and its users; navigating the environment while taking into account social features and constraints; and learning to estimate the social impact of the robot's actions from multiple sources of feedback. We report on our current progress on each of these challenges, as well as our plans for future work.