Bandit-Based Policy Invariant Explicit Shaping for Incorporating External Advice in Reinforcement Learning
A key challenge for a reinforcement learning (RL) agent is to incorporate
external/expert advice in its learning. The desired goals of an algorithm that
can shape the learning of an RL agent with external advice include (a)
maintaining policy invariance; (b) accelerating the learning of the agent; and
(c) learning from arbitrary advice [3]. To address this challenge, this paper
formulates the problem of incorporating external advice in RL as a multi-armed
bandit called shaping-bandits. The reward of each arm of the shaping bandit
corresponds to the return obtained by following the expert or by following a
default RL algorithm learning on the true environment reward. We show that
directly applying existing bandit and shaping algorithms that do not reason
about the non-stationary nature of the underlying returns can lead to poor
results. Thus, we propose UCB-PIES (UPIES), Racing-PIES (RPIES), and Lazy PIES
(LPIES), three shaping algorithms built on different assumptions, each of which
reasons about the long-term consequences of following the expert policy or the
default RL algorithm. Our experiments in four different settings show that
these proposed algorithms achieve the above-mentioned goals, whereas the other
algorithms fail to do so.
Comment: ALA workshop, AAMAS 202
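The abstract does not give the details of UPIES, RPIES, or LPIES, but the underlying shaping-bandit formulation can be illustrated with a minimal sketch: two arms (follow the expert advice vs. follow the default RL learner) whose returns are non-stationary because the learner improves over time, so a plain stationary bandit rule can commit to the wrong arm. The sliding-window UCB rule, window size, and return distributions below are illustrative assumptions, not the paper's algorithms.

```python
import math
import random

WINDOW = 50  # hypothetical window size over recent returns (assumption)

def sliding_window_ucb(arm_histories, t, c=2.0):
    """Pick an arm with UCB computed only over each arm's recent returns,
    so the selector can track non-stationary (improving) returns."""
    scores = []
    for i, history in enumerate(arm_histories):
        recent = history[-WINDOW:]
        if not recent:
            return i  # play each arm at least once
        mean = sum(recent) / len(recent)
        bonus = math.sqrt(c * math.log(t + 1) / len(recent))
        scores.append(mean + bonus)
    return max(range(len(scores)), key=lambda i: scores[i])

def run(num_episodes=500):
    # Arm 0: follow expert advice; arm 1: follow the default RL algorithm.
    histories = [[], []]
    for t in range(num_episodes):
        arm = sliding_window_ucb(histories, t)
        if arm == 0:
            ret = random.gauss(0.6, 0.1)  # expert: decent but fixed return
        else:
            ret = random.gauss(min(1.0, t / 300), 0.1)  # learner improves with time
        histories[arm].append(ret)
    return len(histories[0]), len(histories[1])

if __name__ == "__main__":
    print("pulls per arm (expert, default RL):", run())
```

In this toy setup the selector initially prefers the expert arm and gradually shifts to the default RL arm once its returns overtake the expert's, which is the kind of non-stationarity the abstract argues a stationary bandit or naive shaping rule fails to handle.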