TRANSFER IN DEEP REINFORCEMENT LEARNING: HOW AN AGENT CAN LEVERAGE KNOWLEDGE FROM ANOTHER AGENT, A HUMAN, OR ITSELF
While capable of achieving state-of-the-art performance in complex sequential tasks, deep reinforcement learning (deep RL) remains extremely data inefficient and slow to train. This slow learning speed poses challenges for applying deep RL to real-world situations, especially when poor initial performance is unacceptable or even dangerous. Many approaches have been studied to tackle this problem; one widely used method is transfer learning (TL). The principle of TL is that knowledge acquired by a source agent can be leveraged to assist learning in a different but related target task. This dissertation proposes three types of TL techniques to speed up the learning of a deep RL agent. Specifically, we demonstrate that knowledge can be transferred agent-to-agent, human-to-agent, and self-to-agent.

First, we show that positive transfer can be achieved between two cross-domain agents via direct weight copying if the domains share visual similarities. Second, we study various pre-training methods that use a set of human demonstrations to perform human-to-agent transfer; pre-training significantly speeds up the agent's learning. Third, we explore knowledge transfer from the agent to itself via a novel experience replay framework, Lucid Dreaming for Experience Replay (LiDER), in which past experiences are constantly refreshed. Results suggest that the agent achieves much better performance within the same amount of training data than an agent that does not replay refreshed experiences. Two extensions of the LiDER framework also enable agent-to-agent and human-to-agent transfer, making it a powerful tool for performing all three types of transfer.
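As a concrete illustration of the first technique, the sketch below shows one plausible form of agent-to-agent transfer via direct weight copying, assuming PyTorch; the `AtariPolicy` network, its architecture, and all names are hypothetical stand-ins, not the dissertation's actual code. The idea is that the visual feature extractor trained on the source task initializes a target network of the same shape, while the task-specific head is re-learned.

```python
# Minimal sketch (hypothetical, not the dissertation's implementation) of
# agent-to-agent transfer via direct weight copying between two visually
# similar domains.
import torch.nn as nn


class AtariPolicy(nn.Module):
    """Small convolutional policy; `n_actions` may differ across domains."""

    def __init__(self, n_actions: int):
        super().__init__()
        # Visual feature extractor: this is the part assumed transferable
        # when the two domains share visual similarities.
        self.features = nn.Sequential(
            nn.Conv2d(4, 32, kernel_size=8, stride=4), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=4, stride=2), nn.ReLU(),
            nn.Flatten(),
        )
        # Task-specific head: depends on the action space, so it is not copied.
        self.head = nn.Sequential(
            nn.Linear(64 * 9 * 9, 256), nn.ReLU(),
            nn.Linear(256, n_actions),
        )

    def forward(self, x):
        return self.head(self.features(x))


source = AtariPolicy(n_actions=6)   # assumed already trained on the source task
target = AtariPolicy(n_actions=18)  # fresh agent for the related target task

# Direct weight copying: load only the feature extractor's parameters into the
# target network; the head stays randomly initialized because the action
# spaces (and hence output layers) differ between the two domains.
target.features.load_state_dict(source.features.state_dict())
```

Under this reading, "positive transfer" would mean the target agent trained from these copied weights learns faster, or reaches higher performance, than one trained from scratch.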