Search CORE

5,556 research outputs found

Uncertainty Aware Learning from Demonstrations in Multiple Contexts using Bayesian Neural Networks

Author: Higuera Juan Camilo Gamboa
Meger David
Precup Doina
Thakur Sanjay
van Hoof Herke
Publication venue
Publication date: 01/01/2019
Field of study

Diversity of environments is a key challenge that causes learned robotic controllers to fail due to the discrepancies between the training and evaluation conditions. Training from demonstrations in various conditions can mitigate---but not completely prevent---such failures. Learned controllers such as neural networks typically do not have a notion of uncertainty that allows to diagnose an offset between training and testing conditions, and potentially intervene. In this work, we propose to use Bayesian Neural Networks, which have such a notion of uncertainty. We show that uncertainty can be leveraged to consistently detect situations in high-dimensional simulated and real robotic domains in which the performance of the learned controller would be sub-par. Also, we show that such an uncertainty based solution allows making an informed decision about when to invoke a fallback strategy. One fallback strategy is to request more data. We empirically show that providing data only when requested results in increased data-efficiency.Comment: Copyright 20XX IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other work

arXiv.org e-Print Archive

UvA-DARE

Projective simulation for artificial intelligence

Author: Andy Clark
AP Hines
D Ormoneit
DanielL Schacter
David Deutsch
EdwardC Tolman
Ende Tulving
Germund Hesslow
Hendrik Weimer
Igor Antonov
JT Barreiro
Julia Kempe
Jun Tani
Lev Grover
Long-Ji Lin
M Toussaint
MartinV Butz
PreetiS Sareen
R Parr
Richard Feynman
RS Sutton
Sebastian Diehl
TG Dietterich
Publication venue
Publication date: 08/02/2013
Field of study

We propose a model of a learning agent whose interaction with the environment is governed by a simulation-based projection, which allows the agent to project itself into future situations before it takes real action. Projective simulation is based on a random walk through a network of clips, which are elementary patches of episodic memory. The network of clips changes dynamically, both due to new perceptual input and due to certain compositional principles of the simulation process. During simulation, the clips are screened for specific features which trigger factual action of the agent. The scheme is different from other, computational, notions of simulation, and it provides a new element in an embodied cognitive science approach to intelligent action and learning. Our model provides a natural route for generalization to quantum-mechanical operation and connects the fields of reinforcement learning and quantum computation.Comment: 22 pages, 18 figures. Close to published version, with footnotes retaine

arXiv.org e-Print Archive

Towards Continual Reinforcement Learning: A Review and Perspectives

Author: Khetarpal Khimya
Precup Doina
Riemer Matthew
Rish Irina
Publication venue
Publication date: 24/12/2020
Field of study

In this article, we aim to provide a literature review of different formulations and approaches to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We begin by discussing our perspective on why RL is a natural fit for studying continual learning. We then provide a taxonomy of different continual RL formulations and mathematically characterize the non-stationary dynamics of each setting. We go on to discuss evaluation of continual RL agents, providing an overview of benchmarks used in the literature and important metrics for understanding agent performance. Finally, we highlight open problems and challenges in bridging the gap between the current state of continual RL and findings in neuroscience. While still in its early days, the study of continual RL has the promise to develop better incremental reinforcement learners that can function in increasingly realistic applications where non-stationarity plays a vital role. These include applications such as those in the fields of healthcare, education, logistics, and robotics.Comment: Preprint, 52 pages, 8 figure

arXiv.org e-Print Archive