
Dundee Discussion Papers in Economics 115: Positive confirmation bias in the acquisition of information

Author: Jones Martin
Sugden Robert
Publication venue: 'University of Dundee'
Publication date: 01/01/2000

University of Dundee Online Publications

Prioritized Sweeping Neural DynaQ with Multiple Predecessors, and Hippocampal Replays

Author: A Peyrache
AK Lee
AS Gupta
AW Moore
DJ Foster
G Girardeau
G Lavilléon De
H Eichenbaum
J O’Keefe
J Peng
JL McClelland
LH Lin
M Khamassi
MA Wilson
R Sutton
RA Jacobs
Richard S. Sutton
V Paz-Villagrán
Z Chen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 13/08/2018

During sleep and awake rest, the hippocampus replays sequences of place cells that have been activated during prior experiences. These replays have been interpreted as a memory consolidation process, but recent results suggest a possible interpretation in terms of reinforcement learning. The Dyna reinforcement learning algorithms use off-line replays to improve learning. Under a limited replay budget, a prioritized sweeping approach, which requires a model of the transitions to the predecessors, can be used to improve performance. We investigate whether such algorithms can explain the experimentally observed replays. We propose a neural network version of prioritized sweeping Q-learning, for which we developed a growing multiple expert algorithm able to cope with multiple predecessors. The resulting architecture improves the learning of simulated agents confronted with a navigation task. We predict that, in animals, learning the world model should occur during rest periods, and that the corresponding replays should be shuffled. Comment: Living Machines 2018 (Paris, France

arXiv.org e-Print Archive

Crossref

Regularising Non-linear Models Using Feature Side-information

Author: Kalousis Alexandros
Mollaysa Amina
Strasser Pablo
Publication date: 07/03/2017

Very often, features come with their own vectorial descriptions, which provide detailed information about their properties. We refer to these vectorial descriptions as feature side-information. In the standard learning scenario, the input is represented as a vector of features, and the feature side-information is most often ignored or used only for feature selection prior to model fitting. We believe that feature side-information, which carries information about the features' intrinsic properties, will help improve model prediction if used properly during the learning process. In this paper, we propose a framework that allows the feature side-information to be incorporated during the learning of very general model families to improve prediction performance. We control the structures of the learned models so that they reflect feature similarities as defined on the basis of the side-information. We perform experiments on a number of benchmark datasets, which show significant predictive performance gains over a number of baselines as a result of exploiting the side-information. Comment: 11 page with appendi

arXiv.org e-Print Archive

Hes-so: ArODES Open Archive (University of Applied Sciences and Arts Western Switzerland / Haute école spécialisée de Suisse occidentale / FH Westschweiz)