
27,658 research outputs found

Evidence for surprise minimization over value maximization in choice behavior

Author: A Clark
A Pouget
D Kahnemann
E Jaynes
E Shannon
G Loomes
G Pezzulo
H Brown
I Vlaev
J Nash
K Friston
K Seth
KE Stephan
L Itti
M Botvinick
M Moutoussis
M Wolpert
ND Wright
P Dayan
P Schwartenbeck
PR Blavatskyy
PR Montague
R Adams
S Kakade
S Klyubin
THB FitzGerald
TL Griffiths
Y Sun
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015

Classical economic models are predicated on the idea that the ultimate aim of choice is to maximize utility or reward. In contrast, an alternative perspective highlights the fact that adaptive behavior requires agents to model their environment and minimize surprise about the states they frequent. We propose that choice behavior can be more accurately accounted for by surprise minimization than by reward or utility maximization alone. Minimizing surprise makes a prediction at variance with expected utility models; namely, that in addition to attaining valuable states, agents attempt to maximize the entropy over outcomes and thus 'keep their options open'. We tested this prediction using a simple binary choice paradigm and show that human decision-making is better explained by surprise minimization than by utility maximization. Furthermore, we replicated this entropy-seeking behavior in a control task with no explicit utilities. These findings highlight a limitation of purely economic motivations in explaining choice behavior and instead emphasize the importance of belief-based motivations.
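
To make "maximizing the entropy over outcomes" concrete, the following is a minimal sketch that computes the Shannon entropy of a discrete outcome distribution. The function name and the two example distributions are illustrative assumptions and are not taken from the paper or its task.

```python
import math


def outcome_entropy(probs):
    """Shannon entropy, in bits, of a discrete distribution over outcomes."""
    if abs(sum(probs) - 1.0) > 1e-9:
        raise ValueError("probabilities must sum to 1")
    # Outcomes with zero probability contribute nothing to the entropy.
    return -sum(p * math.log2(p) for p in probs if p > 0)


# Hypothetical options: one keeps both outcomes equally reachable,
# the other commits almost entirely to a single outcome.
open_option = [0.5, 0.5]
committed_option = [0.9, 0.1]

print(outcome_entropy(open_option))
print(outcome_entropy(committed_option))
```

Under this reading, an agent that "keeps its options open" would, all else being equal, favor the option whose outcome distribution has the higher entropy.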

Paris Lodron University of Salzburg

Crossref

UCL Discovery

PubMed Central

Sissa Digital Library

University of East Anglia digital repository

MPG.PuRe

Deciphering Network Community Structure by Surprise

Author: A Lancichinetti
A Marco
AL Barabási
BJ Breitkreutz
DJ Watts
EC Pielou
Eshel Ben-Jacob
Ignacio Marín
J Duch
JI Lucas
L Danon
LC Freeman
LD Costa
M Girvan
M Meilă
M Rosvall
MEJ Newman
P Ronhovde
R Aldecoa
RH MacArthur
Rodrigo Aldecoa
S Fortunato
S Wasserman
SH Strogatz
SY Pu
V Arnau
VD Blondel
WW Zachary
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2011

The analysis of complex networks permeates all sciences, from biology to sociology. A fundamental, unsolved problem is how to characterize the community structure of a network. Here, using both standard and novel benchmarks, we show that maximization of a simple global parameter, which we call Surprise (S), leads to a very efficient characterization of the community structure of complex synthetic networks. Particularly, S qualitatively outperforms the most commonly used criterion to define communities, Newman and Girvan's modularity (Q). Applying S maximization to real networks often provides natural, well-supported partitions, but also sometimes counterintuitive solutions that expose the limitations of our previous knowledge. These results indicate that it is possible to define an effective global criterion for community structure and open new routes for the understanding of complex networks. Comment: 7 pages, 5 figures
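
The abstract does not spell out how S is computed. The sketch below assumes the cumulative hypergeometric form commonly associated with this measure: with F possible links, M possible intra-community links, n observed links, and p observed intra-community links, S is the negative log of the probability of seeing at least p intra-community links by chance. The function name, the natural-log base, and the toy graph are assumptions made for illustration.

```python
import math
from collections import Counter
from fractions import Fraction


def surprise(n_nodes, edges, membership):
    """Surprise of a partition (assumed cumulative hypergeometric form).

    edges: list of undirected (u, v) pairs without self-loops or repeats.
    membership: dict mapping each node to its community label.
    """
    F = n_nodes * (n_nodes - 1) // 2              # all possible links
    sizes = Counter(membership.values())
    M = sum(s * (s - 1) // 2 for s in sizes.values())  # possible intra links
    n = len(edges)                                # observed links
    p = sum(1 for u, v in edges if membership[u] == membership[v])
    # Exact tail probability of observing p or more intra-community links.
    tail = sum(
        Fraction(math.comb(M, j) * math.comb(F - M, n - j), math.comb(F, n))
        for j in range(p, min(M, n) + 1)
    )
    return -math.log(tail)


# Toy graph (hypothetical): two triangles joined by a single bridge edge.
edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]
two_groups = {0: "a", 1: "a", 2: "a", 3: "b", 4: "b", 5: "b"}
one_group = {node: "a" for node in range(6)}

print(surprise(6, edges, two_groups))
print(surprise(6, edges, one_group))
```

On this toy graph the two-triangle partition scores higher than lumping all nodes together, which is the behavior a maximization criterion of this kind is meant to reward. For large networks the tail probability can underflow when converted to a float, so a production version would need to work in log space.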

arXiv.org e-Print Archive

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Digital.CSIC

Automatic Curriculum Learning For Deep RL: A Short Survey

Author: Colas Cédric
Hofmann Katja
Oudeyer Pierre-Yves
Portelas Rémy
Weng Lilian
Publication date: 28/05/2020

Automatic Curriculum Learning (ACL) has become a cornerstone of recent successes in Deep Reinforcement Learning (DRL). These methods shape the learning trajectories of agents by challenging them with tasks adapted to their capacities. In recent years, they have been used to improve sample efficiency and asymptotic performance, to organize exploration, to encourage generalization or to solve sparse reward problems, among others. The ambition of this work is dual: 1) to present a compact and accessible introduction to the Automatic Curriculum Learning literature and 2) to draw a bigger picture of the current state of the art in ACL to encourage the cross-breeding of existing concepts and the emergence of new ideas. Comment: Accepted at IJCAI202

arXiv.org e-Print Archive

Crossref

INRIA a CCSD electronic archive server

VIME: Variational Information Maximizing Exploration

Author: Abbeel Pieter
Chen Xi
De Turck Filip
Duan Yan
Houthooft Rein
Schulman John
Publication date: 01/01/2016

Scalable and effective exploration remains a key challenge in reinforcement learning (RL). While there are methods with optimality guarantees in the setting of discrete state and action spaces, these methods cannot be applied in high-dimensional deep RL scenarios. As such, most contemporary RL relies on simple heuristics such as epsilon-greedy exploration or adding Gaussian noise to the controls. This paper introduces Variational Information Maximizing Exploration (VIME), an exploration strategy based on maximization of information gain about the agent's belief of environment dynamics. We propose a practical implementation, using variational inference in Bayesian neural networks, which efficiently handles continuous state and action spaces. VIME modifies the MDP reward function, and can be applied with several different underlying RL algorithms. We demonstrate that VIME achieves significantly better performance compared to heuristic exploration methods across a variety of continuous control tasks and algorithms, including tasks with very sparse rewards. Comment: Published in Advances in Neural Information Processing Systems 29 (NIPS), pages 1109-111
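
The idea of modifying the reward with an information-gain term can be sketched as follows. This is not the paper's implementation: it assumes the agent's belief about the dynamics is summarized by a diagonal Gaussian over model parameters, measures information gain as the KL divergence between the belief after and before an update, and adds it to the external reward with a hypothetical weight `eta`. All function and parameter names are illustrative.

```python
import math


def kl_diag_gaussians(mu_new, sigma_new, mu_old, sigma_old):
    """KL(new || old) between two diagonal Gaussians, summed over dimensions."""
    total = 0.0
    for m1, s1, m0, s0 in zip(mu_new, sigma_new, mu_old, sigma_old):
        total += (
            math.log(s0 / s1)
            + (s1 ** 2 + (m1 - m0) ** 2) / (2 * s0 ** 2)
            - 0.5
        )
    return total


def augmented_reward(reward, info_gain, eta=0.1):
    """External reward plus a weighted exploration bonus (eta is assumed)."""
    return reward + eta * info_gain


# Hypothetical belief update after observing one transition.
belief_before = ([0.0, 0.0], [1.0, 1.0])   # (means, standard deviations)
belief_after = ([0.3, -0.1], [0.8, 0.9])

gain = kl_diag_gaussians(*belief_after, *belief_before)
print(augmented_reward(reward=0.0, info_gain=gain))
```

Even when the external reward is zero, as in sparse-reward tasks, transitions that change the belief about the dynamics yield a positive bonus, which is what distinguishes this kind of exploration from epsilon-greedy action noise.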

arXiv.org e-Print Archive

Ghent University Academic Bibliography