
2 research outputs found

Structure Learning in Human Sequential Decision-Making

Author: A Fel'dbaum
A Gelman
A Johnson
A Smith
AC Courville
AD Horowitz
AJ Yu
C Anderson
C Watkins
D Acuna
D Heckerman
DA Braun
Daniel E. Acuña
I Erev
J Anderson
J Banks
JB Tenenbaum
JC Gittins
L Kaelbling
M Steyvers
MD Lee
MJA Strens
MS Yi
N Gans
ND Daw
P Poupart
P Whittle
Paul Schrater
R Dearden
R Howard
RE Bellman
RE Neapolitan
RJ Meyer
RS Sutton
SJ Gershman
TEJ Behrens
Tim Behrens
W Edwards
W Schultz
Y Brackbill
Y Sakai
Publication venue: Public Library of Science
Publication date: 01/12/2010

Studies of sequential decision-making in humans frequently find suboptimal performance relative to an ideal actor that has perfect knowledge of the model of how rewards and events are generated in the environment. We argue that, rather than behaving suboptimally, humans face a more complex learning problem, one that also involves learning the structure of reward generation in the environment. We formulate the problem of structure learning in sequential decision tasks using Bayesian reinforcement learning, and show that learning the generative model for rewards qualitatively changes the behavior of an optimal learning agent. To test whether people exhibit structure learning, we performed experiments involving a mixture of one-armed and two-armed bandit reward models, where structure learning produces many of the qualitative behaviors deemed suboptimal in previous studies. Our results demonstrate that humans can perform structure learning in a near-optimal manner.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Parallel Nonstationary Direct Policy Search for Risk-Averse Stochastic Optimization

Author: Baxter J
Belgacem Bouzaiene-Ayari
Bertsekas D
Bertsekas DP
Boris Defourny
Burger M
Censor Y
Coleman TF
Defourny B
Giuliani M
Golub G
Kakade SM
Kober JR
Kormushev P
Lewis RM
MacKay DJ
Mannor S
Munos R
Ng AY
Nocedal J
Silver D
Somayeh Moazeni
Strens MJA
Sutton R
Sutton RS
Warren B. Powell
Yamakawa E
Publication venue: Institute for Operations Research and the Management Sciences (INFORMS)

Crossref