Bandit Models of Human Behavior: Reward Processing in Mental Disorders

Authors: A Dezfouli; AD Redish; AM Taylor; D Bouneffouf; DC Perry; LE Hess; M Luman; MJ Frank; P Auer; TL Lai; TU Hauser; W Thompson; WR Thompson; WW Seeley
Publication date: 7 June 2017

Abstract

Drawing inspiration from behavioral studies of human decision making, we propose a general parametric framework for the multi-armed bandit problem that extends the standard Thompson Sampling approach to incorporate reward processing biases associated with several neurological and psychiatric conditions, including Parkinson's and Alzheimer's diseases, attention-deficit/hyperactivity disorder (ADHD), addiction, and chronic pain. We demonstrate empirically that the proposed parametric approach can often outperform baseline Thompson Sampling on a variety of datasets. Moreover, from a behavioral modeling perspective, our parametric framework can be viewed as a first step towards a unifying computational model that captures reward processing abnormalities across multiple mental conditions.

Comment: Conference on Artificial General Intelligence, AGI-1
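As a rough illustration of the idea in the abstract, the following is a minimal sketch of Bernoulli Thompson Sampling in which the posterior update is parameterized. The abstract does not give the paper's actual parametrization, so the function name `thompson_sampling` and the parameters `w_pos`, `w_neg`, and `decay` are assumptions introduced here for illustration only. With all three set to 1.0 the sketch reduces to standard Thompson Sampling with a Beta(1, 1) prior.

```python
import random


def thompson_sampling(true_probs, n_rounds, w_pos=1.0, w_neg=1.0, decay=1.0, seed=0):
    """Bernoulli Thompson Sampling with hypothetical reward-processing weights.

    w_pos and w_neg scale how strongly a success or a failure updates the
    Beta posterior of the chosen arm, and decay discounts that arm's older
    evidence. These parameters are illustrative assumptions, not the paper's
    definitions. All three must be non-negative.
    """
    rng = random.Random(seed)
    n_arms = len(true_probs)
    successes = [0.0] * n_arms  # weighted success evidence per arm
    failures = [0.0] * n_arms   # weighted failure evidence per arm
    total_reward = 0

    for _ in range(n_rounds):
        # Draw one plausible mean reward per arm from its Beta posterior.
        samples = [
            rng.betavariate(1.0 + successes[a], 1.0 + failures[a])
            for a in range(n_arms)
        ]
        # Play the arm whose sampled mean is largest.
        arm = max(range(n_arms), key=samples.__getitem__)

        # Simulated environment: Bernoulli reward with the arm's true probability.
        reward = 1 if rng.random() < true_probs[arm] else 0
        total_reward += reward

        # Biased update: discount old evidence, then add weighted feedback.
        successes[arm] = decay * successes[arm] + w_pos * reward
        failures[arm] = decay * failures[arm] + w_neg * (1 - reward)

    return total_reward, successes, failures


if __name__ == "__main__":
    baseline = thompson_sampling([0.2, 0.8], n_rounds=1000)
    # Hypothetical agent that underweights negative feedback.
    biased = thompson_sampling([0.2, 0.8], n_rounds=1000, w_neg=0.5)
    print("baseline total reward:", baseline[0])
    print("biased total reward:", biased[0])
```

Varying the weights changes how quickly the agent commits to an arm, which is one simple way a reward processing bias could be expressed in a bandit model. The settings that correspond to specific conditions are defined in the paper itself and are not reproduced here.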

Available Versions

Crossref

info:doi/10.1007%2F978-3-319-6...

Last time updated on 06/08/2021