Search CORE

Frontiers - Publisher Connector

Neural Prediction Errors Reveal a Risk-Sensitive Reinforcement-Learning Process in the Human Brain

Author: Dayan Peter
Edlund Jeffrey A.
Niv Yael
O'Doherty John P.
Publication venue: 'Society for Neuroscience'
Publication date: 01/01/2012
Field of study

Humans and animals are exquisitely, though idiosyncratically, sensitive to risk or variance in the outcomes of their actions. Economic, psychological, and neural aspects of this are well studied when information about risk is provided explicitly. However, we must normally learn about outcomes from experience, through trial and error. Traditional models of such reinforcement learning focus on learning about the mean reward value of cues and ignore higher order moments such as variance. We used fMRI to test whether the neural correlates of human reinforcement learning are sensitive to experienced risk. Our analysis focused on anatomically delineated regions of a priori interest in the nucleus accumbens, where blood oxygenation level-dependent (BOLD) signals have been suggested as correlating with quantities derived from reinforcement learning. We first provide unbiased evidence that the raw BOLD signal in these regions corresponds closely to a reward prediction error. We then derive from this signal the learned values of cues that predict rewards of equal mean but different variance and show that these values are indeed modulated by experienced risk. Moreover, a close neurometric–psychometric coupling exists between the fluctuations of the experience-based evaluations of risky options that we measured neurally and the fluctuations in behavioral risk aversion. This suggests that risk sensitivity is integral to human learning, illuminating economic models of choice, neuroscientific models of affective learning, and the workings of the underlying neural mechanisms

UCL Discovery

Caltech Authors

MPG.PuRe

Dopamine, uncertainty and TD learning

Author: Dayan Peter
Duff Michael O
Niv Yael
Publication venue: BioMed Central
Publication date: 01/01/2005
Field of study

Substantial evidence suggests that the phasic activities of dopaminergic neurons in the primate midbrain represent a temporal difference (TD) error in predictions of future reward, with increases above and decreases below baseline consequent on positive and negative prediction errors, respectively. However, dopamine cells have very low baseline activity, which implies that the representation of these two sorts of error is asymmetric. We explore the implications of this seemingly innocuous asymmetry for the interpretation of dopaminergic firing patterns in experiments with probabilistic rewards which bring about persistent prediction errors. In particular, we show that when averaging the non-stationary prediction errors across trials, a ramping in the activity of the dopamine neurons should be apparent, whose magnitude is dependent on the learning rate. This exact phenomenon was observed in a recent experiment, though being interpreted there in antipodal terms as a within-trial encoding of uncertainty

Springer - Publisher Connector

MPG.PuRe

Macroamylasemia as the First Manifestation of Celiac Disease

Author: Depsames Roman
Fireman Zvi
Kopelman Yael
Niv Eva
Publication venue: S. Karger AG
Publication date: 01/01/2008
Field of study

Macroamylasemia is a biochemical disorder characterized by an elevated serum amylase activity resulting from the circulation of a macromolecular complex of amylase with a serum component, often an immunoglobulin. The increased molecular weight of this complex prevents the normal renal excretion of the enzyme. A few cases of celiac patients with macroamylasemia have been published in whom the biochemical disorder disappeared after treatment with a gluten-free diet

Princeton University Open Access Repository

Statistical Computations Underlying the Dynamics of Memory Updating

Author: Gershman Samuel J.
Niv Yael
Norman Kenneth A.
Radulescu Angela
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013
Field of study

Psychophysical and neurophysiological studies have suggested that memory is not simply a carbon copy of our experience: Memories are modified or new memories are formed depending on the dynamic structure of our experience, and specifically, on how gradually or abruptly the world changes. We present a statistical theory of memory formation in a dynamic environment, based on a nonparametric generalization of the switching Kalman filter. We show that this theory can qualitatively account for several psychophysical and neural phenomena, and present results of a new visual memory experiment aimed at testing the theory directly. Our experimental findings suggest that humans can use temporal discontinuities in the structure of the environment to determine when to form new memory traces. The statistical perspective we offer provides a coherent account of the conditions under which new experience is integrated into an old memory versus forming a new memory, and shows that memory formation depends on inferences about the underlying structure of our experience.Templeton FoundationAlfred P. Sloan Foundation (Fellowship)National Science Foundation (U.S.) (NSF Graduate Research Fellowship)National Institute of Mental Health (U.S.) (NIH Award Number R01MH098861

CiteSeerX

DSpace@MIT

A free-choice premium in the basal ganglia

Author: Angela Langdon
Angela Radulescu
Yael Niv
Publication venue
Publication date: 24/04/2020
Field of study

CiteSeerX

Complications in endoscopic retrograde cholangiopancreatography (ERCP) and endoscopic ultrasound (EUS): analysis of 7-year physician-reported adverse events

Author: Birkenfeld Shlomo
Gershtansky Yael
Kenett Ron S
Niv Yaron
Tal Yossi
Publication venue: Dove Medical Press
Publication date
Field of study

Princeton University Open Access Repository

Recommended from our members

Reconsolidation-Extinction Interactions in Fear Memory Attenuation: The Role of Inter-Trial Interval Variability

Author: Allison Auchter
Francisco Gonzalez-Lima
Lawrence K. Cormack
Marie H. Monfils
Yael Niv
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2017
Field of study

Fear extinction typically results in the formation of a new inhibitory memory that suppresses the original conditioned response. Evidence also suggests that extinction training during a retrieval-induced labile period results in integration of the extinction memory into the original fear memory, rendering the fear memory less susceptible to reinstatement. Here we investigated the parameters by which the retrieval-extinction paradigm was most effective in memory updating. Specifically, we manipulated the intertrial intervals (ITIs) between conditional stimulus (CS) presentations during extinction, examining how having interval lengths with different degrees of variability affected the strength of memory updating. We showed that randomizing the ITI of CS presentations during extinction led to less return of fear via reinstatement than extinction with a fixed ITI. Subjects who received variable ITIs during extinction also showed higher freezing during the ITI, indicating that the randomization of CS presentations led to a higher general reactivity during extinction, which may be one potential mechanism for memory updating

Frontiers - Publisher Connector