
    Human Substantia Nigra Neurons Encode Unexpected Financial Rewards

    The brain's sensitivity to unexpected outcomes plays a fundamental role in an organism's ability to adapt and learn new behaviors. Emerging research suggests that midbrain dopaminergic neurons encode these unexpected outcomes. We used microelectrode recordings during deep brain stimulation surgery to study neuronal activity in the human substantia nigra (SN) while patients with Parkinson's disease engaged in a probabilistic learning task motivated by virtual financial rewards. Based on a model of the participants' expected reward, we divided trial outcomes into expected and unexpected gains and losses. SN neurons exhibited significantly higher firing rates after unexpected gains than unexpected losses. No such differences were observed after expected gains and losses. This result provides critical support for the hypothesized role of the SN in human reinforcement learning.
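    The outcome-splitting step described above (dividing trials into expected and unexpected gains and losses against a model's expected reward) can be sketched as follows. This is a hypothetical illustration only: the function name, the deviation threshold `tol`, and the labeling criterion are assumptions, not the authors' actual analysis.

```python
# Hypothetical sketch of classifying trial outcomes against a model's
# expected reward. The threshold `tol` and the criterion (absolute
# deviation from expectation) are illustrative assumptions.

def classify_trial(outcome, expected, tol=0.5):
    """Label a trial by the sign of its outcome (gain/loss) and by
    whether it deviates from the model's expectation by more than tol
    (unexpected/expected)."""
    kind = "gain" if outcome > 0 else "loss"
    surprise = "unexpected" if abs(outcome - expected) > tol else "expected"
    return surprise + " " + kind

classify_trial(outcome=1.0, expected=0.1)    # a surprising win
classify_trial(outcome=-1.0, expected=-0.9)  # an anticipated loss
```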

    The causal role between phasic midbrain dopamine signals and learning

    The article discusses how phasic dopamine (DA) may relate to action selection, goal-directed behavior, and behavioral flexibility in mice. It states that optogenetic targeting of midbrain DA cells and their striatal projections revealed a role in reward prediction and behavioral flexibility. It notes that DA activity regulates aspects of appetitive reward learning. It mentions that DA is causally involved in the flexible behavioral adaptations that occur when stimulus-reward contingencies change.

    Dopamine, reward learning, and active inference

    Temporal difference learning models propose that phasic dopamine signaling encodes reward prediction errors that drive learning. This is supported by studies in which optogenetic stimulation of dopamine neurons can substitute for actual reward. Nevertheless, a large body of data also shows that dopamine is not necessary for learning, and that dopamine depletion primarily affects task performance. We offer a resolution to this paradox based on the hypothesis that dopamine encodes the precision of beliefs about alternative actions, and thus controls the outcome-sensitivity of behavior. We extend an active inference scheme for solving Markov decision processes to include learning, and show that simulated dopamine dynamics strongly resemble those actually observed during instrumental conditioning. Furthermore, simulated dopamine depletion impairs performance but spares learning, while simulated excitation of dopamine neurons drives reward learning through aberrant inference about outcome states. Our formal approach provides a novel and parsimonious reconciliation of apparently divergent experimental findings.
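    The temporal-difference account this abstract argues against and around can be sketched minimally: a prediction error δ = r + γV(s') − V(s) updates state values, and δ shrinks as rewards become predicted. This is a toy TD(0) sketch on a fixed chain of states, not the paper's active inference scheme; all parameter values are illustrative assumptions.

```python
# Minimal TD(0) sketch of reward-prediction-error learning on a linear
# chain of states. Illustrative only; the paper's own model is an
# active inference scheme, not plain TD.

def td_learn(rewards, n_states, alpha=0.1, gamma=0.9, episodes=200):
    """Learn state values for a chain visited in order each episode.

    rewards[s] is the reward received on leaving state s. Returns the
    learned value table and the prediction errors (delta) from the
    final episode."""
    V = [0.0] * (n_states + 1)  # extra terminal state with value 0
    deltas = []
    for _ in range(episodes):
        deltas = []
        for s in range(n_states):
            delta = rewards[s] + gamma * V[s + 1] - V[s]  # prediction error
            V[s] += alpha * delta
            deltas.append(delta)
    return V, deltas

# Only the last transition is rewarded. After training, value backs up
# along the chain and the prediction error at the reward itself fades,
# mirroring the "reward becomes expected" signature of TD accounts.
V, deltas = td_learn(rewards=[0, 0, 0, 1], n_states=4)
```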

    Temporal-Difference Reinforcement Learning with Distributed Representations

    Temporal-difference (TD) algorithms have been proposed as models of reinforcement learning (RL). We examine two issues of distributed representation in these TD algorithms: distributed representations of belief and distributed discounting factors. Distributed representation of belief allows the believed state of the world to distribute across sets of equivalent states. Distributed exponential discounting factors produce hyperbolic discounting in the behavior of the agent itself. We examine these issues in the context of a TD RL model in which state-belief is distributed over a set of exponentially-discounting “micro-Agents”, each of which has a separate discounting factor (γ). Each µAgent maintains an independent hypothesis about the state of the world, and a separate value-estimate of taking actions within that hypothesized state. The overall agent thus instantiates a flexible representation of an evolving world-state. As with other TD models, the value-error (δ) signal within the model matches dopamine signals recorded from animals in standard conditioning reward-paradigms. The distributed representation of belief provides an explanation for the decrease in dopamine at the conditioned stimulus seen in overtrained animals, for the differences between trace and delay conditioning, and for transient bursts of dopamine seen at movement initiation. Because each µAgent also includes its own exponential discounting factor, the overall agent shows hyperbolic discounting, consistent with behavioral experiments.

    Convergent Processing of Both Positive and Negative Motivational Signals by the VTA Dopamine Neuronal Populations

    Dopamine neurons in the ventral tegmental area (VTA) have traditionally been studied for their roles in reward-related motivation and drug addiction. Here we study how the VTA dopamine neuron population may process fearful and negative experiences as well as reward information in freely behaving mice. Using multi-tetrode recording, we find that up to 89% of the putative dopamine neurons in the VTA exhibit significant activation in response to a conditioned tone that predicts food reward, while the same dopamine neuron population also responds to fearful experiences such as free-fall and shake events. The majority of these VTA putative dopamine neurons exhibit suppression and offset-rebound excitation, whereas ∼25% of the recorded putative dopamine neurons are excited by the fearful events. Importantly, VTA putative dopamine neurons exhibit parametric encoding properties: the durations of their firing changes are proportional to the durations of the fearful events. In addition, we demonstrate that contextual information is crucial for the same conditioned tone to elicit either positive or negative motivational responses in these neurons. Taken together, our findings suggest that VTA dopamine neurons may employ a convergent encoding strategy for processing both positive and negative experiences, integrating them closely with cues and environmental context.

    Sensory regulation of dopaminergic cell activity: Phenomenology, circuitry and function

    Dopaminergic neurons in a range of species are responsive to sensory stimuli. In the anesthetized preparation, responses to non-noxious and noxious sensory stimuli are usually tonic in nature, although long-duration changes in activity have been reported in the awake preparation as well. However, in the awake preparation, short-latency, phasic changes in activity are most common. These phasic responses can occur to unconditioned aversive and non-aversive stimuli, as well as to the stimuli which predict them. In both the anesthetized and awake preparations, not all dopaminergic neurons are responsive to sensory stimuli; however, responsive neurons tend to respond to more than a single stimulus modality. Evidence suggests that short-latency sensory information is provided to dopaminergic neurons by relatively primitive subcortical structures – including the midbrain superior colliculus for vision and the mesopontine parabrachial nucleus for pain and possibly gustation. Although short-latency visual information is provided to dopaminergic neurons by the relatively primitive colliculus, dopaminergic neurons can discriminate between complex visual stimuli, an apparent paradox which can be resolved by the recently discovered route of information flow to dopaminergic neurons from the cerebral cortex, via a relay in the colliculus. Given that projections from the cortex to the colliculus are extensive, such a relay potentially allows the activity of dopaminergic neurons to report the results of complex stimulus processing from widespread areas of the cortex. Furthermore, dopaminergic neurons could acquire their ability to reflect stimulus value by virtue of reward-related modification of sensory processing in the cortex. At the forebrain level, sensory-related changes in the tonic activity of dopaminergic neurons may regulate the impact of the cortex on forebrain structures such as the nucleus accumbens.
In contrast, the short latency of the phasic responses to sensory stimuli in dopaminergic neurons, coupled with the activation of these neurons by non-rewarding stimuli, suggests that phasic responses of dopaminergic neurons may provide a signal to the forebrain indicating that a salient event has occurred (and possibly an estimate of how salient that event is). A stimulus-related salience signal could be used by downstream systems to reinforce behavioral choices.