Principal components analysis of reward prediction errors in a reinforcement learning task

Author: Goslin Jeremy
Sambrook Thomas D.
Publication venue: 'Elsevier BV'
Publication date: 01/01/2016

Models of reinforcement learning represent reward and punishment in terms of reward prediction errors (RPEs), quantitative signed terms describing the degree to which outcomes are better than expected (positive RPEs) or worse (negative RPEs). An electrophysiological component known as the feedback-related negativity (FRN) occurs at frontocentral sites 240-340 ms after feedback on whether a reward or punishment is obtained, and has been claimed to neurally encode an RPE. An outstanding question, however, is whether the FRN is sensitive to the size of both positive RPEs and negative RPEs. Previous attempts to answer this question have examined the simple effects of RPE size for positive RPEs and negative RPEs separately. However, this methodology can be compromised by overlap from components coding for unsigned prediction error size, or "salience", which are sensitive to the absolute size of a prediction error but not its valence. In our study, positive and negative RPEs were parametrically modulated using both reward likelihood and magnitude, with principal components analysis used to separate out overlying components. This revealed a single RPE-encoding component responsive to the size of positive RPEs, peaking at ~330 ms, and occupying the delta frequency band. Other components responsive to unsigned prediction error size were shown, but no component sensitive to negative RPE size was found. © 2015 Elsevier Inc. All rights reserved.
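The distinction the abstract draws between a signed RPE and unsigned "salience" can be illustrated with a minimal sketch. It assumes the simplest textbook form, in which the expected value is reward likelihood times reward magnitude; the function names and the numbers are illustrative and are not taken from the study.

```python
def signed_rpe(outcome, probability, magnitude):
    """Signed RPE: obtained outcome minus expected value.

    Assumption for illustration: expected value = probability * magnitude.
    """
    expected_value = probability * magnitude
    return outcome - expected_value


def salience(rpe):
    """Unsigned prediction error: absolute size, ignoring valence."""
    return abs(rpe)


# Hypothetical gamble: a reward of 10 delivered with probability 0.25.
win = signed_rpe(10.0, 0.25, 10.0)   # better than expected, positive RPE
miss = signed_rpe(0.0, 0.25, 10.0)   # worse than expected, negative RPE

print(win, salience(win))
print(miss, salience(miss))
```

A component that tracks `salience` grows with both large wins and large misses, which is why it can mask or mimic a signed RPE effect when positive and negative RPEs are analysed separately.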

Crossref

Plymouth Electronic Archive and Research Library

University of East Anglia digital repository

Feedback information and the reward positivity

Author: Cockburn Jeffrey
Holroyd Clay
Publication venue: 'Elsevier BV'
Publication date: 01/01/2018

The reward positivity is a component of the event-related brain potential (ERP) sensitive to neural mechanisms of reward processing. Multiple studies have demonstrated that reward positivity amplitude indexes a reward prediction error signal that is fundamental to theories of reinforcement learning. However, whether this ERP component is also sensitive to richer forms of performance information important for supervised learning is less clear. To investigate this question, we recorded the electroencephalogram from participants engaged in a time estimation task in which the type of error information conveyed by feedback stimuli was systematically varied across conditions. Consistent with our predictions, we found that reward positivity amplitude decreased in relation to increasing information content of the feedback, and that reward positivity amplitude was unrelated to trial-to-trial behavioral adjustments in task performance. By contrast, a series of exploratory analyses revealed frontal-central and posterior ERP components immediately following the reward positivity that related to these processes. Taken in the context of the wider literature, these results suggest that the reward positivity is produced by a neural mechanism that motivates task performance, whereas the later ERP components apply the feedback information according to principles of supervised learning.

Ghent University Academic Bibliography

Focus on the positive : computational simulations implicate asymmetrical reward prediction error signals in childhood attention-deficit/hyperactivity disorder

Author: Cockburn Jeffrey
Holroyd Clay
Publication venue: 'Elsevier BV'
Publication date: 01/01/2010

Ghent University Academic Bibliography

Frontal midline theta and N200 amplitude reflect complementary information about expectancy and outcome evaluation

Author: Hajihosseini Azadeh
Holroyd Clay
Publication venue: 'Wiley'
Publication date: 01/01/2013

Feedback ERN (fERN) and frontal midline theta have both been proposed to index a dopamine-like reinforcement learning signal in anterior cingulate cortex (ACC). We investigated these proposals by comparing fERN amplitude and theta power with respect to their sensitivities to outcome valence and probability in a previously collected EEG dataset. Bayesian model comparison revealed a dissociation between the two measures, with fERN amplitude mainly sensitive to valence and theta power mainly sensitive to probability. Further, fERN amplitude was highly correlated with the portion of theta power that is consistent in phase across trials (i.e., evoked theta power). These results suggest that although both measures provide valuable information about the cognitive function of frontal midline cortex, fERN amplitude is specifically sensitive to dopamine reinforcement learning signals, whereas theta power reflects the ACC response to unexpected events.
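The notion of evoked theta power as the phase-consistent portion of total power can be sketched as follows. This is a generic illustration, not the authors' pipeline: it assumes each trial has already been reduced to one complex theta-band coefficient (amplitude and phase), and the function names are hypothetical.

```python
import cmath


def total_power(trials):
    """Mean of single-trial power; insensitive to phase alignment."""
    return sum(abs(z) ** 2 for z in trials) / len(trials)


def evoked_power(trials):
    """Power of the trial average; survives only if phase is consistent."""
    mean = sum(trials) / len(trials)
    return abs(mean) ** 2


# Four unit-amplitude trials with identical phase versus evenly spread phases.
phase_locked = [cmath.rect(1.0, 0.0) for _ in range(4)]
jittered = [cmath.rect(1.0, k * cmath.pi / 2) for k in range(4)]

print(total_power(phase_locked), evoked_power(phase_locked))
print(total_power(jittered), evoked_power(jittered))
```

Both sets of trials carry the same total power, but averaging cancels the jittered set almost entirely, so only the phase-locked set contributes evoked power.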

Ghent University Academic Bibliography

The feedback correct-related positivity : sensitivity of the event-related brain potential to unexpected positive feedback

Author: Allain
Amiez
Bayer
Botvinick
Cohen
Coles
Delorme
Dien
Donchin
Emeric
Eppinger
Falkenstein
Folstein
Gehring
Gratton
Hajcak
Halgren
Holroyd
Huettel
Ito
Kiehl
Linden
Luck
Mars
Marsden
Matsumoto
McCarthy
Miltner
Montague
Nieuwenhuis
Niki
Nunez
Pakzad-Vaezi
Potts
Pritchard
Ridderinkhof
Schultz
Spencer
Towey
Toyomaki
Ullsperger
Urbach
Van Veen
Wang
Yeung
Publication venue: 'Wiley'
Publication date: 01/01/2008

The N200 and the feedback error-related negativity (fERN) are two components of the event-related brain potential (ERP) that share similar scalp distributions, time courses, morphologies, and functional dependencies, which raises the question of whether they are actually the same phenomenon. To investigate this issue, we recorded the ERP from participants engaged in two tasks that independently elicited the N200 and fERN. Our results indicate that they are, in fact, the same ERP component and further suggest that positive feedback elicits a positive-going deflection in the time range of the fERN. Taken together, these results indicate that negative feedback elicits a common N200 and that modulation of fERN amplitude results from the superposition on correct trials of a positive-going deflection that we term the feedback correct-related positivity.

Crossref

Ghent University Academic Bibliography

Dissociated roles of the anterior cingulate cortex in reward and conflict processing as revealed by the feedback error-related negativity and N200

Author: Baker Travis E.
Holroyd Clay
Publication venue: 'Elsevier BV'
Publication date: 01/01/2011

Ghent University Academic Bibliography

Remembering Forward: Neural Correlates of Memory and Prediction in Human Motor Adaptation

Author: Houk James
Mosier Kristine M.
Salowitz Nicole M.G.
Scheidt Robert A.
Simo Lucia
Suminski Aaron J.
Zimbelman Janice
Publication venue: e-Publications@Marquette
Publication date: 01/01/2012

We used functional MR imaging (fMRI), a robotic manipulandum and systems identification techniques to examine neural correlates of predictive compensation for spring-like loads during goal-directed wrist movements in neurologically intact humans. Although load changed unpredictably from one trial to the next, subjects nevertheless used sensorimotor memories from recent movements to predict and compensate upcoming loads. Prediction enabled subjects to adapt performance so that the task was accomplished with minimum effort. Population analyses of functional images revealed a distributed, bilateral network of cortical and subcortical activity supporting predictive load compensation during visual target capture. Cortical regions – including prefrontal, parietal and hippocampal cortices – exhibited trial-by-trial fluctuations in BOLD signal consistent with the storage and recall of sensorimotor memories or “states” important for spatial working memory. Bilateral activations in associative regions of the striatum demonstrated temporal correlation with the magnitude of kinematic performance error (a signal that could drive reward-optimizing reinforcement learning and the prospective scaling of previously learned motor programs). BOLD signal correlations with load prediction were observed in the cerebellar cortex and red nuclei (consistent with the idea that these structures generate adaptive fusimotor signals facilitating cancelation of expected proprioceptive feedback, as required for conditional feedback adjustments to ongoing motor commands and feedback error learning). Analysis of single-subject images revealed that predictive activity was at least as likely to be observed in more than one of these neural systems as in just one. We conclude therefore that motor adaptation is mediated by predictive compensations supported by multiple, distributed, cortical and subcortical structures.

epublications@Marquette

PubMed Central

High temporal discounters overvalue immediate rewards rather than undervalue future rewards : an event-related brain potential study

Author: Cherniawsky Avital S.
Holroyd Clay
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013

Impulsivity is characterized in part by heightened sensitivity to immediate relative to future rewards. Although previous research has suggested that "high discounters" in intertemporal choice tasks tend to prefer immediate over future rewards because they devalue the latter, it remains possible that they instead overvalue immediate rewards. To investigate this question, we recorded the reward positivity, a component of the event-related brain potential (ERP) associated with reward processing, with participants engaged in a task in which they received both immediate and future rewards and nonrewards. The participants also completed a temporal discounting task without ERP recording. We found that immediate but not future rewards elicited the reward positivity. High discounters also produced larger reward positivities to immediate rewards than did low discounters, indicating that high discounters relatively overvalued immediate rewards. These findings suggest that high discounters may be more motivated than low discounters to work for monetary rewards, irrespective of the time of arrival of the incentives

Ghent University Academic Bibliography

Reward positivity elicited by predictive cues

Author: Holroyd Clay
Krigolson Olav E.
Lee Seung
Publication venue: 'Ovid Technologies (Wolters Kluwer Health)'
Publication date: 01/01/2011

A recent theory holds that a component of the human event-related brain potential called the reward positivity reflects a reward prediction error signal. We investigated this idea in a gambling-like task in which, on each trial, a visual stimulus predicted a subsequent rewarding or nonrewarding outcome with 80% probability. Consistent with earlier results, we found that the reward positivity was larger to unexpected than to expected outcomes. In addition, we found that the predictive cues also elicited a reward positivity, as proposed by the theory. These results indicate that the reward positivity reflects the initial assessment of whether a trial will end in success or failure and the reappraisal of that information once the outcome actually occurs. NeuroReport 22:249-252 © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins
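Why a prediction error account expects a signal at both the cue and the outcome can be shown with a small worked sketch. Only the 80% cue validity comes from the abstract; the reward coding (1 for reward, 0 for nonreward), the 0.5 pre-cue expectation, and all names below are assumptions made for illustration.

```python
CUE_VALIDITY = 0.8       # from the abstract: cues predict outcomes with 80% probability
BASELINE_VALUE = 0.5     # assumed expectation before any cue appears

# Assumed coding: reward = 1.0, nonreward = 0.0, so a cue's value is P(reward | cue).
cue_value = {
    "reward_cue": CUE_VALIDITY,
    "nonreward_cue": 1.0 - CUE_VALIDITY,
}


def cue_rpe(cue):
    """Prediction error when the cue appears: cue value minus prior expectation."""
    return cue_value[cue] - BASELINE_VALUE


def outcome_rpe(cue, reward):
    """Prediction error at feedback: obtained reward minus the cue's value."""
    return reward - cue_value[cue]


for cue in cue_value:
    print(cue, "cue RPE:", round(cue_rpe(cue), 2))
    for reward in (1.0, 0.0):
        print("  reward", reward, "outcome RPE:", round(outcome_rpe(cue, reward), 2))
```

Under these assumptions a reward following the nonreward-predicting cue yields a larger positive error than a reward following the reward-predicting cue, and the cues themselves carry nonzero errors of opposite sign, which matches the pattern of results the abstract describes.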

Ghent University Academic Bibliography