pubmed.ncbi.nlm.nih.gov

Single dose of a dopamine agonist impairs reinforcement learning in humans: evidence from event-related potentials and computational modeling of striatal-cortical function - PubMed

Randomized Controlled Trial

. 2009 Jul;30(7):1963-76.

doi: 10.1002/hbm.20642.

Affiliations

PMID: 18726908
PMCID: PMC3034238
DOI: 10.1002/hbm.20642

Randomized Controlled Trial

Single dose of a dopamine agonist impairs reinforcement learning in humans: evidence from event-related potentials and computational modeling of striatal-cortical function

Diane L Santesso et al. Hum Brain Mapp. 2009 Jul.

Abstract

Animal findings have highlighted the modulatory role of phasic dopamine (DA) signaling in incentive learning, particularly in the acquisition of reward-related behavior. In humans, these processes remain largely unknown. In a recent study, we demonstrated that a single low dose of a D2/D3 agonist (pramipexole)-assumed to activate DA autoreceptors and thus reduce phasic DA bursts-impaired reward learning in healthy subjects performing a probabilistic reward task. The purpose of this study was to extend these behavioral findings using event-related potentials and computational modeling. Compared with the placebo group, participants receiving pramipexole showed increased feedback-related negativity to probabilistic rewards and decreased activation in dorsal anterior cingulate regions previously implicated in integrating reinforcement history over time. Additionally, findings of blunted reward learning in participants receiving pramipexole were simulated by reduced presynaptic DA signaling in response to reward in a neural network model of striatal-cortical function. These preliminary findings offer important insights on the role of phasic DA signals on reinforcement learning in humans and provide initial evidence regarding the spatiotemporal dynamics of brain mechanisms underlying these processes.

PubMed Disclaimer

Figures

**Figure 1**
Neural network model of cortico‐striatal circuitry (squares represent units, with height and color reflecting neural activity; yellow, most active; red, less active; gray, not active). The model includes the direct (Go) and indirect (NoGo) pathways of the basal ganglia [Frank,2005,2006]. The Go cells disinhibit the thalamus via the internal segment of globus pallidus (GPi) and thereby facilitate the execution of an action represented in cortex. The NoGo cells have an opposing effect by increasing inhibition of the thalamus, which suppresses actions and thereby keeps them from being executed. Dopamine from the substantia nigra pars compacta (SNc) projects to the dorsal striatum. A tonic level of dopamine is shown in SNc; a burst or dip ensues in a subsequent error feedback phase, causing corresponding changes in Go/NoGo unit activations, which drive learning, via simulated D1 and D2 receptors. Pramipexole was simulated by reducing the size of DA bursts during rewards to simulate presynaptic autoreceptor effects induced by the low dose. [Color figure can be viewed in the online issue, which is available at www.interscience.wiley.com.]

**Figure 2**
Left panel: Summary of (a) response bias; (b) discriminability; (c) accuracy for the more frequently rewarded (rich) stimulus; and (d) accuracy for the less frequently rewarded (lean) stimulus. Figures modified from Pizzagalli et al. [2008] with permission. Right panel: Corresponding variables for the intact neural network of cortico‐striatal circuitry (“placebo groups”) and the neural network simulating reduced presynaptic DA bursts in response to rewards (“pramipexole group”). Error bars refer to standard errors.

**Figure 3**
(a) Averaged ERP waveforms from 200 ms before to 600 ms after the presentation of correct feedback during the probabilistic reward task for the pramipexole (heavy line) and placebo (light line) group averaged across Fz, FCz, and Cz; (b) Topographic map of the FRN difference wave between the pramipexole and placebo group (pramipexole minus placebo); and (c) Results of voxel‐by‐voxel independent t‐tests contrasting current density for the placebo and pramipexole group in response to reward feedback. Red: relatively higher activity for placebo subjects. Blue: relatively higher activity for pramipexole subjects. Statistical map is thresholded at P < 0.005 and displayed on the MNI template.

Cited by

COMT Val(158) Met genotype is associated with reward learning: a replication study and meta-analysis.
Corral-Frías NS, Pizzagalli DA, Carré JM, Michalski LJ, Nikolova YS, Perlis RH, Fagerness J, Lee MR, Conley ED, Lancaster TM, Haddad S, Wolf A, Smoller JW, Hariri AR, Bogdan R. Corral-Frías NS, et al. Genes Brain Behav. 2016 Jun;15(5):503-13. doi: 10.1111/gbb.12296. Genes Brain Behav. 2016. PMID: 27138112 Free PMC article.
Dopaminergic Medication Modulates Learning from Feedback and Error-Related Negativity in Parkinson's Disease: A Pilot Study.
Volpato C, Schiff S, Facchini S, Silvoni S, Cavinato M, Piccione F, Antonini A, Birbaumer N. Volpato C, et al. Front Behav Neurosci. 2016 Oct 24;10:205. doi: 10.3389/fnbeh.2016.00205. eCollection 2016. Front Behav Neurosci. 2016. PMID: 27822182 Free PMC article.
Meditation experience predicts negative reinforcement learning and is associated with attenuated FRN amplitude.
Knytl P, Opitz B. Knytl P, et al. Cogn Affect Behav Neurosci. 2019 Apr;19(2):268-282. doi: 10.3758/s13415-018-00665-0. Cogn Affect Behav Neurosci. 2019. PMID: 30446979 Free PMC article.
Neural mechanisms of acquired phasic dopamine responses in learning.
Hazy TE, Frank MJ, O'Reilly RC. Hazy TE, et al. Neurosci Biobehav Rev. 2010 Apr;34(5):701-20. doi: 10.1016/j.neubiorev.2009.11.019. Epub 2009 Nov 26. Neurosci Biobehav Rev. 2010. PMID: 19944716 Free PMC article. Review.
Dopaminergic modulation of performance monitoring in Parkinson's disease: An event-related potential study.
Seer C, Lange F, Loens S, Wegner F, Schrader C, Dressler D, Dengler R, Kopp B. Seer C, et al. Sci Rep. 2017 Jan 24;7:41222. doi: 10.1038/srep41222. Sci Rep. 2017. PMID: 28117420 Free PMC article.

References

1. Abler B,Erk S,Walter H ( 2007): Human reward system activation is modulated by a single dose of olanzapine in healthy subjects in an event‐related, double‐blind, placebo‐controlled fMRI study. Psychopharmacology (Berl) 191: 823–833. - PubMed
1. Akitsuki Y,Sugiura M,Watanabe J,Yamashita K,Sassa Y,Awata S,Matsuoka H,Maeda Y,Matsue Y,Fukuda H,Kawashima R ( 2003): Context‐dependent cortical activation in response to financial reward and penalty: An event‐related fMRI study. Neuroimage 19: 1674–1685. - PubMed
1. Amiez C,Joseph JP,Procyk E ( 2006): Reward encoding in the monkey anterior cingulate cortex. Cereb Cortex 16: 1040–1055. - PMC - PubMed
1. Bayer HM,Glimcher PW ( 2005): Midbrain dopamine neurons encode a quantitative reward prediction error signal. Neuron 47: 129–141. - PMC - PubMed
1. Beck AT,Steer RA,Brown GK ( 1996): Beck Depression Inventory Manual, 2nd ed. San Antonio, TX: The Psychological Corporation.

Single dose of a dopamine agonist impairs reinforcement learning in humans: evidence from event-related potentials and computational modeling of striatal-cortical function - PubMed

Single dose of a dopamine agonist impairs reinforcement learning in humans: evidence from event-related potentials and computational modeling of striatal-cortical function

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources