Article (Scientific journals)
Reward prediction error neurons implement an efficient code for reward.
SCHÜTT, Heiko; Kim, Dongjae; Ma, Wei Ji
2024In Nature Neuroscience, 27 (7), p. 1333 - 1339
Peer Reviewed verified by ORBi
 

Files


Full Text
2022.11.03.515104.full.pdf
Author preprint (2.2 MB) Creative Commons License - Attribution, Non-Commercial, No Derivative
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
Animals; Mice; Reinforcement, Psychology; Neurons/physiology; Dopaminergic Neurons/physiology; Macaca mulatta; Male; Reward; Models, Neurological; Dopaminergic Neurons; Neurons; Neuroscience (all)
Abstract :
[en] We use efficient coding principles borrowed from sensory neuroscience to derive the optimal neural population to encode a reward distribution. We show that the responses of dopaminergic reward prediction error neurons in mouse and macaque are similar to those of the efficient code in the following ways: the neurons have a broad distribution of midpoints covering the reward distribution; neurons with higher thresholds have higher gains, more convex tuning functions and lower slopes; and their slope is higher when the reward distribution is narrower. Furthermore, we derive learning rules that converge to the efficient code. The learning rule for the position of the neuron on the reward axis closely resembles distributional reinforcement learning. Thus, reward prediction error neuron responses may be optimized to broadcast an efficient reward signal, forming a connection between efficient coding and reinforcement learning, two of the most successful theories in computational neuroscience.
Disciplines :
Neurosciences & behavior
Author, co-author :
SCHÜTT, Heiko   ;  University of Luxembourg ; Center for Neural Science and Department of Psychology, New York University, New York, NY, USA. heiko.schutt@uni.lu
Kim, Dongjae ;  Center for Neural Science and Department of Psychology, New York University, New York, NY, USA ; Department of AI-Based Convergence, Dankook University, Yongin, Republic of Korea
Ma, Wei Ji ;  Center for Neural Science and Department of Psychology, New York University, New York, NY, USA
 These authors have contributed equally to this work.
External co-authors :
yes
Language :
English
Title :
Reward prediction error neurons implement an efficient code for reward.
Publication date :
July 2024
Journal title :
Nature Neuroscience
ISSN :
1097-6256
eISSN :
1546-1726
Publisher :
Nature Research, United States
Volume :
27
Issue :
7
Pages :
1333 - 1339
Peer reviewed :
Peer Reviewed verified by ORBi
Available on ORBilu :
since 16 December 2024

Statistics


Number of views
93 (9 by Unilu)
Number of downloads
27 (4 by Unilu)

Scopus citations®
 
7
Scopus citations®
without self-citations
7
OpenCitations
 
0
OpenAlex citations
 
9
WoS citations
 
6

Bibliography


Similar publications



Contact ORBilu