Predictive reward signal of dopamine neurons.

Schultz W

首页> 外文期刊>Journal of Neurophysiology >Predictive reward signal of dopamine neurons.

【24h】

Predictive reward signal of dopamine neurons.

机译：多巴胺神经元的预测性奖励信号。

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The effects of lesions, receptor blocking, electrical self-stimulation, and drugs of abuse suggest that midbrain dopamine systems are involved in processing reward information and learning approach behavior. Most dopamine neurons show phasic activations after primary liquid and food rewards and conditioned, reward-predicting visual and auditory stimuli. They show biphasic, activation-depression responses after stimuli that resemble reward-predicting stimuli or are novel or particularly salient. However, only few phasic activations follow aversive stimuli. Thus dopamine neurons label environmental stimuli with appetitive value, predict and detect rewards and signal alerting and motivating events. By failing to discriminate between different rewards, dopamine neurons appear to emit an alerting message about the surprising presence or absence of rewards. All responses to rewards and reward-predicting stimuli depend on event predictability. Dopamine neurons are activated by rewarding events that are better than predicted, remain uninfluenced by events that are as good as predicted, and are depressed by events that are worse than predicted. By signaling rewards according to a prediction error, dopamine responses have the formal characteristics of a teaching signal postulated by reinforcement learning theories. Dopamine responses transfer during learning from primary rewards to reward-predicting stimuli. This may contribute to neuronal mechanisms underlying the retrograde action of rewards, one of the main puzzles in reinforcement learning. The impulse response releases a short pulse of dopamine onto many dendrites, thus broadcasting a rather global reinforcement signal to postsynaptic neurons. This signal may improve approach behavior by providing advance reward information before the behavior occurs, and may contribute to learning by modifying synaptic transmission. The dopamine reward signal is supplemented by activity in neurons in striatum, frontal cortex, and amygdala, which process specific reward information but do not emit a global reward prediction error signal. A cooperation between the different reward signals may assure the use of specific rewards for selectively reinforcing behaviors. Among the other projection systems, noradrenaline neurons predominantly serve attentional mechanisms and nucleus basalis neurons code rewards heterogeneously. Cerebellar climbing fibers signal errors in motor performance or errors in the prediction of aversive events to cerebellar Purkinje cells. Most deficits following dopamine-depleting lesions are not easily explained by a defective reward signal but may reflect the absence of a general enabling function of tonic levels of extracellular dopamine. Thus dopamine systems may have two functions, the phasic transmission of reward information and the tonic enabling of postsynaptic neurons.

机译：损害，受体阻滞，自我电刺激和滥用药物的影响表明，中脑多巴胺系统参与处理奖励信息和学习方法行为。大多数的多巴胺神经元在主要的液体和食物奖励以及有条件的，奖励预测的视觉和听觉刺激后显示出阶段性激活。它们在刺激后表现出双相的，激活-抑制反应，类似于奖励预测的刺激，或者是新颖的或特别显着的。但是，只有很少的阶段性激活遵循厌恶性刺激。因此，多巴胺神经元以环境价值标记环境刺激，预测和检测奖励并发出警报和激励事件。由于无法区分不同的奖励，多巴胺神经元似乎发出有关奖励存在或不存在的惊人提示。对奖励和奖励预测刺激的所有响应都取决于事件的可预测性。多巴胺神经元通过奖励比预期更好的事件而被激活，而不受与预期一样好的事件的影响，而由于不良于预期的事件而被压抑。通过根据预测误差发出奖励信号，多巴胺反应具有强化学习理论所假定的教学信号的形式特征。在学习过程中，多巴胺反应从初级奖励转换为预测奖励的刺激。这可能有助于奖赏逆行作用的神经元机制，这是强化学习的主要难题之一。冲动反应将短波多巴胺释放到许多树突上，从而向突触后神经元广播相当全面的增强信号。该信号可以通过在行为发生之前提供预先的奖励信息来改善进近行为，并且可以通过修改突触传递来促进学习。多巴胺奖励信号由纹状体，额叶皮层和杏仁核中神经元的活动所补充，这些神经元处理特定的奖励信息，但不会发出全局奖励预测错误信号。不同奖励信号之间的合作可以确保使用特定奖励来选择性地增强行为。在其他投影系统中，去甲肾上腺素神经元主要充当注意力机制，而基底核神经元编码的奖励异质性。小脑攀爬纤维向小脑浦肯野细胞发出运动性能错误或厌恶事件预测错误的信号。消耗多巴胺的病变后的大多数缺陷不易通过有缺陷的奖赏信号来解释，但可能反映出缺乏细胞外多巴胺的补品水平的一般启用功能。因此，多巴胺系统可能具有两种功能，即奖励信息的阶段性传递和突触后神经元的补药作用。

著录项

来源
《Journal of Neurophysiology》 |1998年第1期|共27页
作者
Schultz W;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类人体生理学;
关键词
Brain; Dopamine; Learning; Neurons; Reward; 脑; 多巴胺; 学习; 神经元; 奖励;

机译：Brain;Dopamine;Learning;Neurons;Reward;脑;多巴胺;学习;神经元;奖励;

相似文献

外文文献
中文文献
专利

1. Predictive reward signal of dopamine neurons. [J] . Schultz W Journal of Neurophysiology . 1998,第1期

机译：多巴胺神经元的预测性奖励信号。
2. TD models of reward predictive responses in dopamine neurons. [J] . Suri RE Neural Networks: The Official Journal of the International Neural Network Society . 2002,第4a6期

机译：TD模型的多巴胺神经元的奖励预测反应。
3. Multilocus genetic composite reflecting dopamine signaling capacity predicts reward circuitry responsivity [J] . SticeE., YokumS., BurgerK., The Journal of Neuroscience: The Official Journal of the Society for Neuroscience . 2012,第29期

机译：反映多巴胺信号传导能力的多基因遗传复合物预测奖励电路的响应性
4. The dopamine D_3 system: new opportunities for dopamine-based reward [C] . Christian Heidbrede Symposium on Understanding Nicotine and Tobacco Addiction . 2006

机译：多巴胺D_3系统：基于多巴胺的奖励的新机会
5. Roles of dopamine D1 and D2, opiod and glutamate NMDA receptor signaling in the acquisition and expression of fat- and glucose-conditioned flavor preferences in rats and c-Fos analysis of the dopamine mesotelencephalic and nigrostriatal pathways following intake of sugars and fats in rats. [D] . Dela Cruz, Julie. 2014

机译：多巴胺D1和D2，阿片和谷氨酸NMDA受体信号传导在大鼠脂肪和葡萄糖调节的风味偏好的获取和表达中的作用以及在摄入糖和脂肪后多巴胺中脑和黑质纹状体通路的c-Fos分析。
6. Primary food reward and reward predictive stimuli evoke different patterns of phasic dopamine signaling throughout the striatum [O] . Holden D. Brown, James E. McCutcheon, Jackson J. Cone, -1

机译：初级食品奖励和奖励预测刺激在整个纹状体中引起不同模式的相阶段多巴胺信号传导
7. Primary food reward and reward-predictive stimuli evoke different patterns of phasic dopamine signaling throughout the striatum [O] . Brown, Holden D., McCutcheon, James E ., Cone, Jackson J., 2011

机译：初级食物奖励和奖励预测刺激在整个纹状体中引起了不同的阶段性多巴胺信号传导模式

Predictive reward signal of dopamine neurons.

摘要

著录项

相似文献

相关主题

期刊订阅