
Anticipatory reward signals in ventral striatal neurons of behaving rats.

Abstract

It has been proposed that the striatum plays a crucial role in learning to select appropriate actions, optimizing rewards according to the principles of 'Actor-Critic' models of trial-and-error learning. The ventral striatum (VS), as Critic, would employ a temporal difference (TD) learning algorithm to predict rewards and drive dopaminergic neurons. This study examined this model's adequacy for VS responses to multiple rewards in rats. The respective arms of a plus-maze provided rewards of varying magnitudes; multiple rewards were provided at 1-s intervals while the rat stood still. Neurons discharged phasically prior to each reward, during both initial approach and immobile waiting, demonstrating that this signal is predictive and not simply motor-related. In different neurons, responses could be greater for early, middle or late droplets in the sequence. Strikingly, this activity often reappeared after the final reward, as if in anticipation of yet another. In contrast, previous TD learning models show decremental reward-prediction profiles during reward consumption due to a temporal-order signal introduced to reproduce accurate timing in dopaminergic reward-prediction error signals. To resolve this inconsistency in a biologically plausible manner, we adapted the TD learning model such that input information is nonhomogeneously distributed among different neurons. By suppressing reward temporal-order signals and varying richness of spatial and visual input information, the model reproduced the experimental data. This validates the feasibility of a TD-learning architecture where different groups of neurons participate in solving the task based on varied input information.
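
The abstract refers to the temporal difference (TD) learning rule that the Critic is assumed to use for reward prediction. Purely as an illustrative aid (this is not the authors' model code), the following is a minimal sketch of a tabular TD(0) value update driving a reward-prediction error signal; the state labels, reward schedule, learning rate, and discount factor are hypothetical placeholders chosen to mimic a sequence of reward droplets delivered at fixed intervals.

    # Minimal TD(0) Critic sketch (illustrative only; not the published model).
    # All parameters and states below are assumed for demonstration.
    alpha = 0.1   # learning rate (assumed)
    gamma = 0.9   # temporal discount factor (assumed)

    # Hypothetical state sequence: approach steps followed by reward-delivery steps.
    states  = ["approach_1", "approach_2", "wait_1", "wait_2", "wait_3", "end"]
    rewards = {"wait_1": 1.0, "wait_2": 1.0, "wait_3": 1.0}  # droplets at 1-s steps

    V = {s: 0.0 for s in states}  # Critic's value estimate for each state

    for episode in range(200):
        for t in range(len(states) - 1):
            s, s_next = states[t], states[t + 1]
            r = rewards.get(s_next, 0.0)          # reward obtained on this transition
            delta = r + gamma * V[s_next] - V[s]  # reward-prediction error (dopamine-like)
            V[s] += alpha * delta                 # TD(0) value update

    print({s: round(v, 2) for s, v in V.items()})

In this toy version every state shares the same input representation; the adaptation described in the abstract would instead distribute spatial and visual input information nonhomogeneously across different groups of model neurons and suppress the reward temporal-order signal.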
