Actor-Critic Models of Reinforcement Learning in the Basal Ganglia: From Natural to Artificial Rats

Mehdi Khamassi; Loiec Lacheze; Benoit Girard; Alain Berthoz; Agnes Guillot

首页> 外文期刊>Adaptive Behavior >Actor-Critic Models of Reinforcement Learning in the Basal Ganglia: From Natural to Artificial Rats

【24h】

Actor-Critic Models of Reinforcement Learning in the Basal Ganglia: From Natural to Artificial Rats

机译：基底神经节中强化学习的演员-批判模型：从自然大鼠到人工大鼠

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Since 1995, numerous Actor-Critic architectures for reinforcement learning have been proposed as models of dopamine-like reinforcement learning mechanisms in the rat's basal ganglia. However, these models were usually tested in different tasks, and it is then difficult to compare their efficiency for an autonomous animat. We present here the comparison of four architectures in an animat as it performs the same reward-seeking task. This will illustrate the consequences of different hypotheses about the management of different Actor sub-modules and Critic units, and their more or less autonomously determined coordination. We show that the classical method of coordination of modules by mixture of experts, depending on each module's performance, did not allow solving our task. Then we address the question of which principle should be applied efficiently to combine these units. Improvements for Critic modeling and accuracy of Actor-Critic models for a natural task are finally discussed in the perspective of our Psikharpax project—an artificial rat having to survive autonomously in unpredictable environments.

机译：自1995年以来，已经提出了许多用于强化学习的Actor-Critic体系结构，作为大鼠基底神经节中多巴胺样强化学习机制的模型。但是，这些模型通常在不同的任务中进行测试，因此很难比较它们对于自主动画的效率。我们在这里展示了动画中四种架构的比较，因为它执行相同的奖励任务。这将说明有关不同Actor子模块和Critic单元的管理的不同假设的后果，以及它们或多或少地自主确定的协调。我们证明了由专家的混合来协调模块的经典方法（取决于每个模块的性能）不能解决我们的任务。然后，我们讨论应该有效应用哪种原理来组合这些单元的问题。最后，从我们的Psikharpax项目的角度讨论了针对自然任务的Critic建模和Actor-Critic模型的准确性的改进，该项目是必须在无法预测的环境中自主生存的人工大鼠。

著录项

来源
《Adaptive Behavior》 |2005年第2期|p.131-148|共18页
作者
Mehdi Khamassi; Loiec Lacheze; Benoit Girard; Alain Berthoz; Agnes Guillot;
展开▼
作者单位

AnimatLab, LIP6, 8 rue du capitaine Scott, 75015 Paris, France;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类生物科学;
关键词
animat approach; TD teaming; actor-critic model; S-R task; taxon navigation;

机译：animat方法;TD分组;行为批评模型;S-R任务;分类导航;

相似文献

外文文献
中文文献
专利

1. Believer-Skeptic Meets Actor-Critic: Rethinking the Role of Basal Ganglia Pathways during Decision-Making and Reinforcement Learning [J] . Kyle Dunovan, Timothy Verstynen Frontiers in Neuroscience . 2016,第2009期

机译：信奉怀疑论者遇到演员批评者：重新思考基础神经节通路在决策和强化学习中的作用
2. THE ROLE OF THE BASAL GANGLIA IN EXPLORATION IN A NEURAL MODEL BASED ON REINFORCEMENT LEARNING [J] . D. SRIDHARAN, P. S. PRASHANTH, V. S. CHAKRAVARTHY International Journal of Neural Systems . 2006,第2期

机译：基于强化学习的神经元模型在基底神经节勘探中的作用
3. An extended reinforcement learning model of basal ganglia to understand the contributions of serotonin and dopamine in risk-based decision making, reward prediction, and punishment learning [J] . Balasubramani, Pragathi P. Frontiers in Computational Neuroscience . 2014,第4期

机译：扩展的基底神经节强化学习模型，以了解5-羟色胺和多巴胺在基于风险的决策，奖励预测和惩罚学习中的作用
4. Combining Self-organizing Maps with Mixtures of Experts: Application to an Actor-Critic Model of Reinforcement Learning in the Basal Ganglia [C] . Mehdi Khamassi, Louis-Emmanuel Martinet, Agnes Guillot From Animals to Animats 9; Lecture Notes in Artificial Intelligence; 4095 . 2006

机译：将自组织地图与专家混合在一起：在基础神经节的强化学习的演员-批评模型中的应用
5. Neural network models of reinforcement learning and oculomotor decision-making in the basal ganglia and frontal cortex. [D] . Brown, Joshua W. 2001

机译：基底神经节和额叶皮层的强化学习和动眼神经决策的神经网络模型。
6. Believer-Skeptic Meets Actor-Critic: Rethinking the Role of Basal Ganglia Pathways during Decision-Making and Reinforcement Learning [O] . Kyle Dunovan, Timothy Verstynen 2016

机译：怀疑论者遇到演员批评者：重新思考基础神经节通路在决策和强化学习中的作用
7. Actor-critic models of reinforcement learning in the basal ganglia: From natural to artificial rats [O] . Khamassi, Mehdi, Lachèze, Loïc, Girard, Benoît, 2005

机译：基底神经节中强化学习的行为批评模型：从自然大鼠到人工大鼠

Actor-Critic Models of Reinforcement Learning in the Basal Ganglia: From Natural to Artificial Rats

摘要

著录项

相似文献

相关主题

期刊订阅