首页> 美国卫生研究院文献>Frontiers in Neural Circuits >Functional Relevance of Different Basal Ganglia Pathways Investigated in a Spiking Model with Reward Dependent Plasticity

【2h】

Functional Relevance of Different Basal Ganglia Pathways Investigated in a Spiking Model with Reward Dependent Plasticity

机译：具有奖励依赖可塑性的尖峰模型中研究的不同基底神经节通路的功能相关性

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

The brain enables animals to behaviorally adapt in order to survive in a complex and dynamic environment, but how reward-oriented behaviors are achieved and computed by its underlying neural circuitry is an open question. To address this concern, we have developed a spiking model of the basal ganglia (BG) that learns to dis-inhibit the action leading to a reward despite ongoing changes in the reward schedule. The architecture of the network features the two pathways commonly described in BG, the direct (denoted D1) and the indirect (denoted D2) pathway, as well as a loop involving striatum and the dopaminergic system. The activity of these dopaminergic neurons conveys the reward prediction error (RPE), which determines the magnitude of synaptic plasticity within the different pathways. All plastic connections implement a versatile four-factor learning rule derived from Bayesian inference that depends upon pre- and post-synaptic activity, receptor type, and dopamine level. Synaptic weight updates occur in the D1 or D2 pathways depending on the sign of the RPE, and an efference copy informs upstream nuclei about the action selected. We demonstrate successful performance of the system in a multiple-choice learning task with a transiently changing reward schedule. We simulate lesioning of the various pathways and show that a condition without the D2 pathway fares worse than one without D1. Additionally, we simulate the degeneration observed in Parkinson's disease (PD) by decreasing the number of dopaminergic neurons during learning. The results suggest that the D1 pathway impairment in PD might have been overlooked. Furthermore, an analysis of the alterations in the synaptic weights shows that using the absolute reward value instead of the RPE leads to a larger change in D1.

机译：大脑可以使动物适应行为，以便在复杂而动态的环境中生存，但是如何通过其底层的神经回路实现和计算奖励导向的行为却是一个悬而未决的问题。为了解决这个问题，我们开发了基底神经节（BG）的峰值模型，该模型学会了在奖励时间表不断变化的情况下，抑制导致奖励的动作。该网络的体系结构具有BG中通常描述的两个路径，即直接（表示为D1）和间接（表示为D2）路径，以及涉及纹状体和多巴胺能系统的回路。这些多巴胺能神经元的活动传达了奖励预测误差（RPE），该误差决定了不同途径中突触可塑性的大小。所有塑料连接都实现了一种通用的四因素学习规则，该规则是根据贝叶斯推断得出的，该规则取决于突触前后的活动，受体类型和多巴胺水平。取决于RPE的征兆，D1或D2途径中会发生突触权重更新，而有效拷贝会向上游核告知所选的作用。我们演示了在选择项学习任务中系统的成功表现，该学习方案具有瞬息万变的奖励计划。我们模拟了各种途径的损害，并表明没有D2途径的病情要比没有D1的病情更糟。此外，我们通过减少学习过程中多巴胺能神经元的数量来模拟在帕金森氏病（PD）中观察到的变性。结果表明PD中的D1通路损伤可能已被忽略。此外，对突触权重变化的分析表明，使用绝对奖励值代替RPE会导致D1的较大变化。

著录项

期刊名称 Frontiers in Neural Circuits
作者
Pierre Berthet; Mikael Lindahl; Philip J. Tully; Jeanette Hellgren-Kotaleski; Anders Lansner;
展开▼
作者单位

展开▼
年(卷),期 2016(10),-1
年度 2016
页码 53
总页数 21
原文格式 PDF
正文语种
中图分类神经科学;
关键词
basal ganglia action selection reinforcement learning synaptic plasticity dopamine reward prediction error Parkinsons disease;

机译：基底神经节;动作选择;强化学习;突触可塑性;多巴胺;奖赏预测误差;帕金森氏病;

相似文献

外文文献
中文文献
专利

1. Functional Relevance of Different Basal Ganglia Pathways Investigated in a Spiking Model with Reward Dependent Plasticity [J] . Pierre Berthet, Mikael Lindahl, Philip J. Tully, Frontiers in Neural Circuits . 2016,第10期

机译：具有奖励依赖可塑性的尖峰模型中研究的不同基底神经节通路的功能相关性
2. Functional requirements for reward-modulated spike-timing-dependent plasticity. [J] . Fremaux N, Sprekeler H, Gerstner W The Journal of Neuroscience: The Official Journal of the Society for Neuroscience . 2010,第40期

机译：奖励调制的依赖尖峰时序的可塑性的功能要求。
3. Effects of spike-time dependent plasticity on deep brain stimulation of the basal ganglia for treatment of Parkinson's disease [J] . Logan L Grado, Matthew D Johnson, Theoden I Netoff BMC Neuroscience . 2015,第SUPPLEMENTa1期

机译：尖峰时间依赖性可塑性对基底神经节深部脑刺激治疗帕金森氏病的影响
4. Learning from Delayed Reward und Punishment in a Spiking Neural Network Model of Basal Ganglia with Opposing D1/D2 Plasticity [C] . Jenia Jitsev, Nobi Abraham, Abigail Morrison, International conference on artificial neural networks . 2012

机译：在具有相反D1 / D2可塑性的基础神经节突刺神经网络模型中从延迟奖励和惩罚中学习
5. Indirect Training Algorithms for Spiking Neural Networks based on Spiking Timing Dependent Plasticity and Their Applications. [D] . Zhang, Xu. 2017

机译：基于尖峰时序相关可塑性的尖峰神经网络间接训练算法及其应用。
6. Functional Requirements for Reward-Modulated Spike-Timing-Dependent Plasticity [O] . Nicolas Frémaux, Henning Sprekeler, Wulfram Gerstner 2010

机译：奖励调制的穗定时依赖可塑性的功能要求
7. Functional Relevance of Different Basal Ganglia Pathways Investigated in a Spiking Model with Reward Dependent Plasticity [O] . Pierre Berthet, Mikael Lindahl, Philip Joseph Tully, 2016

机译：具有奖赏依赖可塑性的尖峰模型研究不同基底神经节通路的功能相关性

Functional Relevance of Different Basal Ganglia Pathways Investigated in a Spiking Model with Reward Dependent Plasticity

摘要

著录项

相似文献

相关主题

期刊订阅