首页> 美国卫生研究院文献>Frontiers in Neural Circuits >Oculomotor learning revisited: a model of reinforcement learning in the basal ganglia incorporating an efference copy of motor actions

【2h】

Oculomotor learning revisited: a model of reinforcement learning in the basal ganglia incorporating an efference copy of motor actions

机译：眼动学习再探：基底神经节的强化学习模型结合了运动动作的有效副本

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

In its simplest formulation, reinforcement learning is based on the idea that if an action taken in a particular context is followed by a favorable outcome, then, in the same context, the tendency to produce that action should be strengthened, or reinforced. While reinforcement learning forms the basis of many current theories of basal ganglia (BG) function, these models do not incorporate distinct computational roles for signals that convey context, and those that convey what action an animal takes. Recent experiments in the songbird suggest that vocal-related BG circuitry receives two functionally distinct excitatory inputs. One input is from a cortical region that carries context information about the current “time” in the motor sequence. The other is an efference copy of motor commands from a separate cortical brain region that generates vocal variability during learning. Based on these findings, I propose here a general model of vertebrate BG function that combines context information with a distinct motor efference copy signal. The signals are integrated by a learning rule in which efference copy inputs gate the potentiation of context inputs (but not efference copy inputs) onto medium spiny neurons in response to a rewarded action. The hypothesis is described in terms of a circuit that implements the learning of visually guided saccades. The model makes testable predictions about the anatomical and functional properties of hypothesized context and efference copy inputs to the striatum from both thalamic and cortical sources.

机译：在最简单的表述中，强化学习是基于这样的思想，即如果在特定情况下采取的行动后产生了有利的结果，那么在同一情况下，应当加强或加强产生该行动的趋势。尽管强化学习是当前许多基础神经节（BG）功能理论的基础，但这些模型并未为传达情境的信号以及传达动物所采取的行动的信号合并了不同的计算角色。鸣禽中的最新实验表明，与声音相关的BG电路接收到两个功能上不同的兴奋性输入。一个输入来自皮质区域，该区域携带有关电机序列中当前“时间”的上下文信息。另一个是来自单独皮质大脑区域的运动命令的相似副本，在学习过程中会产生声音变异。基于这些发现，我在这里提出了脊椎动物BG功能的通用模型，该模型将上下文信息与独特的运动效果复制信号相结合。通过学习规则对信号进行积分，在学习规则中，响应复制输入响应于奖励动作，将上下文输入（而不是参考复制输入）的增强作用控制在中棘神经元上。根据实现对视觉引导扫视的学习的电路来描述该假设。该模型可对来自假丘脑和皮层来源的假想背景和纹状体的仿生输入的解剖学和功能特性做出可测试的预测。

著录项

期刊名称 Frontiers in Neural Circuits
作者
Michale S. Fee;
展开▼
作者单位

展开▼
年(卷),期 2012(6),-1
年度 2012
页码 38
总页数 18
原文格式 PDF
正文语种
中图分类神经科学;
关键词
context corticostriatal efference copy motor learning songbird striatum thalamostriatal;

机译：上下文;皮质口;效仿;运动学习;鸣禽;纹状体;丘脑;

相似文献

外文文献
中文文献
专利

1. An extended reinforcement learning model of basal ganglia to understand the contributions of serotonin and dopamine in risk-based decision making, reward prediction, and punishment learning [J] . Balasubramani, Pragathi P. Frontiers in Computational Neuroscience . 2014,第4期

机译：扩展的基底神经节强化学习模型，以了解5-羟色胺和多巴胺在基于风险的决策，奖励预测和惩罚学习中的作用
2. Time representation in reinforcement learning models of the basal ganglia [J] . Gershman, Samuel Joseph Frontiers in Computational Neuroscience . 2014,第1期

机译：基底神经节强化学习模型中的时间表示
3. Actor-Critic Models of Reinforcement Learning in the Basal Ganglia: From Natural to Artificial Rats [J] . Mehdi Khamassi, Loiec Lacheze, Benoit Girard, Adaptive Behavior . 2005,第2期

机译：基底神经节中强化学习的演员-批判模型：从自然大鼠到人工大鼠
4. Reinforcement learning with multiple representations in the basal ganglia loops for sequential motor control [C] . Nakahara, H., Doya, . 1998

机译：在基底神经节回路中具有多种表示的强化学习，用于顺序运动控制
5. Neural network models of reinforcement learning and oculomotor decision-making in the basal ganglia and frontal cortex. [D] . Brown, Joshua W. 2001

机译：基底神经节和额叶皮层的强化学习和动眼神经决策的神经网络模型。
6. An extended reinforcement learning model of basal ganglia to understand the contributions of serotonin and dopamine in risk-based decision making reward prediction and punishment learning [O] . Pragathi P. Balasubramani, V. Srinivasa Chakravarthy, Balaraman Ravindran, 2014

机译：扩展的基底神经节强化学习模型以了解5-羟色胺和多巴胺在基于风险的决策奖励预测和惩罚学习中的作用
7. Oculomotor learning revisited: a model of reinforcement learning in the basal ganglia incorporating an efference copy of motor actions [O] . Michale Sean Fee 2012

机译：重新审视动眼神经学习：在基底神经节中加强学习的模型，包括运动动作的efference副本

Oculomotor learning revisited: a model of reinforcement learning in the basal ganglia incorporating an efference copy of motor actions

摘要

著录项

相似文献

相关主题

期刊订阅