首页> 美国卫生研究院文献>Frontiers in Neural Circuits >Oculomotor learning revisited: a model of reinforcement learning in the basal ganglia incorporating an efference copy of motor actions
【2h】

Oculomotor learning revisited: a model of reinforcement learning in the basal ganglia incorporating an efference copy of motor actions

机译:眼动学习再探:基底神经节的强化学习模型结合了运动动作的有效副本

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In its simplest formulation, reinforcement learning is based on the idea that if an action taken in a particular context is followed by a favorable outcome, then, in the same context, the tendency to produce that action should be strengthened, or reinforced. While reinforcement learning forms the basis of many current theories of basal ganglia (BG) function, these models do not incorporate distinct computational roles for signals that convey context, and those that convey what action an animal takes. Recent experiments in the songbird suggest that vocal-related BG circuitry receives two functionally distinct excitatory inputs. One input is from a cortical region that carries context information about the current “time” in the motor sequence. The other is an efference copy of motor commands from a separate cortical brain region that generates vocal variability during learning. Based on these findings, I propose here a general model of vertebrate BG function that combines context information with a distinct motor efference copy signal. The signals are integrated by a learning rule in which efference copy inputs gate the potentiation of context inputs (but not efference copy inputs) onto medium spiny neurons in response to a rewarded action. The hypothesis is described in terms of a circuit that implements the learning of visually guided saccades. The model makes testable predictions about the anatomical and functional properties of hypothesized context and efference copy inputs to the striatum from both thalamic and cortical sources.
机译:在最简单的表述中,强化学习是基于这样的思想,即如果在特定情况下采取的行动后产生了有利的结果,那么在同一情况下,应当加强或加强产生该行动的趋势。尽管强化学习是当前许多基础神经节(BG)功能理论的基础,但这些模型并未为传达情境的信号以及传达动物所采取的行动的信号合并了不同的计算角色。鸣禽中的最新实验表明,与声音相关的BG电路接收到两个功能上不同的兴奋性输入。一个输入来自皮质区域,该区域携带有关电机序列中当前“时间”的上下文信息。另一个是来自单独皮质大脑区域的运动命令的相似副本,在学习过程中会产生声音变异。基于这些发现,我在这里提出了脊椎动物BG功能的通用模型,该模型将上下文信息与独特的运动效果复制信号相结合。通过学习规则对信号进行积分,在学习规则中,响应复制输入响应于奖励动作,将上下文输入(而不是参考复制输入)的增强作用控制在中棘神经元上。根据实现对视觉引导扫视的学习的电路来描述该假设。该模型可对来自假丘脑和皮层来源的假想背景和纹状体的仿生输入的解剖学和功能特性做出可测试的预测。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号