首页> 美国卫生研究院文献>PLoS Computational Biology >Reinforcement Learning of Linking and Tracing Contours in Recurrent Neural Networks
【2h】

Reinforcement Learning of Linking and Tracing Contours in Recurrent Neural Networks

机译:递归神经网络中轮廓的链接和追踪的强化学习

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The processing of a visual stimulus can be subdivided into a number of stages. Upon stimulus presentation there is an early phase of feedforward processing where the visual information is propagated from lower to higher visual areas for the extraction of basic and complex stimulus features. This is followed by a later phase where horizontal connections within areas and feedback connections from higher areas back to lower areas come into play. In this later phase, image elements that are behaviorally relevant are grouped by Gestalt grouping rules and are labeled in the cortex with enhanced neuronal activity (object-based attention in psychology). Recent neurophysiological studies revealed that reward-based learning influences these recurrent grouping processes, but it is not well understood how rewards train recurrent circuits for perceptual organization. This paper examines the mechanisms for reward-based learning of new grouping rules. We derive a learning rule that can explain how rewards influence the information flow through feedforward, horizontal and feedback connections. We illustrate the efficiency with two tasks that have been used to study the neuronal correlates of perceptual organization in early visual cortex. The first task is called contour-integration and demands the integration of collinear contour elements into an elongated curve. We show how reward-based learning causes an enhancement of the representation of the to-be-grouped elements at early levels of a recurrent neural network, just as is observed in the visual cortex of monkeys. The second task is curve-tracing where the aim is to determine the endpoint of an elongated curve composed of connected image elements. If trained with the new learning rule, neural networks learn to propagate enhanced activity over the curve, in accordance with neurophysiological data. We close the paper with a number of model predictions that can be tested in future neurophysiological and computational studies.
机译:视觉刺激的处理可以细分为多个阶段。在刺激呈现时,存在前馈处理的早期阶段,其中视觉信息从较低的视觉区域传播到较高的视觉区域,以提取基本和复杂的刺激特征。随后是下一阶段,其中区域内的水平连接以及从较高区域到较低区域的反馈连接开始起作用。在随后的阶段中,与行为相关的图像元素由格式塔分组规则进行分组,并在皮质中用增强的神经元活动进行标记(心理学中基于对象的注意)。最近的神经生理学研究表明,基于奖励的学习会影响这些复发性分组过程,但人们还不太了解奖励如何训练感知组织的循环回路。本文研究了基于奖励的新分组规则学习机制。我们得出一个学习规则,可以解释奖励如何通过前馈,水平和反馈连接影响信息流。我们用两个任务说明了效率,这些任务已被用来研究早期视觉皮层中知觉组织的神经元相关性。第一项任务称为轮廓集成,要求将共线轮廓元素集成到细长曲线中。我们展示了基于奖励的学习如何在循环神经网络的早期水平上增强待分组元素的表示,就像在猴子的视觉皮层中观察到的那样。第二项任务是曲线跟踪,其目的是确定由连接的图像元素组成的细长曲线的端点。如果用新的学习规则训练,神经网络将根据神经生理学数据学习在曲线上传播增强的活动。我们以许多可以在未来的神经生理学和计算研究中检验的模型预测来结束本文。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号