首页> 外文会议>Annual conference on Genetic and evolutionary computation >Standard and averaging reinforcement learning in XCS
【24h】

Standard and averaging reinforcement learning in XCS

机译:XCS中的标准和平均钢筋学习

获取原文

摘要

This paper investigates reinforcement learning (RL) in XCS. First, it formally shows that XCS implements a method of generalized RL based on linear approximators, in which the usual input mapping function translates the state-action space into a niche relative fitness space. Then, it shows that, although XCS has always been related to standard RL, XCS is actually a method of averaging RL. More precisely, XCS with gradient descent can be actually derived from the typical update of averaging RL. It is noted that the use of averaging RL in XCS introduces an intrinsic preference toward classifiers with a smaller fitness in the niche. It is argued that, because of the accuracy pressure in XCS, this results in an additional preference toward specificity. A very simple experiment is presented to support this hypothesis. The same approach is applied to XCS with computed prediction (XCSF) and similar conclusions are drawn.
机译:本文调查XCS中的加固学习(RL)。首先,它正式地示出了XC基于线性近似器实现了一种广义RL的方法,其中通常的输入映射函数将状态动作空间转换为利基相对适应空间。然后,它表明,尽管XCS始终与标准RL相关,但XC实际上是一种平均RL的方法。更确切地说,具有梯度下降的XC可以实际导出自平均R1的典型更新。应注意,在XC中使用平均R1引入了对分类器的固有偏好,在利基中具有较小的适应性。认为,由于XCS中的精度压力,这导致朝向特异性的额外偏好。提出了一个非常简单的实验以支持这一假设。使用计算预测(XCSF)的XC应用相同的方法,并绘制类似的结论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号