Standard and averaging reinforcement learning in XCS

机译：XCS中的标准和平均钢筋学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper investigates reinforcement learning (RL) in XCS. First, it formally shows that XCS implements a method of generalized RL based on linear approximators, in which the usual input mapping function translates the state-action space into a niche relative fitness space. Then, it shows that, although XCS has always been related to standard RL, XCS is actually a method of averaging RL. More precisely, XCS with gradient descent can be actually derived from the typical update of averaging RL. It is noted that the use of averaging RL in XCS introduces an intrinsic preference toward classifiers with a smaller fitness in the niche. It is argued that, because of the accuracy pressure in XCS, this results in an additional preference toward specificity. A very simple experiment is presented to support this hypothesis. The same approach is applied to XCS with computed prediction (XCSF) and similar conclusions are drawn.

机译：本文调查XCS中的加固学习（RL）。首先，它正式地示出了XC基于线性近似器实现了一种广义RL的方法，其中通常的输入映射函数将状态动作空间转换为利基相对适应空间。然后，它表明，尽管XCS始终与标准RL相关，但XC实际上是一种平均RL的方法。更确切地说，具有梯度下降的XC可以实际导出自平均R1的典型更新。应注意，在XC中使用平均R1引入了对分类器的固有偏好，在利基中具有较小的适应性。认为，由于XCS中的精度压力，这导致朝向特异性的额外偏好。提出了一个非常简单的实验以支持这一假设。使用计算预测（XCSF）的XC应用相同的方法，并绘制类似的结论。

著录项

来源
《Annual conference on Genetic and evolutionary computation》|2006年||共8页
会议地点
作者
Pier Luca Lanzi; Daniele Loiacono; PPier Luca Lanzi; PDaniele Loiacono;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类生物工程学（生物技术）;
关键词
gradient descent;

机译：梯度下降;

相似文献

外文文献
中文文献
专利

1. Swarm robots reinforcement learning convergence accuracy-based learning classifier systems with gradient descent (XCS-GD) [J] . Jie Shao, Haixia Lin, Kaibian Zhang Neural computing & applications . 2014,第2期

机译：群体机器人强化学习基于梯度下降的基于学习精度的学习分类器系统（XCS-GD）
2. Scalable reinforcement learning on Cray XC [J] . Kommaraju Ananda V, Maschhoff Kristyn J., Ringenburg Michael F., Concurrency, practice and experience . 2020,第20期

机译：Cray XC上可扩展的强化学习
3. An Analysis of Rule Deletion Scheme in XCS on Reinforcement Learning Problem [J] . Masaya Nakata, Tomoki Hamagami Journal of Advanced Computatioanl Intelligence and Intelligent Informatics . 2017,第5a125期

机译：XC在加固学习问题中规则删除方案分析
4. Standard and averaging reinforcement learning in XCS [C] . Pier Luca Lanzi, Daniele Loiacono, PPier Luca Lanzi, Annual conference on Genetic and evolutionary computation;Conference on Genetic and evolutionary computation . 2006

机译：XCS中的标准和平均强化学习
5. Reinforcement Learning and Recurrent Reinforcement Learning for Dynamic Portfolio Optimization [D] . Almahdi, Saud 2019

机译：强化学习和循环强化学习以实现动态资产组合优化
6. Neural Circuits Trained with Standard Reinforcement Learning Can Accumulate Probabilistic Information during Decision Making [O] . Nils Kurzawa, Christopher Summerfield, Rafal Bogacz -1

机译：经过标准强化学习训练的神经回路可以在决策过程中积累概率信息
7. Standard and averaging reinforcement learning in XCS [O] . Pier Luca Lanzi, Daniele Loiacono 2006

机译：XCs中的标准和平均强化学习

Standard and averaging reinforcement learning in XCS

摘要

著录项

相似文献

相关主题

期刊订阅