【24h】

Reinforcement learning in swarms that learn

机译:大量学习强化学习

获取原文

摘要

This paper introduces an approach to reinforcement learning by cooperating agents using a variation of the actor critic method. This is made possible by considering behavior patterns of swarms in the context of approximation spaces. Rough set theory introduced by Zdzislaw Pawlak in 1982 provides a ground for deriving pattern-based rewards within approximation spaces. The framework provided by an approximation space makes it possible to derive pattern-based reference rewards used to estimate action preferences. Approximation spaces are used to derive action-based reference rewards at the swarm intelligence level. Two different forms of the actor critic reinforcement learning method are considered as a part of a study of learning in real-time by a swarm. The contribution of this article is the presentation of a new actor critic method defined in the context of approximation spaces. An ecosystem designed to facilitate study of reinforcement learning by swarms is briefly described. In addition, the results of ecosystem experiments for two forums of the actor critic method are given.
机译:本文介绍了一种通过使用行为评论者方法的变体来通过合作主体进行强化学习的方法。通过在近似空间的情况下考虑群体的行为模式,可以做到这一点。 Zdzislaw Pawlak在1982年提出的粗糙集理论为在近似空间内得出基于模式的奖励提供了基础。近似空间提供的框架使得可以推导出用于估计动作偏好的基于模式的参考奖励。近似空间用于在群体智能级别上得出基于动作的参考奖励。演员批评家强化学习方法的两种不同形式被视为群体实时学习研究的一部分。本文的贡献是在近似空间的上下文中定义了一种新的演员评论家方法。简要描述了旨在促进群体强化学习研究的生态系统。此外,还给出了演员批评家方法两个论坛的生态系统实验结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号