首页> 外文会议>International Conference on Artificial Neural Networks >Leveraging Domain Knowledge for Reinforcement Learning Using MMC Architectures
【24h】

Leveraging Domain Knowledge for Reinforcement Learning Using MMC Architectures

机译:利用领域知识进行MMC架构的强化学习

获取原文

摘要

Despite the success of reinforcement learning methods in various simulated robotic applications, end-to-end training suffers from extensive training times due to high sample complexity and does not scale well to realistic systems. In this work, we speed up reinforcement learning by incorporating domain knowledge into policy learning. We revisit an architecture based on the mean of multiple computations (MMC) principle known from computational biology and adapt it to solve a "reacher task". We approximate the policy using a simple MMC network, experimentally compare this idea to end-to-end deep learning architectures, and show that our approach reduces the number of interactions required to approximate a suitable policy by a factor of ten.
机译:尽管强化学习方法在各种模拟机器人应用中都取得了成功,但是由于高样本复杂性,端到端训练仍然要花费大量的训练时间,并且无法很好地适应实际系统。在这项工作中,我们通过将领域知识整合到策略学习中来加快强化学习。我们基于计算生物学中已知的多次计算(MMC)原理重新审视体系结构,并使其适应于解决“扩展任务”。我们使用一个简单的MMC网络对策略进行近似,通过实验将该思想与端到端深度学习架构进行比较,并表明我们的方法将近似于合适策略所需的交互次数减少了十分之一。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号