首页> 外文会议>Machine Learning and Applications, 2009. ICMLA '09 >Theoretical and Empirical Analysis of Reward Shaping in Reinforcement Learning
【24h】

Theoretical and Empirical Analysis of Reward Shaping in Reinforcement Learning

机译:强化学习中奖赏塑形的理论与实证分析

获取原文

摘要

Reinforcement learning suffers scalability problems due to the state space explosion and the temporal credit assignment problem. Knowledge-based approaches have received a significant attention in the area. Reward shaping is a particular approach to incorporate domain knowledge into reinforcement learning. Theoretical and empirical analysis of this paper reveals important properties of this principle, especially the influence of the reward type, MDP discount factor, and the way of evaluating the potential function on the performance.
机译:增强学习由于状态空间爆炸和时间信用分配问题而遭受可伸缩性问题。基于知识的方法在该领域受到了极大的关注。奖励塑造是一种将领域知识纳入强化学习的特殊方法。本文的理论和经验分析揭示了该原理的重要性质,特别是奖励类型,MDP折现因子的影响以及潜在功能对绩效的评估方式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号