REINFORCEMENT LEARNING SYSTEM SUPPORTING DELAY COMPENSATION AND REINFORCEMENT LEARNING METHOD THEREOF

Abstract

The present invention relates to a reinforcement learning method that supports delayed rewards (the "delay compensation" of the title) in a reinforcement learning system. The method, which uses a reinforcement learning agent, includes the steps of: receiving from the environment system an immediate reward value and a delayed reward value associated with a control action; generating a final reward value for that control action by considering the received immediate and delayed reward values together; and generating a transition tuple that includes the final reward value and applying the generated transition tuple to the reinforcement learning agent to perform reinforcement learning. Because a reward value that the environment system can only measure after a delay is applied to the control action directly related to it, the performance and speed of the reinforcement learning system can be increased.

COPYRIGHT KIPO 2020
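The three steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `DelayedRewardBuffer` class, the `step_id` key used to match a delayed reward to its control action, and the weighted-sum combination rule (`delayed_weight`) are all assumptions made for illustration, since the abstract does not say how the two reward values are combined.

```python
from collections import namedtuple

# Standard (state, action, reward, next_state) transition; field names are illustrative.
Transition = namedtuple("Transition", ["state", "action", "reward", "next_state"])


class DelayedRewardBuffer:
    """Holds each step until its delayed reward arrives (hypothetical helper)."""

    def __init__(self, delayed_weight=1.0):
        # Assumption: final reward = immediate + delayed_weight * delayed.
        self.delayed_weight = delayed_weight
        self._pending = {}

    def record_step(self, step_id, state, action, immediate_reward, next_state):
        """Store a step whose delayed reward has not been measured yet."""
        self._pending[step_id] = (state, action, immediate_reward, next_state)

    def resolve(self, step_id, delayed_reward):
        """Credit the delayed reward to the step that caused it and build the tuple."""
        state, action, immediate_reward, next_state = self._pending.pop(step_id)
        final_reward = immediate_reward + self.delayed_weight * delayed_reward
        return Transition(state, action, final_reward, next_state)

    def pending_count(self):
        """Number of steps still waiting for their delayed reward."""
        return len(self._pending)


buffer = DelayedRewardBuffer(delayed_weight=0.5)
buffer.record_step(step_id=0, state="s0", action="a0", immediate_reward=1.0, next_state="s1")
buffer.record_step(step_id=1, state="s1", action="a1", immediate_reward=0.0, next_state="s2")

# The environment reports the delayed reward for step 0 only after step 1 has run.
transition = buffer.resolve(step_id=0, delayed_reward=4.0)
print(transition)
```

In this sketch the finished `transition` would then be passed to whatever update routine the agent uses (for example, appended to a replay buffer). Holding a step back until its delayed reward is known means the reward is credited to the action that produced it rather than to whichever action happens to be running when the measurement arrives.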