首页> 外国专利> REINFORCEMENT LEARNING SYSTEM SUPPORTING DELAY COMPENSATION AND REINFORCEMENT LEARNING METHOD THEREOF

REINFORCEMENT LEARNING SYSTEM SUPPORTING DELAY COMPENSATION AND REINFORCEMENT LEARNING METHOD THEREOF

机译：支持延迟补偿的强化学习系统及其强化学习方法

页面导航

摘要
著录项
相似文献

摘要

The present invention relates to a reinforcement learning method for supporting delay compensation in a reinforcement learning system. The reinforcement learning method using a reinforcement learning agent of the present invention includes the steps of: receiving an immediate compensation value and a delay compensation value associated with the control action from the environmental system; generating a final compensation value corresponding to the control action by considering the received immediate compensation value and the delay compensation value together; and generating a transition tuple including the final reward value, and applying the generated transition tuple to the reinforcement learning agent to perform reinforcement learning. Since the delay compensation value measured by being delayed in the environmental system can be applied to the directly related control action, the performance and speed of the reinforcement learning system can be increased.;COPYRIGHT KIPO 2020

机译：本发明涉及一种用于在强化学习系统中支持延迟补偿的强化学习方法。使用本发明的强化学习代理的强化学习方法包括以下步骤：从环境系统接收与控制动作相关的立即补偿值和延迟补偿值;以及通过一起考虑接收到的立即补偿值和延迟补偿值来产生与控制动作相对应的最终补偿值;生成包括最终奖励值的过渡元组，并将生成的过渡元组应用于强化学习代理以进行强化学习。由于可以通过在环境系统中延迟测得的延迟补偿值应用于直接相关的控制动作，因此可以提高强化学习系统的性能和速度。; COPYRIGHT KIPO 2020

著录项

公开/公告号KR20200061653A

专利类型
公开/公告日2020-06-03

原文格式PDF
申请/专利权人 ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE;
展开▼

申请/专利号KR20180147199
发明设计人 CHO CHUNG LAE;SHIN SEUNG JAE;YOON SEUNG HYUN;JEON HONG SEOK;
展开▼

申请日2018-11-26
分类号G06N99;
国家 KR
入库时间 2022-08-21 11:06:56

相似文献

专利
外文文献
中文文献