首页> 外国专利> DISTRIBUTED STRENGTHENING LEARNING METHOD FOR INTEGRATING EXPERIENCE STRENGTHENING TYPE STRENGTHENING LEARNING METHOD AND ENVIRONMENT IDENTIFICATION TYPE STRENGTHENING LEARNING METHOD BY USING MULTI-AGENT MODEL

DISTRIBUTED STRENGTHENING LEARNING METHOD FOR INTEGRATING EXPERIENCE STRENGTHENING TYPE STRENGTHENING LEARNING METHOD AND ENVIRONMENT IDENTIFICATION TYPE STRENGTHENING LEARNING METHOD BY USING MULTI-AGENT MODEL

机译:综合多经验模型的经验强化型强化学习方法与环境识别型强化学习方法的分布式强化学习方法

摘要

PROBLEM TO BE SOLVED: To reduce trial frequency required for learning and to provide the system with a robust property in a dynamic environmental change by integrating an experience strengthening type strengthening learning method and an environment identification type strengthening learning method by using a multi-agent model for strengthening learning to be executed so as to be autonomously applied to an environment. ;SOLUTION: When plural candidates exist, an environment strengthening agent selects one of plural candidates at random (S11). The moved state is registered in an episode registering table (S13) and whether a reward is paid or not is checked (S15). When the reward is paid, an environment identification agent is generated (S17), and when the reward is not paid, whether the candidate meets a storage module or not is checked (S19). When the candidate does not meet the storage module, the same processing is repeated (S11), but when the candidate meets the storage module and when the environment identification agent is generated, strengthening values are set up in respective states registered in the episode registering table and then the table is initialized (S25).;COPYRIGHT: (C)2000,JPO
机译:要解决的问题:通过使用多智能体模型整合体验增强型强化学习方法和环境识别型强化学习方法,以减少学习所需的试用频率并在动态环境变化中为系统提供强大的属性。为了加强要执行的学习以便自主地应用于环境。 ;解决方案:当存在多个候选者时,环境增强剂从多个候选者中随机选择一个(S11)。将移动状态登记在情节登记表中(S13),并检查是否支付了奖励(S15)。当支付奖励时,生成环境识别代理(S17),并且当不支付奖励时,检查候选者是否满足存储模块(S19)。当候选者不满足存储模块时,重复相同的处理(S11),但是当候选者满足存储模块时并且当生成环境识别代理时,在情节注册表中注册的各个状态中设置加强值。然后初始化表格(S25)。COPYRIGHT:(C)2000,JPO

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号