首页> 外国专利> DISTRIBUTED STRENGTHENING LEARNING METHOD FOR INTEGRATING EXPERIENCE STRENGTHENING TYPE STRENGTHENING LEARNING METHOD AND ENVIRONMENT IDENTIFICATION TYPE STRENGTHENING LEARNING METHOD BY USING MULTI-AGENT MODEL

DISTRIBUTED STRENGTHENING LEARNING METHOD FOR INTEGRATING EXPERIENCE STRENGTHENING TYPE STRENGTHENING LEARNING METHOD AND ENVIRONMENT IDENTIFICATION TYPE STRENGTHENING LEARNING METHOD BY USING MULTI-AGENT MODEL

机译：综合多经验模型的经验强化型强化学习方法与环境识别型强化学习方法的分布式强化学习方法

页面导航

摘要
著录项
相似文献

摘要

PROBLEM TO BE SOLVED: To reduce trial frequency required for learning and to provide the system with a robust property in a dynamic environmental change by integrating an experience strengthening type strengthening learning method and an environment identification type strengthening learning method by using a multi-agent model for strengthening learning to be executed so as to be autonomously applied to an environment. ;SOLUTION: When plural candidates exist, an environment strengthening agent selects one of plural candidates at random (S11). The moved state is registered in an episode registering table (S13) and whether a reward is paid or not is checked (S15). When the reward is paid, an environment identification agent is generated (S17), and when the reward is not paid, whether the candidate meets a storage module or not is checked (S19). When the candidate does not meet the storage module, the same processing is repeated (S11), but when the candidate meets the storage module and when the environment identification agent is generated, strengthening values are set up in respective states registered in the episode registering table and then the table is initialized (S25).;COPYRIGHT: (C)2000,JPO

机译：要解决的问题：通过使用多智能体模型整合体验增强型强化学习方法和环境识别型强化学习方法，以减少学习所需的试用频率并在动态环境变化中为系统提供强大的属性。为了加强要执行的学习以便自主地应用于环境。 ;解决方案：当存在多个候选者时，环境增强剂从多个候选者中随机选择一个（S11）。将移动状态登记在情节登记表中（S13），并检查是否支付了奖励（S15）。当支付奖励时，生成环境识别代理（S17），并且当不支付奖励时，检查候选者是否满足存储模块（S19）。当候选者不满足存储模块时，重复相同的处理（S11），但是当候选者满足存储模块时并且当生成环境识别代理时，在情节注册表中注册的各个状态中设置加强值。然后初始化表格（S25）。COPYRIGHT：（C）2000，JPO

著录项

公开/公告号JP2000020494A

专利类型
公开/公告日2000-01-21

原文格式PDF
申请/专利权人 NIPPON TELEGR & TELEPH CORP NTT;
展开▼

申请/专利号JP19980192101
发明设计人 KURIHARA SATOSHI;SUGAWARA TOSHIHARU;
展开▼

申请日1998-07-07
分类号G06F15/18;G06F9/44;
国家 JP
入库时间 2022-08-22 02:01:43

相似文献

专利
外文文献
中文文献