首页> 中国专利> 一种多Agent深度强化学习算法

一种多Agent深度强化学习算法

页面导航

摘要
著录项
相似文献

摘要

本发明公开了一种多Agent深度强化学习算法，包括：S1：学习的策略在执行时只使用本地信息，即它们自己的观察结果；S2：智能体之间的通信方法不做任何结构上的假设，即不假设一个可区分的通信渠道；S3：leader层网络只输入全局智能体的状态信息，只作用于每个智能体的输出动作值，并不参与每个智能体的策略执行，既保持每个智能体的独立性，又保证所有智能体群体之间的通信。本发明通过在分布式执行之前加入集中式预判分配权重，增强了智能体群体之间的决策通信，提高了Leader_MADDPG在多变的环境关系中智能体训练过程中的稳定性和训练后的鲁棒性。

著录项

公开/公告号CN113902087A

专利类型发明专利
公开/公告日2022-01-07

原文格式PDF
申请/专利权人吉林建筑大学;
展开▼

申请/专利号CN202111240522.5
发明设计人王旭;张宇;郭秀娟;徐勇;尤天舒;富倩;孙伟;刘钢;戴传祗;吴程巍;
展开▼

申请日2021-10-25
分类号G06N3/00(20060101);G06N3/04(20060101);G06N3/08(20060101);G06N20/00(20190101);
代理机构61248 西安合创非凡知识产权代理事务所(普通合伙);
代理人支思迪
地址 130118 吉林省长春市新城大街5088号
入库时间 2023-06-19 13:35:32

相似文献

专利
中文文献
外文文献

1. 一种多Agent深度强化学习算法 [P] . 中国专利： CN113902087A . 2022-01-07
2. 一种多Agent深度强化学习的单件作业车间调度方法 [P] . 中国专利： CN111985672B . 2021.08.27
3. Designed specifically for the Real Estate Industry, The Digital Agent Finder is a digital Chatbot questionnaire that shortlists the ideal real estate agent/s for the client. By answering seven simple questions, the Digital Agent Finder will shortlist the most suitable agent/s for the client and his/her property, producing a minimum of one and a maximum of three agent/s. The Digital Agent Finder has been divided into three carefully considered categories to ensure accurate matches: (1) about the client’s ideal agent, (2) about the property, and (3) the clients’ interests. [P] . AU2020101404A4 . 2020-08-20

机译： Digital Agent Finder是专门为房地产行业设计的，是一种数字聊天机器人问卷，可以为客户列出理想的房地产经纪人。通过回答七个简单的问题，Digital Agent Finder将为客户和他/她的财产列出最合适的代理商，从而产生最少一个代理商和最多三个代理商。 Digital Agent Finder已分为三类经过仔细考虑的类别，以确保准确匹配：（1）关于客户的理想代理，（2）关于财产，以及（3）客户的利益。
4. Ectoparasiticide Agent, Process for the production of said Agent using a compound against insects and parasites and ticks monofagas production for that Agent. [P] . AR003433A1 . 1998-08-05

机译： Ectoparasiticide Agent，使用一种抗昆虫和寄生虫的化合物生产所述Agent的过程，并s杀该Agent的单fagas生产。
5. System for the energy saving pre-cooling/heating training of an air conditioner using deep reinforcement learning algorithm based on the user location living climate condition and method thereof [P] . 韩国专利： KR102131414B1 . 2020-07-08

机译：基于用户所在地生活气候条件的深度强化学习算法的空调节能预冷/热训系统及方法