A Team Robot Decision-Making Method Based on Multi-Agent Reinforcement Learning

Abstract

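The invention relates to a team robot decision-making method based on multi-agent reinforcement learning, comprising the following steps: using the DQN reinforcement learning method, initialize the network, randomly generate its weights, and initialize the experience replay buffer; train the network, using an ε-greedy policy to generate the robot's next action at each interaction with the environment; store the transition sample obtained after executing the action in the experience replay buffer, and randomly sample part of the stored data for the network update; update the network by gradient descent; and repeat the above steps so that, through continued interaction with the environment, a value-function network with strong decision-making ability is trained. By training the team robots' decision-making ability with DQN, the invention avoids the overly complex state and action spaces that multiple agents introduce, and gives the robots better decision-making ability.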
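The loop described in the abstract (random weight initialization, an experience replay buffer, ε-greedy action selection, random minibatch sampling, and a gradient-descent update) can be illustrated with a minimal, dependency-free sketch. It is not the patent's implementation. Several things here are assumptions made only for illustration: a one-hot linear Q-function stands in for the deep network (so this reduces to Q-learning with replay rather than a true DQN), the environment is a hypothetical single-agent five-state chain rather than a team of robots, and the hyperparameters and the ε decay schedule are arbitrary choices. The names `env_step`, `epsilon_greedy`, and `train` are hypothetical.

```python
import random

N_STATES, N_ACTIONS = 5, 2          # hypothetical chain; action 0 = left, 1 = right
GAMMA, LR = 0.9, 0.1                # discount factor and learning rate (assumed)
BATCH_SIZE, BUFFER_CAP = 16, 500    # minibatch size and replay capacity (assumed)

def env_step(state, action):
    """Toy environment (an assumption): reward 1 on reaching the rightmost state."""
    nxt = max(0, state - 1) if action == 0 else min(N_STATES - 1, state + 1)
    done = nxt == N_STATES - 1
    return nxt, (1.0 if done else 0.0), done

def epsilon_greedy(weights, state, epsilon, rng):
    """Explore with probability epsilon, otherwise take the greedy action."""
    if rng.random() < epsilon:
        return rng.randrange(N_ACTIONS)
    q_values = weights[state]
    return max(range(N_ACTIONS), key=lambda a: q_values[a])

def train(episodes=300, seed=0):
    rng = random.Random(seed)
    # Randomly initialised weights; with one-hot states, Q(s, a) = weights[s][a].
    weights = [[rng.uniform(-0.01, 0.01) for _ in range(N_ACTIONS)]
               for _ in range(N_STATES)]
    replay = []  # experience replay buffer of (s, a, r, s', done) transitions
    for episode in range(episodes):
        epsilon = max(0.1, 1.0 - episode / 150)  # assumed linear decay schedule
        state, done, steps = 0, False, 0
        while not done and steps < 50:
            action = epsilon_greedy(weights, state, epsilon, rng)
            nxt, reward, done = env_step(state, action)
            replay.append((state, action, reward, nxt, done))
            if len(replay) > BUFFER_CAP:
                replay.pop(0)  # drop the oldest transition
            # Randomly sample stored transitions for the update.
            batch = rng.sample(replay, min(BATCH_SIZE, len(replay)))
            for s, a, r, s2, d in batch:
                target = r if d else r + GAMMA * max(weights[s2])
                td_error = target - weights[s][a]
                # Gradient-descent step on 0.5 * td_error**2 w.r.t. weights[s][a],
                # treating the target as a constant.
                weights[s][a] += LR * td_error
            state, steps = nxt, steps + 1
    return weights

if __name__ == "__main__":
    q = train()
    policy = [max(range(N_ACTIONS), key=lambda a: q[s][a]) for s in range(N_STATES - 1)]
    print("greedy action per non-terminal state:", policy)
```

A full DQN would replace the weight table with a neural network and usually add a separate target network; the abstract does not mention the latter, so it is left out here.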

Bibliographic Data

Publication No.: CN111898728A

Patent type: Invention patent
Publication date: 2020-11-06

Original format: PDF
Applicant/Patentee: Southeast University (东南大学)

Application No.: CN202010490427.X
Inventor: 田宇飞 (Tian Yufei)

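Filing date: 2020-06-02
Classification: G06N3/04(20060101); G06N20/20(20190101); B25J9/16(20060101)
Agency: 32206 南京众联专利代理有限公司
Agent: 杜静静 (Du Jingjing)
Address: No. 2 Sipailou, Xuanwu District, Nanjing, Jiangsu 210096
Database entry time: 2023-06-19 08:00:20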
