IEEE International Conference on Acoustics, Speech and Signal Processing

Self-Inference Of Others’ Policies For Homogeneous Agents In Cooperative Multi-Agent Reinforcement Learning



Abstract

Multi-agent reinforcement learning (MARL) has been widely applied to cooperative tasks in which multiple agents are trained to collaboratively achieve a global goal. During the training stage of MARL, inferring the policies of other agents can improve coordination efficiency. However, most existing policy inference methods require each agent to model every other agent separately, so resource consumption grows quadratically with the number of agents. In addition, inferring an agent's policy solely from its observations and actions may cause agent modeling to fail. To address these issues, we propose to let each agent infer the others' policies with its own model, given that the agents are homogeneous. This self-inference approach significantly reduces computation and storage consumption while guaranteeing the quality of agent modeling. Experimental results demonstrate the effectiveness of the proposed approach.
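The core idea can be illustrated with a toy sketch (not the paper's implementation; the linear `Policy` class, dimensions, and function names below are hypothetical). Under separate opponent modeling, each of the N agents maintains N-1 models of its teammates, so the total model count grows quadratically; under self-inference, each homogeneous agent reuses its own policy to predict a teammate's action from that teammate's observation, keeping the count linear:

```python
import numpy as np

rng = np.random.default_rng(0)
OBS_DIM, N_ACTIONS, N_AGENTS = 8, 4, 5  # toy sizes, chosen for illustration


class Policy:
    """A toy linear softmax policy; stands in for an agent's policy network."""

    def __init__(self):
        self.w = rng.normal(size=(OBS_DIM, N_ACTIONS))

    def action_probs(self, obs):
        logits = obs @ self.w
        e = np.exp(logits - logits.max())  # stable softmax
        return e / e.sum()


# Separate modeling: each agent keeps N-1 opponent models
# -> N * (N - 1) models in total (quadratic growth).
separate_models = {(i, j): Policy()
                   for i in range(N_AGENTS)
                   for j in range(N_AGENTS) if i != j}

# Self-inference: each homogeneous agent keeps only its own policy
# -> N models in total (linear growth).
own_policies = [Policy() for _ in range(N_AGENTS)]


def self_infer(i, obs_of_j):
    """Agent i predicts teammate j's action distribution by feeding
    j's observation through agent i's own policy."""
    return own_policies[i].action_probs(obs_of_j)


print(len(separate_models))  # 20 models under separate modeling (N=5)
print(len(own_policies))     # 5 models under self-inference
probs = self_infer(0, rng.normal(size=OBS_DIM))
```

The sketch only counts models; the paper's actual contribution is training with this shared-model inference, which the toy linear policy does not attempt to reproduce.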

