Brazilian Conference on Intelligent Systems

Object-Oriented Reinforcement Learning in Cooperative Multiagent Domains



Abstract

Although Reinforcement Learning methods have successfully been applied to increasingly large problems, scalability remains a central issue. While Object-Oriented Markov Decision Processes (OO-MDP) are used to exploit regularities in a domain, Multiagent System (MAS) methods are used to divide workload amongst multiple agents. In this work we propose a novel combination of OO-MDP and MAS, called Multiagent Object-Oriented Markov Decision Process (MOO-MDP), so as to accrue the benefits of both strategies and be able to better address scalability issues. We present an algorithm to solve deterministic cooperative MOO-MDPs, and prove that it learns optimal policies while reducing the learning space by exploiting state abstractions. We experimentally compare our results with earlier approaches and show advantages with regard to discounted cumulative reward, number of steps to fulfill the task, and Q-table size.
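The abstract's central claim is that exploiting object-oriented state abstraction shrinks the learning space (and hence the Q-table) while still permitting optimal policies. The sketch below is purely illustrative and is not the paper's MOO-MDP algorithm: it shows plain tabular Q-learning on a deterministic gridworld where the state is abstracted to the agent-to-goal offset (an attribute of the object pair) rather than the pair of absolute positions, so the Q-table covers at most 80 abstract states instead of 625 concrete agent/goal configurations. All names (`abstract_state`, `q_learn`) are invented for this example.

```python
# Illustrative sketch (NOT the paper's algorithm): tabular Q-learning over an
# object-attribute abstraction of the state, shrinking the Q-table.
import random
from collections import defaultdict

def abstract_state(objects):
    """Map a concrete state (objects with absolute positions) to an abstract
    state keyed only by the relevant relation: the agent-to-goal offset."""
    agent, goal = objects["agent"], objects["goal"]
    return (goal[0] - agent[0], goal[1] - agent[1])

ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # moves on the grid

def q_learn(episodes=500, alpha=0.5, gamma=0.9, eps=0.1, size=5):
    """Epsilon-greedy Q-learning on a size x size deterministic gridworld."""
    Q = defaultdict(float)
    rng = random.Random(0)
    for _ in range(episodes):
        agent = (rng.randrange(size), rng.randrange(size))
        goal = (rng.randrange(size), rng.randrange(size))
        for _ in range(50):  # step cap per episode
            s = abstract_state({"agent": agent, "goal": goal})
            if s == (0, 0):          # goal reached
                break
            if rng.random() < eps:   # explore
                a = rng.randrange(len(ACTIONS))
            else:                    # exploit
                a = max(range(len(ACTIONS)), key=lambda i: Q[(s, i)])
            dx, dy = ACTIONS[a]
            agent = (min(max(agent[0] + dx, 0), size - 1),
                     min(max(agent[1] + dy, 0), size - 1))
            s2 = abstract_state({"agent": agent, "goal": goal})
            reward = 1.0 if s2 == (0, 0) else -0.01
            best_next = 0.0 if s2 == (0, 0) else max(
                Q[(s2, i)] for i in range(len(ACTIONS)))
            Q[(s, a)] += alpha * (reward + gamma * best_next - Q[(s, a)])
    return Q
```

Because only the relative offset is stored, the same Q-values generalize across every absolute placement of agent and goal, which is the kind of regularity-exploiting compression the abstract attributes to OO-MDP state abstraction.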
