A NEW LEARNING ALGORITHM FOR COOPERATIVE AGENTS IN GENERAL-SUM GAMES

机译：通用和游戏中合作代理的新学习算法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The development of multi-agent reinforcement learning in stochastic game has been slowed down in recent years.The main problem is that it is difficult to make the learning satisfy rationality and convergence at the same time.Here, the typical learning algorithms are analyzed firstly, and then a new method called Pareto-Q is prompted with the concept of Pareto optimum, which is rational.At the same time, social conventions are also introduced to promise the convergence of learning.At the last, experiments are presented to prove the good learning result of this algorithm.

机译：近年来，随机游戏中多智能体强化学习的发展一直在放缓，主要问题在于难以使学习同时满足合理性和收敛性。在此，首先分析典型的学习算法，然后以帕累托最优的概念提出了一种称为帕累托Q的新方法，该方法是合理的。与此同时，还引入了社会习俗来保证学习的收敛性。最后，通过实验证明了这一方法的优越性。该算法的学习结果。

著录项

来源
《Proceedings of the 2007 International Conference on Machine Learning and Cybernetics》|2007年|P.50-54|共5页
会议地点
作者
MEI-PING SONG; JU-BAI AN; RONG CHEN;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
MAS; Reinforcement learning; Pareto optimum; Social conventions;

机译：MAS;强化学习;帕累托最优;社会习俗;
入库时间 2022-08-26 14:56:20

相似文献

外文文献
中文文献
专利

1. Multi-agent Inverse Reinforcement Learning for Certain General-Sum Stochastic Games [J] . Lin Xiaomin, Adams Stephen C., Beling Peter A. The Journal of Artificial Intelligence Research . 2019,第期

机译：用于某一般性加速游戏的多功能逆钢筋学习
2. Multi-agent Inverse Reinforcement Learning for Certain General-sum Stochastic Games [J] . Xiaomin Lin, Stephen C. Adams, Peter A. Beling The Journal of Artificial Intelligence Research . 2019,第7期

机译：某些通用随机游戏的多代理逆钢筋学习
3. DECENTRALIZED LEARNING IN GENERAL-SUM MATRIX GAMES:AN L_(R-I) LAGGING ANCHOR ALGORITHM [J] . XIAOSONG LU, HOWARD M. SCHWARTZ International Journal of Innovative Computing Information and Control . 2013,第1期

机译：通用和游戏中的分散学习：L_（R-I）滞后锚算法
4. A NEW LEARNING ALGORITHM FOR COOPERATIVE AGENTS IN GENERAL-SUM GAMES [C] . MEI-PING SONG, JU-BAI AN, RONG CHEN International Conference on Machine Learning and Cybernetics . 2007

机译：一般和游戏中的合作代理新学习算法
5. The analysis and design of concurrent learning algorithms for cooperative multiagent systems. [D] . Panait, Liviu. 2007

机译：协同多主体系统并发学习算法的分析与设计。
6. Network partitioning algorithms as cooperative games [O] . Konstantin E. Avrachenkov, Aleksei Y. Kondratev, Vladimir V. Mazalov, -1

机译：网络分区算法作为合作游戏
7. Actor-Critic Algorithms for Learning Nash Equilibria in N-player General-Sum Games [O] . Prasad, H. L, Prashanth, L. A., Bhatnagar, Shalabh 2015

机译：N-player中学习纳什均衡的演员批评算法一般和游戏

A NEW LEARNING ALGORITHM FOR COOPERATIVE AGENTS IN GENERAL-SUM GAMES

摘要

著录项

相似文献

相关主题

期刊订阅