Model-based reinforcement learning for a multi-player card game with partial observability

机译：具有部分可观察性的多人纸牌游戏的基于模型的强化学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This article presents a model-based reinforcement learning (RL) scheme for a card game, "Hearts". Since this is a large-scale multi-player game with partial observability, effective state estimation and optimal control based on an environmental model are required. In our method, the learning agent is controlled by a one-step-ahead utility prediction using opponent agents' models. The computational intractability is overcome by the sampling method over a specific subspace. Simulation results show that our model-based RL method can produce an agent comparable to a human expert for this realistic problem.

机译：本文介绍了一种基于模型的纸牌游戏“心”的强化学习（RL）方案。由于这是具有部分可观察性的大型多人游戏，因此需要有效的状态估计和基于环境模型的最佳控制。在我们的方法中，通过使用对手代理模型的一步一步效用预测来控制学习代理。通过在特定子空间上的采样方法可以克服计算上的棘手性。仿真结果表明，针对该现实问题，基于模型的RL方法可以产生与人类专家相当的代理。

著录项

来源
《Intelligent Agent Technology, IEEE/WIC/ACM International Conference on》|2005年|P.467-470|共4页
会议地点
作者
Fujita H.; Ishii S.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类工业技术;
关键词
computer games; learning (artificial intelligence); multi-agent systems; environmental model; model-based reinforcement learning; multiplayer card game; one-step-ahead utility prediction; optimal control; partial observability; sampling method; state estimation;

机译：电脑游戏;学习（人工智能）;多主体系统;环境模型;基于模型的强化学习;多人纸牌游戏;一步一步效用预测;最优控制;局部可观性;采样方法;状态估计;

相似文献

外文文献
中文文献
专利

1. Model-Based Reinforcement Learning for Partially Observable Games with Sampling-Based State Estimation [J] . Hajime Fujita, Shin Ishii Neural computation . 2007,第11期

机译：具有基于采样状态估计的部分可观察游戏的基于模型的强化学习
2. Temporally extended features in model-based reinforcement learning with partial observability [J] . Lieck Robert, Toussaint Marc Neurocomputing . 2016,第juna5期

机译：具有部分可观察性的基于模型的强化学习中的临时扩展功能
3. Cooperative control for multi-player pursuit-evasion games with reinforcement learning [J] . Wang Yuanda, Dong Lu, Sun Changyin Neurocomputing . 2020,第Octa28期

机译：利用加固学习的多人追求逃避游戏的合作控制
4. Model-based reinforcement learning for a multi-player card game with partial observability [C] . Fujita H., Ishii S. IEEE/WIC/ACM International Conference on Intelligent Agent Technology . 2005

机译：基于模型的强化学习，用于部分可观察性的多人纸牌游戏
5. Understanding Model-Based Reinforcement Learning and its Application in Safe Reinforcement Learning [D] . Hu, Dingcheng . 2019

机译：了解基于模型的强化学习及其在安全强化学习中的应用
6. PRISM-games: verification and strategy synthesis for stochastic multi-player games with multiple objectives [O] . Marta Kwiatkowska, David Parker, Clemens Wiltsche -1

机译：PRISM游戏：具有多个目标的随机多玩家游戏的验证和策略综合
7. Model-Based Reinforcement Learning for a Multi-Player Card Game with Partial Observability [O] . Hajime Fujita, Shin Ishii 2005

机译：基于模型的具有部分可观察性的多人纸牌游戏强化学习

Model-based reinforcement learning for a multi-player card game with partial observability

摘要

著录项

相似文献

相关主题

期刊订阅