
Decaying Simulation Strategies


Abstract

The aim of general game playing (GGP) is to create programs capable of playing a wide range of different games at an expert level, given only the rules of the game. The most successful GGP programs currently employ simulation-based Monte Carlo tree search (MCTS). The performance of MCTS depends heavily on the simulation strategy used. In this paper, we investigate the application of a decay factor to two domain-independent simulation strategies: the N-gram selection technique (NST) and the move-average sampling technique (MAST). Three decay factor methods, called move decay, batch decay, and simulation decay, are applied. Furthermore, a combination of move decay and simulation decay is also tested. The decay variants are implemented in the GGP program CadiaPlayer. Four types of games are used: turn-taking, simultaneous-move, one-player, and multiplayer. Except for one-player games, experiments show that decaying can significantly improve the performance of both the NST and MAST simulation strategies.
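To make the idea of decaying a move-average table concrete, here is a minimal Python sketch. It is not CadiaPlayer's implementation: the class `DecayingMoveStats`, its method names, the epsilon-greedy selection rule, the default value of 0.5 for unseen moves, and the parameters `gamma` and `epsilon` are all assumptions chosen for illustration. The sketch keeps a reward sum and a visit weight per move, and a decay step multiplies both by a factor so that older simulations count less in the average.

```python
import random
from collections import defaultdict


class DecayingMoveStats:
    """Hypothetical MAST-style table: per-move averages with decayable weight."""

    def __init__(self):
        self.reward_sum = defaultdict(float)  # accumulated (decayed) rewards
        self.visits = defaultdict(float)      # accumulated (decayed) visit weight

    def update(self, moves, reward):
        """Credit every move played in one simulation with its final reward."""
        for move in moves:
            self.reward_sum[move] += reward
            self.visits[move] += 1.0

    def decay(self, gamma):
        """Scale all statistics by gamma (assumed 0 <= gamma <= 1)."""
        for move in self.visits:
            self.reward_sum[move] *= gamma
            self.visits[move] *= gamma

    def average(self, move, default=0.5):
        """Decayed average reward; unseen moves get an assumed default."""
        n = self.visits.get(move, 0.0)
        return self.reward_sum[move] / n if n > 0 else default

    def choose(self, legal_moves, epsilon=0.1, rng=random):
        """Epsilon-greedy playout choice (one possible selection rule)."""
        if rng.random() < epsilon:
            return rng.choice(legal_moves)
        return max(legal_moves, key=self.average)


stats = DecayingMoveStats()
stats.update(["a", "b"], 1.0)   # simulation 1: moves a and b, reward 1
stats.update(["a"], 0.0)        # simulation 2: move a, reward 0
stats.decay(0.5)                # older results now count half as much
stats.update(["a"], 1.0)        # a fresh result outweighs the decayed ones
print(stats.average("a"))       # higher than the undecayed average of 2/3
```

Under one reading of the method names in the abstract (an assumption, since the abstract does not define them), the three variants would differ only in when `decay` is called: after each move played in the actual game, after a fixed batch of simulations, or after every simulation.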
