IEEE Congress on Evolutionary Computation

Effects of ensemble action selection with different usage of player's memory resource on the evolution of cooperative strategies for iterated prisoner's dilemma game

Abstract

In our previous study, we proposed an ensemble action selection model for the iterated prisoner's dilemma (IPD) game in which each player has multiple strategies with different memory lengths. Each strategy suggested an action based on its memory of the opponent's previous one, two, or three actions, and the player's action was chosen by majority vote. Under these settings, the evolution of cooperation was examined for various ensembles (i.e., various combinations of strategies). In this paper, we extend our ensemble model to a more general case in which strategies differ in how they use memory. Each strategy of a player has a memory of the opponent's and/or the player's own previous actions. For example, one strategy may remember the opponent's previous action and the player's previous two actions, while another may remember only the player's previous three actions. Various combinations of strategies for ensemble action selection are examined. Computational experiments show that ensemble action selection enhances the evolution of cooperation. They also show that cooperation does not evolve among strategies with no memory of the opponent's actions. An interesting observation is that cooperation does evolve among players that combine the following three strategies: two strategies with no memory of the opponent's actions and a single strategy with a memory of both the player's and the opponent's actions.
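
The majority-vote mechanism described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the lookup-table encoding, the helper names (`make_table`, `suggest`, `ensemble_action`), the three example rules, and the assumption that histories are already at least as long as each strategy's memory are all hypothetical choices made for illustration.

```python
from collections import Counter
from itertools import product

C, D = "C", "D"  # cooperate, defect


def make_table(own, opp, rule):
    """Build a lookup table over every (own-memory, opponent-memory) pair.

    `own` and `opp` are the numbers of remembered own/opponent actions;
    `rule` is a hypothetical function mapping the two memories to an action.
    """
    return {
        (o, p): rule(o, p)
        for o in product((C, D), repeat=own)
        for p in product((C, D), repeat=opp)
    }


def suggest(strategy, own_hist, opp_hist):
    """Return the action one strategy suggests from its slice of the histories.

    Assumes each history holds at least as many actions as the strategy remembers.
    """
    own, opp, table = strategy
    key = (
        tuple(own_hist[len(own_hist) - own:]),
        tuple(opp_hist[len(opp_hist) - opp:]),
    )
    return table[key]


def ensemble_action(strategies, own_hist, opp_hist):
    """Choose the action suggested by the majority of the strategies.

    With an odd number of strategies and two actions, no tie can occur.
    """
    votes = Counter(suggest(s, own_hist, opp_hist) for s in strategies)
    return votes.most_common(1)[0][0]


# Illustrative ensemble: two strategies with no memory of the opponent,
# plus one strategy remembering both the player's and the opponent's last action.
repeat_own = (1, 0, make_table(1, 0, lambda o, p: o[0]))
defect_after_three = (3, 0, make_table(3, 0, lambda o, p: D if o == (D, D, D) else C))
copy_opponent = (1, 1, make_table(1, 1, lambda o, p: p[0]))
ensemble = [repeat_own, defect_after_three, copy_opponent]

action = ensemble_action(ensemble, [C, C, C], [C, C, D])
```

In this example the two opponent-blind strategies vote to cooperate and outvote the third, so `action` is `C`. The evolutionary part of the model (how the tables themselves are evolved) is outside the scope of this sketch.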