首页> 外文会议>International Conference on Computer Vision, Image and Deep Learning >Air Combat Strategies Generation of CGF Based on MADDPG and Reward Shaping
【24h】

Air Combat Strategies Generation of CGF Based on MADDPG and Reward Shaping

机译:基于MADDPG的CGF发电和奖励塑造的空战策略

获取原文

摘要

The intelligence of the computer-generated force (CGF) is one of the important problems in air combat simulation. The air combat of CGF is modeled as a two-player zero-sum Markov game. An air combat strategies generation method of CGF is proposed to use a multi-agent deep deterministic policy gradient (MADDPG) algorithm. This paper proposes a potential-based reward shaping method to improve the efficiency of the air combat policy generation algorithm. Finally, the efficiency of the air combat policy generation algorithm and the intelligence level of the resulting policy is verified through simulation experiments. The simulation results show that this method has good convergence and better air combat performance with the strategy obtained by the DDPG algorithm.
机译:计算机生成的力(CGF)的智能是空战模拟中的重要问题之一。 CGF的空战被建模为双人零和马尔可夫游戏。建议使用CGF的空战策略生成方法,以使用多智能体深度确定性政策梯度(MADDPG)算法。本文提出了一种基于潜在的奖励塑形方法,提高了空战策略生成算法的效率。最后,通过仿真实验验证了空战策略生成算法的效率和所产生的策略的智能水平。仿真结果表明,该方法具有良好的收敛性和通过DDPG算法获得的策略具有良好的收敛性和更好的空调性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号