Air Combat Strategies Generation of CGF Based on MADDPG and Reward Shaping

Abstract

The intelligence of the computer-generated force (CGF) is an important problem in air combat simulation. In this paper, the air combat of CGF is modeled as a two-player zero-sum Markov game, and a method for generating CGF air combat strategies with the multi-agent deep deterministic policy gradient (MADDPG) algorithm is proposed. A potential-based reward shaping method is introduced to improve the efficiency of the air combat policy generation algorithm. Finally, the efficiency of the algorithm and the intelligence level of the resulting policy are verified through simulation experiments. The simulation results show that the method converges well and achieves better air combat performance than the strategy obtained with the DDPG algorithm.
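
The abstract does not give the potential function or the reward used in the paper. As an illustration only, the standard form of potential-based reward shaping adds the term gamma * Phi(s') - Phi(s) to the environment reward. The minimal sketch below assumes a hypothetical state whose first entry is the distance to the opponent and a hypothetical potential equal to the negative of that distance; neither the names nor the potential come from the paper.

```python
from typing import Callable, Sequence

State = Sequence[float]


def shaped_reward(
    reward: float,
    state: State,
    next_state: State,
    potential: Callable[[State], float],
    gamma: float = 0.99,
    done: bool = False,
) -> float:
    """Return reward + gamma * Phi(next_state) - Phi(state).

    The potential of a terminal state is taken as zero, a common
    convention and an assumption here, not a detail from the paper.
    """
    next_phi = 0.0 if done else potential(next_state)
    return reward + gamma * next_phi - potential(state)


def negative_distance_potential(state: State) -> float:
    """Hypothetical potential: closer to the opponent means higher potential.

    Assumes state[0] holds the distance to the opponent (illustrative only).
    """
    return -state[0]


if __name__ == "__main__":
    # Closing from distance 10 to distance 8 with a zero environment reward.
    bonus = shaped_reward(0.0, [10.0], [8.0], negative_distance_potential, gamma=1.0)
    print(bonus)
```

In a MADDPG or DDPG training loop, a term of this kind would typically be applied to each transition's reward before the transition is stored in the replay buffer; how the paper integrates it is not stated in the abstract.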

Bibliographic details

Source: International Conference on Computer Vision, Image and Deep Learning, 2020, pp. 651-655 (5 pages)
Authors: Weiren KONG; Deyun ZHOU; Zhen YANG
Original format: PDF
Keywords: Conferences; Autonomous agents; Multi-agent systems; Aircraft; Aerospace control; Aerodynamics; Training
