Sensors (Basel, Switzerland)

UAV Autonomous Tracking and Landing Based on Deep Reinforcement Learning Strategy


Abstract

Unmanned aerial vehicle (UAV) autonomous tracking and landing is playing an increasingly important role in military and civil applications. In particular, machine learning has been successfully introduced to robotics-related tasks. A novel UAV autonomous tracking and landing approach based on a deep reinforcement learning strategy is presented in this paper, with the aim of dealing with the UAV motion control problem in an unpredictable and harsh environment. Instead of building a prior model and inferring the landing actions based on heuristic rules, a model-free method based on a partially observable Markov decision process (POMDP) is proposed. In the POMDP model, the UAV automatically learns the landing maneuver through an end-to-end neural network, which combines the Deep Deterministic Policy Gradient (DDPG) algorithm and heuristic rules. A Modular Open Robots Simulation Engine (MORSE)-based reinforcement learning framework is designed and validated on a continuous UAV tracking and landing task over a randomly moving platform under high sensor noise and intermittent measurements. The simulation results show that when the moving platform follows different trajectories, the average landing success rate of the proposed algorithm is about 10% higher than that of the Proportional-Integral-Derivative (PID) method. As an indirect result, a state-of-the-art deep reinforcement learning-based UAV control method is validated, in which the UAV learns an optimal strategy for continuous autonomous landing and performs properly in a simulation environment.
机译:无人驾驶飞行器(UAV)自主追踪和着陆正在军事和民用应用中发挥着越来越重要的作用。特别是,已经成功地引入了与机器人有关的任务的机器学习。本文提出了一种基于深度加强学习策略的新型无人机自主追踪和着陆方法,目的是在不可预测和严酷的环境中处理无人机运动控制问题。提出了一种基于启发式规则来推断出基于启发式规则的预测动作,而不是基于局部观察到的马尔可夫决策过程(POMDP)的模型方法而不是构建先前的模型。在POMDP模型中,UAV通过端到端神经网络自动学习着陆机构,该网络结合了深度确定性政策梯度(DDPG)算法和启发式规则。基于模块化开放机器人仿真发动机(MORSE)的加固学习框架是在高传感器噪声和间歇测量的随机移动平台上的连续无人机跟踪和着陆任务设计和验证。仿真结果表明,当移动平台在不同的轨迹中移动时,所提出的算法的平均着陆成功率比比例积分(PID)方法高约10%。作为间接结果,验证了基于最先进的深增强学习的UAV控制方法,其中UAV可以学习连续自主着陆的最佳策略,并在模拟环境中正确执行。
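The abstract's core mechanism is a DDPG-style actor-critic update: a deterministic actor maps the observed state to a continuous control action, a critic estimates Q(s, a), and slowly updated target networks stabilize the bootstrapped targets. The sketch below is a minimal, hypothetical illustration of that update loop on a toy 1-D vertical-landing task; it is not the paper's implementation, and the linear "networks", dynamics, reward, and hyperparameters are all assumptions chosen for brevity.

```python
# Minimal DDPG-style update sketch (hypothetical; not the paper's code).
import numpy as np

rng = np.random.default_rng(0)

class LinearNet:
    """Tiny linear function approximator standing in for a deep network."""
    def __init__(self, in_dim, out_dim):
        self.W = rng.normal(scale=0.1, size=(out_dim, in_dim))
    def __call__(self, x):
        return self.W @ x
    def soft_update(self, other, tau):
        # Polyak averaging toward the learned network (target-network trick).
        self.W = (1 - tau) * self.W + tau * other.W

# State: [altitude error, vertical velocity]; action: thrust adjustment.
actor, critic = LinearNet(2, 1), LinearNet(3, 1)      # critic takes (s, a)
actor_t, critic_t = LinearNet(2, 1), LinearNet(3, 1)  # target networks
buffer = []  # replay buffer of (s, a, r, s') transitions

def step(s, a):
    """Toy landing dynamics: thrust changes velocity, velocity changes error."""
    err, vel = s
    vel = vel + 0.1 * a
    err = err + 0.1 * vel
    return np.array([err, vel]), -abs(err)  # reward: stay near the pad

gamma, tau, lr = 0.99, 0.01, 1e-3
s = np.array([1.0, 0.0])
for t in range(500):
    a = float(actor(s)[0]) + rng.normal(scale=0.1)    # exploration noise
    s2, r = step(s, a)
    buffer.append((s, a, r, s2))
    s = s2
    # Sample one stored transition and perform a DDPG update.
    s_b, a_b, r_b, s2_b = buffer[rng.integers(len(buffer))]
    a2 = actor_t(s2_b)                                # target action
    q_target = r_b + gamma * float(critic_t(np.append(s2_b, a2))[0])
    x = np.append(s_b, a_b)
    td = float(critic(x)[0]) - q_target
    critic.W -= lr * td * x                           # descend 0.5*td^2
    # Deterministic policy gradient: ascend dQ/da * dmu/dW.
    dq_da = critic.W[0, 2]
    actor.W += lr * dq_da * s_b
    actor_t.soft_update(actor, tau)
    critic_t.soft_update(critic, tau)
```

In the paper's full setting, the linear maps would be deep networks, the toy dynamics would be replaced by the MORSE simulator with sensor noise and intermittent measurements, and heuristic rules would shape the landing maneuver; the actor-critic and target-network structure above is the shared skeleton.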
