UAV navigation in high dynamic environments:A deep reinforcement learning approach

来源 :中国航空学报(英文版) | 被引量 : 0次 | 上传用户:wxjct
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Unmanned Aerial Vehicle (UAV) navigation is aimed at guiding a UAV to the desired destinations along a collision-free and efficient path without human interventions, and it plays a crucial role in autonomous missions in harsh environments. The recently emerging Deep Reinforce-ment Learning (DRL) methods have shown promise for addressing the UAV navigation problem, but most of these methods cannot converge due to the massive amounts of interactive data when a UAV is navigating in high dynamic environments, where there are numerous obstacles moving fast. In this work, we propose an improved DRL-based method to tackle these fundamental limitations. To be specific, we develop a distributed DRL framework to decompose the UAV navigation task into two simpler sub-tasks, each of which is solved through the designed Long Short-Term Memory (LSTM) based DRL network by using only part of the interactive data. Furthermore, a clipped DRL loss function is proposed to closely stack the two sub-solutions into one integral for the UAV navigation problem. Extensive simulation results are provided to corroborate the superiority of the proposed method in terms of the convergence and effectiveness compared with those of the state-of-the-art DRL methods.
其他文献
【摘要】新课程标准更加强调学生语言综合运用的能力。对于农村地区高一学生要注意与初中的衔接过渡,使之尽快适应高中英语教学;要始终贯彻交际性原则,强化学生主体意识;培养学生的自学能力,发挥学习的积极性和主动性;激发学生学习英语的兴趣,增强教学效果;培养学生的思维能力和思维品质,在平时的英语学习中多问为什么;帮助学生归纳和总结,不断提高英语成绩。  【关键词】高一学生 主体意识 自学能力 兴趣 多问 归
某军用装备为满足GJB151A《军用设备和分系统电磁发射和敏感度要求》,必须采取措施抑制低频电流谐波,以通过CE101标准.为满足军用装备体积重量的限制要求,可以采用并联有源电
会议
To solve the rapid transient control problem of Flight Environment Simulation System (FESS) of Altitude Ground Test Facilities (AGTF) with large heat transfer u
电容式电压互感器由于其自身的结构和工作特性,在进行投切电容器等系统操作时,其高压侧熔断器经常发生异常爆裂事故.结合实际工程案例,通过对某一变电站进行电力电容器投切试
会议