A State-Decomposition DDPG Algorithm for UAV Autonomous Navigation in 3-D Complex Environments

Lijuan Zhang; Jiabin Peng; Weiguo Yi; Hang Lin; Lei Lei; Xiaoqin Song

doi:10.1109/JIOT.2023.3327753

Lijuan Zhang, Jiabin Peng, Weiguo Yi, Hang Lin, Lei Lei^*, Xiaoqin Song

^*Corresponding author for this work

Nanjing University of Aeronautics and Astronautics

Research output: Contribution to journal › Article › peer-review

6 Citations (Scopus)

Abstract

Over the past decade, unmanned aerial vehicles (UAVs) have been widely applied in many areas, such as goods delivery, disaster monitoring, and search and rescue. In most of these applications, autonomous navigation is one of the key techniques that enable a UAV to perform various tasks. However, UAV autonomous navigation in complex environments presents significant challenges because observation, orientation, decision, and action must be handled simultaneously. In this work, an efficient state-decomposition deep deterministic policy gradient algorithm is proposed for UAV autonomous navigation (SDDPG-NAV) in 3-D complex environments. In SDDPG-NAV, a novel state-decomposition method that uses two separate subnetworks for the perception-related and target-related states is developed to establish more appropriate actor networks. We also design objective-oriented reward functions to solve the sparse reward problem, including functions for approaching the target, avoiding obstacles, and step awards. Moreover, training strategies are introduced to maintain the balance between exploration and exploitation, and the network is trained over numerous experiments. The proposed SDDPG-NAV algorithm is capable of adapting to surrounding environments with generalized training experiences and effectively improves the UAV's navigation performance in 3-D complex environments. Compared with the benchmark DDPG and TD3 algorithms, SDDPG-NAV exhibits better performance in terms of convergence rate, navigation performance, and generalization capability.
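The abstract gives no architectural or numerical detail, so the following is only a minimal sketch of the two ideas it names: an actor that encodes perception-related and target-related states in separate subnetworks, and a dense reward built from target-approach, obstacle-avoidance, and per-step terms. Every name, layer size, weight initialisation, and constant here (`DecomposedActor`, `shaped_reward`, `safe_dist`, `step_penalty`) is a hypothetical choice for illustration, not the authors' implementation.

```python
import math
import random


def dense(inputs, weights, biases, activation=math.tanh):
    """One fully connected layer: activation(W @ x + b)."""
    return [
        activation(sum(w * x for w, x in zip(row, inputs)) + b)
        for row, b in zip(weights, biases)
    ]


def init_layer(n_in, n_out, rng):
    """Small random weights and zero biases (illustrative initialisation)."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


class DecomposedActor:
    """Actor with separate subnetworks for perception and target states."""

    def __init__(self, n_perception, n_target, n_hidden, n_action, seed=0):
        rng = random.Random(seed)
        self.n_perception = n_perception
        self.perception_layer = init_layer(n_perception, n_hidden, rng)
        self.target_layer = init_layer(n_target, n_hidden, rng)
        self.output_layer = init_layer(2 * n_hidden, n_action, rng)

    def act(self, state):
        # Decompose the flat state vector into its two parts (assumed layout:
        # perception readings first, target-related values after them).
        perception = state[: self.n_perception]
        target = state[self.n_perception:]
        # Each part is encoded by its own subnetwork.
        h_p = dense(perception, *self.perception_layer)
        h_t = dense(target, *self.target_layer)
        # The concatenated features produce a deterministic action in [-1, 1].
        return dense(h_p + h_t, *self.output_layer)


def shaped_reward(prev_dist, dist, min_obstacle_dist,
                  safe_dist=1.0, step_penalty=0.01):
    """Dense reward: progress toward target, obstacle penalty, step cost."""
    approach = prev_dist - dist                       # positive when closer
    avoid = -max(0.0, safe_dist - min_obstacle_dist)  # penalise near misses
    return approach + avoid - step_penalty


# Hypothetical state: 8 range readings followed by a 3-D target offset.
actor = DecomposedActor(n_perception=8, n_target=3, n_hidden=4, n_action=3)
action = actor.act([0.5] * 8 + [1.0, -0.2, 0.3])
reward = shaped_reward(prev_dist=10.0, dist=9.5, min_obstacle_dist=0.6)
```

In a full DDPG setup the actor would be trained against a critic with exploration noise added to `action`; that training loop is omitted here because the abstract does not describe it.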

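Original language	English
Pages (from-to)	10778-10790
Number of pages	13
Journal	IEEE Internet of Things Journal
Volume	11
Issue number	6
DOI	https://doi.org/10.1109/JIOT.2023.3327753
Publication status	Published - 15 Mar 2024
Externally published	Yes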

Cite this

@article{c35511ae17854fa8b790bbbfa63a407c,

title = "A State-Decomposition DDPG Algorithm for UAV Autonomous Navigation in 3-D Complex Environments",

abstract = "Over the past decade, unmanned aerial vehicles (UAVs) have been widely applied in many areas, such as goods delivery, disaster monitoring, search and rescue etc. In most of these applications, autonomous navigation is one of the key techniques that enable UAV to perform various tasks. However, UAV autonomous navigation in complex environments presents significant challenges due to the difficulty in simultaneously observing, orientation, decision and action. In this work, an efficient state-decomposition deep deterministic policy gradient algorithm is proposed for UAV autonomous navigation (SDDPG-NAV) in 3-D complex environments. In SDDPG-NAV, a novel state-decomposition method that uses two subnetworks for the perception-related and target-related states separately is developed to establish more appropriate actor networks. We also designed some objective-oriented reward functions to solve the sparse reward problem, including approaching the target, and avoiding obstacles and step award functions. Moreover, some training strategies are introduced to maintain the balance between exploration and exploitation, and the network is well trained with numerous experiments. The proposed SDDPG-NAV algorithm is capable of adapting to surrounding environments with generalized training experiences and effectively improves UAV's navigation performance in 3-D complex environments. Comparing with the benchmark DDPG and TD3 algorithms, SDDPG-NAV exhibits better performance in terms of convergence rate, navigation performance, and generalization capability.",

keywords = "Autonomous navigation, decision making, deep reinforcement learning (DRL), path planning, unmanned aerial vehicle (UAV) autonomy",

author = "Lijuan Zhang and Jiabin Peng and Weiguo Yi and Hang Lin and Lei Lei and Xiaoqin Song",

note = "Publisher Copyright: {\textcopyright} 2014 IEEE.",

year = "2024",

month = mar,

day = "15",

doi = "10.1109/JIOT.2023.3327753",

language = "English",

volume = "11",

pages = "10778--10790",

journal = "IEEE Internet of Things Journal",

issn = "2327-4662",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "6",

}

TY - JOUR

T1 - A State-Decomposition DDPG Algorithm for UAV Autonomous Navigation in 3-D Complex Environments

AU - Zhang, Lijuan

AU - Peng, Jiabin

AU - Yi, Weiguo

AU - Lin, Hang

AU - Lei, Lei

AU - Song, Xiaoqin

PY - 2024/3/15

Y1 - 2024/3/15

N2 - Over the past decade, unmanned aerial vehicles (UAVs) have been widely applied in many areas, such as goods delivery, disaster monitoring, search and rescue etc. In most of these applications, autonomous navigation is one of the key techniques that enable UAV to perform various tasks. However, UAV autonomous navigation in complex environments presents significant challenges due to the difficulty in simultaneously observing, orientation, decision and action. In this work, an efficient state-decomposition deep deterministic policy gradient algorithm is proposed for UAV autonomous navigation (SDDPG-NAV) in 3-D complex environments. In SDDPG-NAV, a novel state-decomposition method that uses two subnetworks for the perception-related and target-related states separately is developed to establish more appropriate actor networks. We also designed some objective-oriented reward functions to solve the sparse reward problem, including approaching the target, and avoiding obstacles and step award functions. Moreover, some training strategies are introduced to maintain the balance between exploration and exploitation, and the network is well trained with numerous experiments. The proposed SDDPG-NAV algorithm is capable of adapting to surrounding environments with generalized training experiences and effectively improves UAV's navigation performance in 3-D complex environments. Comparing with the benchmark DDPG and TD3 algorithms, SDDPG-NAV exhibits better performance in terms of convergence rate, navigation performance, and generalization capability.

AB - Over the past decade, unmanned aerial vehicles (UAVs) have been widely applied in many areas, such as goods delivery, disaster monitoring, search and rescue etc. In most of these applications, autonomous navigation is one of the key techniques that enable UAV to perform various tasks. However, UAV autonomous navigation in complex environments presents significant challenges due to the difficulty in simultaneously observing, orientation, decision and action. In this work, an efficient state-decomposition deep deterministic policy gradient algorithm is proposed for UAV autonomous navigation (SDDPG-NAV) in 3-D complex environments. In SDDPG-NAV, a novel state-decomposition method that uses two subnetworks for the perception-related and target-related states separately is developed to establish more appropriate actor networks. We also designed some objective-oriented reward functions to solve the sparse reward problem, including approaching the target, and avoiding obstacles and step award functions. Moreover, some training strategies are introduced to maintain the balance between exploration and exploitation, and the network is well trained with numerous experiments. The proposed SDDPG-NAV algorithm is capable of adapting to surrounding environments with generalized training experiences and effectively improves UAV's navigation performance in 3-D complex environments. Comparing with the benchmark DDPG and TD3 algorithms, SDDPG-NAV exhibits better performance in terms of convergence rate, navigation performance, and generalization capability.

KW - Autonomous navigation

KW - decision making

KW - deep reinforcement learning (DRL)

KW - path planning

KW - unmanned aerial vehicle (UAV) autonomy

UR - http://www.scopus.com/inward/record.url?scp=85181560082&partnerID=8YFLogxK

U2 - 10.1109/JIOT.2023.3327753

DO - 10.1109/JIOT.2023.3327753

M3 - Article

AN - SCOPUS:85181560082

SN - 2327-4662

VL - 11

SP - 10778

EP - 10790

JO - IEEE Internet of Things Journal

JF - IEEE Internet of Things Journal

IS - 6

ER -
