Delayed Soft Actor-Critic Based Path Planning Method for UAV in Dense Obstacles Environment

Jianxin Zhong; Teng Long; JingLiang Sun; Junzhi Li; Yan Cao

doi:10.1109/ICCSSE59359.2023.10245929

School of Aerospace Engineering

Beijing Institute of Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

To improve the convergence performance of the soft actor-critic (SAC) algorithm in path planning problems, a delayed prioritized experience replay soft actor-critic (DPERSAC) method is proposed, which uses a novel non-uniform experience replay mechanism to reduce convergence time. A mathematical model of the path planning problem is built for unmanned aerial vehicles (UAVs) subject to flight performance constraints and obstacle avoidance constraints. The three typical elements of SAC are then customized to meet the requirements of UAV path planning. Unlike the traditional update scheme, in which the soft Q-function network and the policy network are updated recursively, in this paper the soft Q-function network is first updated conditionally and the policy network is then iterated based on the trained soft Q-function. Finally, Monte Carlo simulation results demonstrate that the computational time of the proposed DPERSAC method is only 4% of that of the rolling-based sparse A* algorithm in a dense obstacle environment.
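The abstract does not give the details of the replay mechanism or the update rule, so the following is only a minimal sketch of the two general ideas it names: sampling stored transitions non-uniformly, and updating the policy less often than the soft Q-function. It assumes generic proportional prioritization by TD-error magnitude and a fixed update interval; the class `PrioritizedReplayBuffer`, the function `should_update_policy`, and the parameters `alpha`, `eps` and `policy_delay` are hypothetical names chosen for illustration and are not taken from the paper.

```python
import random


class PrioritizedReplayBuffer:
    """Toy proportional prioritized replay buffer (illustrative, not the paper's design)."""

    def __init__(self, capacity, alpha=0.6, eps=1e-3):
        self.capacity = capacity
        self.alpha = alpha  # assumed exponent: 0 gives uniform sampling
        self.eps = eps      # assumed offset: keeps every priority positive
        self.transitions = []
        self.priorities = []
        self.next_slot = 0

    def _priority(self, td_error):
        return (abs(td_error) + self.eps) ** self.alpha

    def add(self, transition, td_error=1.0):
        # Fill the buffer first, then overwrite the oldest slot in a ring.
        if len(self.transitions) < self.capacity:
            self.transitions.append(transition)
            self.priorities.append(self._priority(td_error))
        else:
            self.transitions[self.next_slot] = transition
            self.priorities[self.next_slot] = self._priority(td_error)
        self.next_slot = (self.next_slot + 1) % self.capacity

    def sample(self, batch_size, rng=random):
        # Non-uniform sampling: selection probability is proportional to priority.
        indices = rng.choices(
            range(len(self.transitions)), weights=self.priorities, k=batch_size
        )
        return indices, [self.transitions[i] for i in indices]

    def update_priority(self, index, td_error):
        # Refresh a priority after the critic has re-evaluated the transition.
        self.priorities[index] = self._priority(td_error)


def should_update_policy(step, policy_delay=2):
    """Assumed schedule: update the policy once every `policy_delay` critic updates."""
    return step % policy_delay == 0
```

In a training loop built on this sketch, every step would sample a batch and update the soft Q-function, while the policy would be updated only on steps where `should_update_policy` returns true. The condition used in the paper may differ from this fixed interval.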

Original language	English
Title of host publication	2023 9th International Conference on Control Science and Systems Engineering, ICCSSE 2023
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	172-177
Number of pages	6
ISBN (Electronic)	9798350339055
DOIs	https://doi.org/10.1109/ICCSSE59359.2023.10245929
Publication status	Published - 2023
Event	9th International Conference on Control Science and Systems Engineering, ICCSSE 2023, Shenzhen, China (16 Jun 2023 → 18 Jun 2023)

Publication series

Name	2023 9th International Conference on Control Science and Systems Engineering, ICCSSE 2023

Conference

Conference	9th International Conference on Control Science and Systems Engineering, ICCSSE 2023
Country/Territory	China
City	Shenzhen
Period	16/06/23 → 18/06/23

Keywords

Flight Path Planning
Prioritized Experience Replay
Reinforcement Learning
Soft Actor-Critic

Cite this

Zhong, J., Long, T., Sun, J., Li, J., & Cao, Y. (2023). Delayed Soft Actor-Critic Based Path Planning Method for UAV in Dense Obstacles Environment. In 2023 9th International Conference on Control Science and Systems Engineering, ICCSSE 2023 (pp. 172-177). (2023 9th International Conference on Control Science and Systems Engineering, ICCSSE 2023). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICCSSE59359.2023.10245929

Zhong, Jianxin; Long, Teng; Sun, JingLiang et al. / Delayed Soft Actor-Critic Based Path Planning Method for UAV in Dense Obstacles Environment. 2023 9th International Conference on Control Science and Systems Engineering, ICCSSE 2023. Institute of Electrical and Electronics Engineers Inc., 2023. pp. 172-177 (2023 9th International Conference on Control Science and Systems Engineering, ICCSSE 2023).

@inproceedings{b8d1ba579a124efe8460c2f184a44c48,
  title = "Delayed Soft Actor-Critic Based Path Planning Method for {UAV} in Dense Obstacles Environment",
  abstract = "To improve the convergence performance of the soft actor-critic (SAC) algorithm in path planning problems, a delayed prioritized experience replay soft actor-critic (DPERSAC) method is proposed, which uses a novel non-uniform experience replay mechanism to reduce convergence time. A mathematical model of the path planning problem is built for unmanned aerial vehicles (UAVs) subject to flight performance constraints and obstacle avoidance constraints. The three typical elements of SAC are then customized to meet the requirements of UAV path planning. Unlike the traditional update scheme, in which the soft Q-function network and the policy network are updated recursively, in this paper the soft Q-function network is first updated conditionally and the policy network is then iterated based on the trained soft Q-function. Finally, Monte Carlo simulation results demonstrate that the computational time of the proposed DPERSAC method is only 4\% of that of the rolling-based sparse A* algorithm in a dense obstacle environment.",
  keywords = "Flight Path Planning, Prioritized Experience Replay, Reinforcement Learning, Soft Actor-Critic",
  author = "Jianxin Zhong and Teng Long and JingLiang Sun and Junzhi Li and Yan Cao",
  note = "Publisher Copyright: {\textcopyright} 2023 IEEE.; 9th International Conference on Control Science and Systems Engineering, ICCSSE 2023 ; Conference date: 16-06-2023 Through 18-06-2023",
  year = "2023",
  doi = "10.1109/ICCSSE59359.2023.10245929",
  language = "English",
  series = "2023 9th International Conference on Control Science and Systems Engineering, ICCSSE 2023",
  publisher = "Institute of Electrical and Electronics Engineers Inc.",
  pages = "172--177",
  booktitle = "2023 9th International Conference on Control Science and Systems Engineering, ICCSSE 2023",
  address = "United States",
}

Zhong, J, Long, T, Sun, J, Li, J & Cao, Y 2023, Delayed Soft Actor-Critic Based Path Planning Method for UAV in Dense Obstacles Environment. in 2023 9th International Conference on Control Science and Systems Engineering, ICCSSE 2023. 2023 9th International Conference on Control Science and Systems Engineering, ICCSSE 2023, Institute of Electrical and Electronics Engineers Inc., pp. 172-177, 9th International Conference on Control Science and Systems Engineering, ICCSSE 2023, Shenzhen, China, 16/06/23. https://doi.org/10.1109/ICCSSE59359.2023.10245929

TY - GEN

T1 - Delayed Soft Actor-Critic Based Path Planning Method for UAV in Dense Obstacles Environment

AU - Zhong, Jianxin

AU - Long, Teng

AU - Sun, JingLiang

AU - Li, Junzhi

AU - Cao, Yan

PY - 2023

Y1 - 2023

AB - To improve the convergence performance of the soft actor-critic (SAC) algorithm in path planning problems, a delayed prioritized experience replay soft actor-critic (DPERSAC) method is proposed, which uses a novel non-uniform experience replay mechanism to reduce convergence time. A mathematical model of the path planning problem is built for unmanned aerial vehicles (UAVs) subject to flight performance constraints and obstacle avoidance constraints. The three typical elements of SAC are then customized to meet the requirements of UAV path planning. Unlike the traditional update scheme, in which the soft Q-function network and the policy network are updated recursively, in this paper the soft Q-function network is first updated conditionally and the policy network is then iterated based on the trained soft Q-function. Finally, Monte Carlo simulation results demonstrate that the computational time of the proposed DPERSAC method is only 4% of that of the rolling-based sparse A* algorithm in a dense obstacle environment.

KW - Flight Path Planning

KW - Prioritized Experience Replay

KW - Reinforcement Learning

KW - Soft Actor-Critic

UR - http://www.scopus.com/inward/record.url?scp=85173820876&partnerID=8YFLogxK

U2 - 10.1109/ICCSSE59359.2023.10245929

DO - 10.1109/ICCSSE59359.2023.10245929

M3 - Conference contribution

AN - SCOPUS:85173820876

T3 - 2023 9th International Conference on Control Science and Systems Engineering, ICCSSE 2023

SP - 172

EP - 177

BT - 2023 9th International Conference on Control Science and Systems Engineering, ICCSSE 2023

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 9th International Conference on Control Science and Systems Engineering, ICCSSE 2023

Y2 - 16 June 2023 through 18 June 2023

ER -

Zhong J, Long T, Sun J, Li J, Cao Y. Delayed Soft Actor-Critic Based Path Planning Method for UAV in Dense Obstacles Environment. In 2023 9th International Conference on Control Science and Systems Engineering, ICCSSE 2023. Institute of Electrical and Electronics Engineers Inc. 2023. p. 172-177. (2023 9th International Conference on Control Science and Systems Engineering, ICCSSE 2023). doi: 10.1109/ICCSSE59359.2023.10245929
