Delayed Soft Actor-Critic Based Path Planning Method for UAV in Dense Obstacles Environment

Jianxin Zhong, Teng Long, JingLiang Sun, Junzhi Li, Yan Cao

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In order to improve the convergence performance of soft actor-critic (SAC) algorithm in path planning problems, a delayed prioritized experience replay soft actor critic (DPERSAC) is proposed by designing a novel experience replay mechanism in a non-uniform manner for decreasing the convergence time. The path planning mathematical model is built for unmanned aerial vehicles (UAVs) subject to the flight performance constraints and obstacle avoidance constraints. Then the three typical elements of SAC are customized to satisfy the requirements of UAV's path planning. Differ from the traditional update manner that the soft Q-function network and policy network are updated recursively, the soft Q-function network is updated conditionally firstly and the policy network is subsequently iterated based on the trained soft Q-function in this paper. Finally, the Monte Carlo simulation results demonstrate that the computational time of the proposed DPERSAC method is only 4% of the rolling-based sparse A ∗ algorithm in the dense obstacle environment.

Original languageEnglish
Title of host publication2023 9th International Conference on Control Science and Systems Engineering, ICCSSE 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages172-177
Number of pages6
ISBN (Electronic)9798350339055
DOIs
Publication statusPublished - 2023
Event9th International Conference on Control Science and Systems Engineering, ICCSSE 2023 - Shenzhen, China
Duration: 16 Jun 202318 Jun 2023

Publication series

Name2023 9th International Conference on Control Science and Systems Engineering, ICCSSE 2023

Conference

Conference9th International Conference on Control Science and Systems Engineering, ICCSSE 2023
Country/TerritoryChina
CityShenzhen
Period16/06/2318/06/23

Keywords

  • Flight Path Planning
  • Prioritized Experience Replay
  • Reinforcement Learning
  • Soft Actor-Critic

Fingerprint

Dive into the research topics of 'Delayed Soft Actor-Critic Based Path Planning Method for UAV in Dense Obstacles Environment'. Together they form a unique fingerprint.

Cite this