面向无人艇的 T-DQN 智能避障算法研究

Translated title of the contribution: Research on T-DQN Intelligent Obstacle Avoidance Algorithm of Unmanned Surface Vehicle

Zhi Guo Zhou*, Si Yu Yu, Jia Bao Yu, Jun Wei Duan, Long Chen, Jun Long Chen

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

3 Citations (Scopus)

Abstract

Unmanned surface vehicle (USV) is a kind of unmanned system with wide application prospect, and it is important to train the autonomous decision-making ability. Due to the wide water surface motion environment, traditional obstacle avoidance algorithms are difficult to independently plan a reasonable route under quantitative rules, while the general reinforcement learning methods are difficult to converge quickly in large and complex environment. To solve these problems, we propose a threshold deep Q network (T-DQN) algorithm, by adding long short-term memory (LSTM) network on basis of deep Q network (DQN), to save training information, and setting proper threshold value of experience replay pool to accelerate convergence. We conducted simulation experiments in different sizes grid, and the results show T-DQN method can converge to optimal path quickly, compared with the Q-learning and DQN, the number of convergence episodes is reduced by 69.1%, and 24.8%, respectively. The threshold mechanism reduces overall convergence steps by 41.1%. We also verified the algorithm in Unity 3D reinforcement learning simulation platform to investigate the completion of obstacle avoidance tasks under complex maps, the experiment results show that the algorithm can realize detailed obstacle avoidance and intelligent safe navigation.

Translated title of the contributionResearch on T-DQN Intelligent Obstacle Avoidance Algorithm of Unmanned Surface Vehicle
Original languageChinese (Traditional)
Pages (from-to)1645-1655
Number of pages11
JournalZidonghua Xuebao/Acta Automatica Sinica
Volume49
Issue number8
DOIs
Publication statusPublished - Aug 2023

Fingerprint

Dive into the research topics of 'Research on T-DQN Intelligent Obstacle Avoidance Algorithm of Unmanned Surface Vehicle'. Together they form a unique fingerprint.

Cite this