Research on T-DQN Intelligent Obstacle Avoidance Algorithm of Unmanned Surface Vehicle (面向无人艇的 T-DQN 智能避障算法研究)

Zhi Guo Zhou*, Si Yu Yu, Jia Bao Yu, Jun Wei Duan, Long Chen, Jun Long Chen

*Corresponding author for this work

Research output: Contribution to journal, Article, peer-reviewed

3 Citations (Scopus)

Abstract

Unmanned surface vehicles (USVs) are unmanned systems with broad application prospects, and training their autonomous decision-making ability is important. Because USVs operate over large water surfaces, traditional obstacle avoidance algorithms struggle to plan a reasonable route independently under quantitative rules, while general reinforcement learning methods struggle to converge quickly in large, complex environments. To address these problems, we propose a threshold deep Q network (T-DQN) algorithm, which adds a long short-term memory (LSTM) network to the deep Q network (DQN) to retain training information and sets a suitable threshold on the experience replay pool to accelerate convergence. We conducted simulation experiments on grids of different sizes. The results show that T-DQN converges quickly to the optimal path: compared with Q-learning and DQN, the number of episodes to convergence is reduced by 69.1% and 24.8%, respectively, and the threshold mechanism reduces the overall number of convergence steps by 41.1%. We also verified the algorithm on a Unity 3D reinforcement learning simulation platform to examine how well obstacle avoidance tasks are completed on complex maps. The experimental results show that the algorithm achieves fine-grained obstacle avoidance and intelligent, safe navigation.
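
The abstract does not say how the threshold on the experience replay pool is applied. As a rough illustration only, the sketch below assumes one possible reading: learning updates are skipped until the pool holds at least `threshold` transitions. The class name `ThresholdReplayPool`, its parameters, and the toy transitions are all hypothetical and are not taken from the paper.

```python
import random
from collections import deque


class ThresholdReplayPool:
    """Replay pool that yields batches only once it holds at least
    `threshold` transitions (an assumed reading, not the paper's code)."""

    def __init__(self, capacity, threshold, seed=None):
        if not 0 < threshold <= capacity:
            raise ValueError("threshold must be in (0, capacity]")
        self.buffer = deque(maxlen=capacity)  # oldest transitions are evicted first
        self.threshold = threshold
        self.rng = random.Random(seed)

    def push(self, state, action, reward, next_state, done):
        """Store one transition tuple."""
        self.buffer.append((state, action, reward, next_state, done))

    def ready(self):
        """True once enough transitions are stored to start learning."""
        return len(self.buffer) >= self.threshold

    def sample(self, batch_size):
        """Return a uniformly sampled batch, or None below the threshold."""
        if not self.ready():
            return None
        k = min(batch_size, len(self.buffer))
        return self.rng.sample(list(self.buffer), k)


pool = ThresholdReplayPool(capacity=100, threshold=5, seed=0)
for step in range(4):
    pool.push(step, 0, -1.0, step + 1, False)
early = pool.sample(2)          # None: only 4 of the 5 required transitions stored
pool.push(4, 1, 10.0, 5, True)
batch = pool.sample(2)          # two transitions drawn uniformly at random
```

In a training loop, the agent would call `sample` every step and skip the Q-network update whenever it returns `None`. Other readings of the threshold, such as filtering transitions by reward, are equally compatible with the abstract.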

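Translated title of the contribution: Research on T-DQN Intelligent Obstacle Avoidance Algorithm of Unmanned Surface Vehicle
Original language: Traditional Chinese
Pages (from-to): 1645-1655
Number of pages: 11
Journal: Zidonghua Xuebao/Acta Automatica Sinica
Volume: 49
Issue number: 8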
Publication status: Published - Aug 2023

Keywords

  • Unmanned surface vehicle (USV)
  • deep Q network (DQN)
  • intelligent obstacle avoidance
  • reinforcement learning
