面向无人艇的 T-DQN 智能避障算法研究

Zhi Guo Zhou; Si Yu Yu; Jia Bao Yu; Jun Wei Duan; Long Chen; Jun Long Chen

doi:10.16383/j.aas.c210080

Translated title of the contribution: Research on T-DQN Intelligent Obstacle Avoidance Algorithm of Unmanned Surface Vehicle

Zhi Guo Zhou^*, Si Yu Yu, Jia Bao Yu, Jun Wei Duan, Long Chen, Jun Long Chen

^*Corresponding author for this work

School of Integrated Circuits and Electronics

Research output: Contribution to journal › Article › peer-review

3 Citations (Scopus)

Abstract

The unmanned surface vehicle (USV) is an unmanned system with broad application prospects, and training its autonomous decision-making ability is important. Because the water-surface motion environment is wide, traditional obstacle avoidance algorithms struggle to plan a reasonable route independently under quantitative rules, while general reinforcement learning methods struggle to converge quickly in large, complex environments. To solve these problems, we propose a threshold deep Q network (T-DQN) algorithm that adds a long short-term memory (LSTM) network to the deep Q network (DQN) to retain training information and sets a proper threshold on the experience replay pool to accelerate convergence. We conducted simulation experiments on grids of different sizes. The results show that T-DQN converges quickly to the optimal path: compared with Q-learning and DQN, the number of episodes to convergence is reduced by 69.1% and 24.8%, respectively, and the threshold mechanism reduces the overall number of convergence steps by 41.1%. We also verified the algorithm on a Unity 3D reinforcement learning simulation platform to examine how well it completes obstacle avoidance tasks on complex maps. The experimental results show that the algorithm achieves fine-grained obstacle avoidance and intelligent, safe navigation.

Translated title of the contribution	Research on T-DQN Intelligent Obstacle Avoidance Algorithm of Unmanned Surface Vehicle
Original language	Chinese (Traditional)
Pages (from-to)	1645-1655
Number of pages	11
Journal	Zidonghua Xuebao/Acta Automatica Sinica
Volume	49
Issue number	8
DOIs	https://doi.org/10.16383/j.aas.c210080
Publication status	Published - Aug 2023

Cite this

@article{42d454562549468884c77cadbf16a47a,

title = "面向无人艇的 T-DQN 智能避障算法研究",

abstract = "The unmanned surface vehicle (USV) is an unmanned system with broad application prospects, and training its autonomous decision-making ability is important. Because the water-surface motion environment is wide, traditional obstacle avoidance algorithms struggle to plan a reasonable route independently under quantitative rules, while general reinforcement learning methods struggle to converge quickly in large, complex environments. To solve these problems, we propose a threshold deep Q network (T-DQN) algorithm that adds a long short-term memory (LSTM) network to the deep Q network (DQN) to retain training information and sets a proper threshold on the experience replay pool to accelerate convergence. We conducted simulation experiments on grids of different sizes. The results show that T-DQN converges quickly to the optimal path: compared with Q-learning and DQN, the number of episodes to convergence is reduced by 69.1% and 24.8%, respectively, and the threshold mechanism reduces the overall number of convergence steps by 41.1%. We also verified the algorithm on a Unity 3D reinforcement learning simulation platform to examine how well it completes obstacle avoidance tasks on complex maps. The experimental results show that the algorithm achieves fine-grained obstacle avoidance and intelligent, safe navigation.",

keywords = "Unmanned surface vehicle (USV), deep Q network (DQN), intelligent obstacle avoidance, reinforcement learning",

author = "Zhou, {Zhi Guo} and Yu, {Si Yu} and Yu, {Jia Bao} and Duan, {Jun Wei} and Chen, Long and Chen, {Jun Long}",

year = "2023",

month = aug,

doi = "10.16383/j.aas.c210080",

language = "Chinese (Traditional)",

volume = "49",

pages = "1645--1655",

journal = "Zidonghua Xuebao/Acta Automatica Sinica",

issn = "0254-4156",

publisher = "Science China Press",

number = "8",

}

TY - JOUR

T1 - 面向无人艇的 T-DQN 智能避障算法研究

AU - Zhou, Zhi Guo

AU - Yu, Si Yu

AU - Yu, Jia Bao

AU - Duan, Jun Wei

AU - Chen, Long

AU - Chen, Jun Long

PY - 2023/8

Y1 - 2023/8

N2 - The unmanned surface vehicle (USV) is an unmanned system with broad application prospects, and training its autonomous decision-making ability is important. Because the water-surface motion environment is wide, traditional obstacle avoidance algorithms struggle to plan a reasonable route independently under quantitative rules, while general reinforcement learning methods struggle to converge quickly in large, complex environments. To solve these problems, we propose a threshold deep Q network (T-DQN) algorithm that adds a long short-term memory (LSTM) network to the deep Q network (DQN) to retain training information and sets a proper threshold on the experience replay pool to accelerate convergence. We conducted simulation experiments on grids of different sizes. The results show that T-DQN converges quickly to the optimal path: compared with Q-learning and DQN, the number of episodes to convergence is reduced by 69.1% and 24.8%, respectively, and the threshold mechanism reduces the overall number of convergence steps by 41.1%. We also verified the algorithm on a Unity 3D reinforcement learning simulation platform to examine how well it completes obstacle avoidance tasks on complex maps. The experimental results show that the algorithm achieves fine-grained obstacle avoidance and intelligent, safe navigation.

KW - Unmanned surface vehicle (USV)

KW - deep Q network (DQN)

KW - intelligent obstacle avoidance

KW - reinforcement learning

UR - http://www.scopus.com/inward/record.url?scp=85184038782&partnerID=8YFLogxK

U2 - 10.16383/j.aas.c210080

DO - 10.16383/j.aas.c210080

M3 - Article

AN - SCOPUS:85184038782

SN - 0254-4156

VL - 49

SP - 1645

EP - 1655

JO - Zidonghua Xuebao/Acta Automatica Sinica

JF - Zidonghua Xuebao/Acta Automatica Sinica

IS - 8

ER -
