面向无人艇的 T-DQN 智能避障算法研究

Zhi Guo Zhou; Si Yu Yu; Jia Bao Yu; Jun Wei Duan; Long Chen; Jun Long Chen

doi:10.16383/j.aas.c210080

面向无人艇的 T-DQN 智能避障算法研究

Zhi Guo Zhou^*, Si Yu Yu, Jia Bao Yu, Jun Wei Duan, Long Chen, Jun Long Chen

^*此作品的通讯作者

集成电路与电子学院

科研成果: 期刊稿件 › 文章 › 同行评审

3 引用（Scopus）

摘要

Unmanned surface vehicle (USV) is a kind of unmanned system with wide application prospect, and it is important to train the autonomous decision-making ability. Due to the wide water surface motion environment, traditional obstacle avoidance algorithms are difficult to independently plan a reasonable route under quantitative rules, while the general reinforcement learning methods are difficult to converge quickly in large and complex environment. To solve these problems, we propose a threshold deep Q network (T-DQN) algorithm, by adding long short-term memory (LSTM) network on basis of deep Q network (DQN), to save training information, and setting proper threshold value of experience replay pool to accelerate convergence. We conducted simulation experiments in different sizes grid, and the results show T-DQN method can converge to optimal path quickly, compared with the Q-learning and DQN, the number of convergence episodes is reduced by 69.1%, and 24.8%, respectively. The threshold mechanism reduces overall convergence steps by 41.1%. We also verified the algorithm in Unity 3D reinforcement learning simulation platform to investigate the completion of obstacle avoidance tasks under complex maps, the experiment results show that the algorithm can realize detailed obstacle avoidance and intelligent safe navigation.

投稿的翻译标题	Research on T-DQN Intelligent Obstacle Avoidance Algorithm of Unmanned Surface Vehicle
源语言	繁体中文
页（从-至）	1645-1655
页数	11
期刊	Zidonghua Xuebao/Acta Automatica Sinica
卷	49
期	8
DOI	https://doi.org/10.16383/j.aas.c210080
出版状态	已出版 - 8月 2023

关键词

Unmanned surface vehicle (USV)
deep Q network (DQN)
intelligent obstacle avoidance
reinforcement learning

访问文件

10.16383/j.aas.c210080

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{42d454562549468884c77cadbf16a47a,

title = "面向无人艇的 T-DQN 智能避障算法研究",

abstract = "Unmanned surface vehicle (USV) is a kind of unmanned system with wide application prospect, and it is important to train the autonomous decision-making ability. Due to the wide water surface motion environment, traditional obstacle avoidance algorithms are difficult to independently plan a reasonable route under quantitative rules, while the general reinforcement learning methods are difficult to converge quickly in large and complex environment. To solve these problems, we propose a threshold deep Q network (T-DQN) algorithm, by adding long short-term memory (LSTM) network on basis of deep Q network (DQN), to save training information, and setting proper threshold value of experience replay pool to accelerate convergence. We conducted simulation experiments in different sizes grid, and the results show T-DQN method can converge to optimal path quickly, compared with the Q-learning and DQN, the number of convergence episodes is reduced by 69.1%, and 24.8%, respectively. The threshold mechanism reduces overall convergence steps by 41.1%. We also verified the algorithm in Unity 3D reinforcement learning simulation platform to investigate the completion of obstacle avoidance tasks under complex maps, the experiment results show that the algorithm can realize detailed obstacle avoidance and intelligent safe navigation.",

keywords = "Unmanned surface vehicle (USV), deep Q network (DQN), intelligent obstacle avoidance, reinforcement learning",

author = "Zhou, {Zhi Guo} and Yu, {Si Yu} and Yu, {Jia Bao} and Duan, {Jun Wei} and Long Chen and Chen, {Jun Long}",

year = "2023",

month = aug,

doi = "10.16383/j.aas.c210080",

language = "繁体中文",

volume = "49",

pages = "1645--1655",

journal = "Zidonghua Xuebao/Acta Automatica Sinica",

issn = "0254-4156",

publisher = "Science China Press",

number = "8",

}

TY - JOUR

T1 - 面向无人艇的 T-DQN 智能避障算法研究

AU - Zhou, Zhi Guo

AU - Yu, Si Yu

AU - Yu, Jia Bao

AU - Duan, Jun Wei

AU - Chen, Long

AU - Chen, Jun Long

PY - 2023/8

Y1 - 2023/8

N2 - Unmanned surface vehicle (USV) is a kind of unmanned system with wide application prospect, and it is important to train the autonomous decision-making ability. Due to the wide water surface motion environment, traditional obstacle avoidance algorithms are difficult to independently plan a reasonable route under quantitative rules, while the general reinforcement learning methods are difficult to converge quickly in large and complex environment. To solve these problems, we propose a threshold deep Q network (T-DQN) algorithm, by adding long short-term memory (LSTM) network on basis of deep Q network (DQN), to save training information, and setting proper threshold value of experience replay pool to accelerate convergence. We conducted simulation experiments in different sizes grid, and the results show T-DQN method can converge to optimal path quickly, compared with the Q-learning and DQN, the number of convergence episodes is reduced by 69.1%, and 24.8%, respectively. The threshold mechanism reduces overall convergence steps by 41.1%. We also verified the algorithm in Unity 3D reinforcement learning simulation platform to investigate the completion of obstacle avoidance tasks under complex maps, the experiment results show that the algorithm can realize detailed obstacle avoidance and intelligent safe navigation.

AB - Unmanned surface vehicle (USV) is a kind of unmanned system with wide application prospect, and it is important to train the autonomous decision-making ability. Due to the wide water surface motion environment, traditional obstacle avoidance algorithms are difficult to independently plan a reasonable route under quantitative rules, while the general reinforcement learning methods are difficult to converge quickly in large and complex environment. To solve these problems, we propose a threshold deep Q network (T-DQN) algorithm, by adding long short-term memory (LSTM) network on basis of deep Q network (DQN), to save training information, and setting proper threshold value of experience replay pool to accelerate convergence. We conducted simulation experiments in different sizes grid, and the results show T-DQN method can converge to optimal path quickly, compared with the Q-learning and DQN, the number of convergence episodes is reduced by 69.1%, and 24.8%, respectively. The threshold mechanism reduces overall convergence steps by 41.1%. We also verified the algorithm in Unity 3D reinforcement learning simulation platform to investigate the completion of obstacle avoidance tasks under complex maps, the experiment results show that the algorithm can realize detailed obstacle avoidance and intelligent safe navigation.

KW - Unmanned surface vehicle (USV)

KW - deep Q network (DQN)

KW - intelligent obstacle avoidance

KW - reinforcement learning

UR - http://www.scopus.com/inward/record.url?scp=85184038782&partnerID=8YFLogxK

U2 - 10.16383/j.aas.c210080

DO - 10.16383/j.aas.c210080

M3 - 文章

AN - SCOPUS:85184038782

SN - 0254-4156

VL - 49

SP - 1645

EP - 1655

JO - Zidonghua Xuebao/Acta Automatica Sinica

JF - Zidonghua Xuebao/Acta Automatica Sinica

IS - 8

ER -

面向无人艇的 T-DQN 智能避障算法研究

摘要

关键词

访问文件

其它文件与链接

指纹

引用此