A Real-time Algorithm for USV Navigation Based on Deep Reinforcement Learning

Zhiguo Zhou; Yipeng Zheng; Kaiyuan Liu; Xu He; Chong Qu

doi:10.1109/ICSIDP47821.2019.9173280

Shanghai Marine Diesel Engine Research Institute

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

8 Citations (Scopus)

Abstract

Aiming at the demand of flexibility and real-time performance in unknown aquatorium, a path planning algorithm based on Deep Reinforcement Learning (DRL) is proposed. According to plan-avoid-acclimate request, the proposed algorithm involves optimization of net structure and navigation data enrichment based on A3C, re-regulation of action space of the agent, and is trained with specific tasks in three kinds of maps to improve flexibility. The algorithm is integrated with GPU, which helps achieve high training efficiency and real-time performance by creating a neural network to collect pre-training data. Experimental results show that obstacle ability is confirmed. In comparison with current algorithm, training time reduces by 59.3% and efficiency rises by more than 71.7%. Meanwhile, performance of trained model in unknown environment is validated.

Original language	English
Title of host publication	ICSIDP 2019 - IEEE International Conference on Signal, Information and Data Processing 2019
Publisher	Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)	9781728123455
DOIs	https://doi.org/10.1109/ICSIDP47821.2019.9173280
Publication status	Published - Dec 2019
Externally published	Yes
Event	2019 IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2019 - Chongqing, China Duration: 11 Dec 2019 → 13 Dec 2019

Publication series

Name	ICSIDP 2019 - IEEE International Conference on Signal, Information and Data Processing 2019

Conference

Conference	2019 IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2019
Country/Territory	China
City	Chongqing
Period	11/12/19 → 13/12/19

Keywords

deep reinforcement learning
flexibility
path planning
real-time performance
unmanned surface vehicle

Access to Document

10.1109/ICSIDP47821.2019.9173280

Cite this

Zhou, Z., Zheng, Y., Liu, K., He, X., & Qu, C. (2019). A Real-time Algorithm for USV Navigation Based on Deep Reinforcement Learning. In ICSIDP 2019 - IEEE International Conference on Signal, Information and Data Processing 2019 Article 9173280 (ICSIDP 2019 - IEEE International Conference on Signal, Information and Data Processing 2019). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICSIDP47821.2019.9173280

Zhou, Zhiguo ; Zheng, Yipeng ; Liu, Kaiyuan et al. / A Real-time Algorithm for USV Navigation Based on Deep Reinforcement Learning. ICSIDP 2019 - IEEE International Conference on Signal, Information and Data Processing 2019. Institute of Electrical and Electronics Engineers Inc., 2019. (ICSIDP 2019 - IEEE International Conference on Signal, Information and Data Processing 2019).

@inproceedings{f367c7dfd159427bb654fdf44de6d7ff,

title = "A Real-time Algorithm for USV Navigation Based on Deep Reinforcement Learning",

abstract = "Aiming at the demand of flexibility and real-time performance in unknown aquatorium, a path planning algorithm based on Deep Reinforcement Learning (DRL) is proposed. According to plan-avoid-acclimate request, the proposed algorithm involves optimization of net structure and navigation data enrichment based on A3C, re-regulation of action space of the agent, and is trained with specific tasks in three kinds of maps to improve flexibility. The algorithm is integrated with GPU, which helps achieve high training efficiency and real-time performance by creating a neural network to collect pre-training data. Experimental results show that obstacle ability is confirmed. In comparison with current algorithm, training time reduces by 59.3\% and efficiency rises by more than 71.7\%. Meanwhile, performance of trained model in unknown environment is validated.",

keywords = "deep reinforcement learning, flexibility, path planning, real-time performance, unmanned surface vehicle",

author = "Zhiguo Zhou and Yipeng Zheng and Kaiyuan Liu and Xu He and Chong Qu",

note = "Publisher Copyright: {\textcopyright} 2019 IEEE.; 2019 IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2019 ; Conference date: 11-12-2019 Through 13-12-2019",

year = "2019",

month = dec,

doi = "10.1109/ICSIDP47821.2019.9173280",

language = "English",

series = "ICSIDP 2019 - IEEE International Conference on Signal, Information and Data Processing 2019",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

booktitle = "ICSIDP 2019 - IEEE International Conference on Signal, Information and Data Processing 2019",

address = "United States",

}

Zhou, Z, Zheng, Y, Liu, K, He, X & Qu, C 2019, A Real-time Algorithm for USV Navigation Based on Deep Reinforcement Learning. in ICSIDP 2019 - IEEE International Conference on Signal, Information and Data Processing 2019., 9173280, ICSIDP 2019 - IEEE International Conference on Signal, Information and Data Processing 2019, Institute of Electrical and Electronics Engineers Inc., 2019 IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2019, Chongqing, China, 11/12/19. https://doi.org/10.1109/ICSIDP47821.2019.9173280

A Real-time Algorithm for USV Navigation Based on Deep Reinforcement Learning. / Zhou, Zhiguo; Zheng, Yipeng; Liu, Kaiyuan et al.
ICSIDP 2019 - IEEE International Conference on Signal, Information and Data Processing 2019. Institute of Electrical and Electronics Engineers Inc., 2019. 9173280 (ICSIDP 2019 - IEEE International Conference on Signal, Information and Data Processing 2019).

TY - GEN

T1 - A Real-time Algorithm for USV Navigation Based on Deep Reinforcement Learning

AU - Zhou, Zhiguo

AU - Zheng, Yipeng

AU - Liu, Kaiyuan

AU - He, Xu

AU - Qu, Chong

PY - 2019/12

Y1 - 2019/12

N2 - Aiming at the demand of flexibility and real-time performance in unknown aquatorium, a path planning algorithm based on Deep Reinforcement Learning (DRL) is proposed. According to plan-avoid-acclimate request, the proposed algorithm involves optimization of net structure and navigation data enrichment based on A3C, re-regulation of action space of the agent, and is trained with specific tasks in three kinds of maps to improve flexibility. The algorithm is integrated with GPU, which helps achieve high training efficiency and real-time performance by creating a neural network to collect pre-training data. Experimental results show that obstacle ability is confirmed. In comparison with current algorithm, training time reduces by 59.3% and efficiency rises by more than 71.7%. Meanwhile, performance of trained model in unknown environment is validated.

AB - Aiming at the demand of flexibility and real-time performance in unknown aquatorium, a path planning algorithm based on Deep Reinforcement Learning (DRL) is proposed. According to plan-avoid-acclimate request, the proposed algorithm involves optimization of net structure and navigation data enrichment based on A3C, re-regulation of action space of the agent, and is trained with specific tasks in three kinds of maps to improve flexibility. The algorithm is integrated with GPU, which helps achieve high training efficiency and real-time performance by creating a neural network to collect pre-training data. Experimental results show that obstacle ability is confirmed. In comparison with current algorithm, training time reduces by 59.3% and efficiency rises by more than 71.7%. Meanwhile, performance of trained model in unknown environment is validated.

KW - deep reinforcement learning

KW - flexibility

KW - path planning

KW - real-time performance

KW - unmanned surface vehicle

UR - http://www.scopus.com/inward/record.url?scp=85091891243&partnerID=8YFLogxK

U2 - 10.1109/ICSIDP47821.2019.9173280

DO - 10.1109/ICSIDP47821.2019.9173280

M3 - Conference contribution

AN - SCOPUS:85091891243

T3 - ICSIDP 2019 - IEEE International Conference on Signal, Information and Data Processing 2019

BT - ICSIDP 2019 - IEEE International Conference on Signal, Information and Data Processing 2019

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2019 IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2019

Y2 - 11 December 2019 through 13 December 2019

ER -

Zhou Z, Zheng Y, Liu K, He X, Qu C. A Real-time Algorithm for USV Navigation Based on Deep Reinforcement Learning. In ICSIDP 2019 - IEEE International Conference on Signal, Information and Data Processing 2019. Institute of Electrical and Electronics Engineers Inc. 2019. 9173280. (ICSIDP 2019 - IEEE International Conference on Signal, Information and Data Processing 2019). doi: 10.1109/ICSIDP47821.2019.9173280
