TY - JOUR
T1 - 基于深度强化学习的未知环境下无人艇路径规划实时算法
AU - Zhou, Zhi Guo
AU - Zheng, Yi Peng
AU - Liu, Kai Yuan
AU - He, Xu
AU - Qu, Chong
N1 - Publisher Copyright:
© 2019, Editorial Department of Transaction of Beijing Institute of Technology. All right reserved.
PY - 2019/10
Y1 - 2019/10
N2 - For the Unmanned Surface Vehicle (USV) in unknown environment, the requirements of the adaptability and real-time are strongly demanding. To this end, this paper proposes a a path planning algorithm based on Deep Reinforcement Learning (DRL). For the request of plan-avoid-acclimate, on the basis of A3C, the proposed method optimizes net architecture, enriches navigation data and re-regulate the action space of the agent. Three kinds of maps are used for targeted training to improve the flexibility. By combining with the GPU platform, the pre-training data are collected with deep neural networks. In this way, the training efficiency is improved and the real-time requirement is guaranteed. Experimental results show that, in comparison with current methods, the training time reduces by 59.3% and the efficiency rises by more than 79.5%. Moreover, the performance of the trained model in unknown environment is effectively enhanced.
AB - For the Unmanned Surface Vehicle (USV) in unknown environment, the requirements of the adaptability and real-time are strongly demanding. To this end, this paper proposes a a path planning algorithm based on Deep Reinforcement Learning (DRL). For the request of plan-avoid-acclimate, on the basis of A3C, the proposed method optimizes net architecture, enriches navigation data and re-regulate the action space of the agent. Three kinds of maps are used for targeted training to improve the flexibility. By combining with the GPU platform, the pre-training data are collected with deep neural networks. In this way, the training efficiency is improved and the real-time requirement is guaranteed. Experimental results show that, in comparison with current methods, the training time reduces by 59.3% and the efficiency rises by more than 79.5%. Moreover, the performance of the trained model in unknown environment is effectively enhanced.
KW - Deep reinforcement learning
KW - Flexibility
KW - Path planning
KW - Real-time performance
KW - Unmanned surface vehicle
UR - http://www.scopus.com/inward/record.url?scp=85106354666&partnerID=8YFLogxK
M3 - 文章
AN - SCOPUS:85106354666
SN - 1001-0645
VL - 39
SP - 86
EP - 92
JO - Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology
JF - Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology
ER -