TY - JOUR
T1 - 基于改进强化学习的移动机器人动态避障方法
AU - Xu, Jianhua
AU - Shao, Kangkang
AU - Wang, Jiahui
AU - Liu, Xuecong
N1 - Publisher Copyright:
© 2023 Editorial Department of Journal of Chinese Inertial Technology. All rights reserved.
PY - 2023/1/1
Y1 - 2023/1/1
N2 - Aiming to solve the problems of long planning trajectory, slow travel speed and poor robustness of mobile robot dynamic obstacle avoidance in unknown environment, a mobile robot dynamic obstacle avoidance method based on improved reinforcement learning is proposed. According to its own speed, target position and laser radar information, the mobile robot can directly obtain the action signal to achieve end-to-end control. Based on distance gradient guidance and angle gradient guidance, the mobile robot is optimized towards the end point and the convergence speed of the algorithm is accelerated. Combined with convolution neural network, high-quality features are extracted from multi-dimensional observation data to improve the effect of strategy training. The simulation results show that the training speed of the proposed method is increased by 40%, the track length is reduced by more than 2.69%, and the average line speed is increased by more than 11.87% in the multi-dynamic obstacle environment. Compared with the existing mainstream obstacle avoidance methods, the proposed method has the advantages of short planning trajectory, fast travel speed, stable performance and so on. It can realize the smooth obstacle avoidance of mobile robots in the multi-obstacles environment.
AB - Aiming to solve the problems of long planning trajectory, slow travel speed and poor robustness of mobile robot dynamic obstacle avoidance in unknown environment, a mobile robot dynamic obstacle avoidance method based on improved reinforcement learning is proposed. According to its own speed, target position and laser radar information, the mobile robot can directly obtain the action signal to achieve end-to-end control. Based on distance gradient guidance and angle gradient guidance, the mobile robot is optimized towards the end point and the convergence speed of the algorithm is accelerated. Combined with convolution neural network, high-quality features are extracted from multi-dimensional observation data to improve the effect of strategy training. The simulation results show that the training speed of the proposed method is increased by 40%, the track length is reduced by more than 2.69%, and the average line speed is increased by more than 11.87% in the multi-dynamic obstacle environment. Compared with the existing mainstream obstacle avoidance methods, the proposed method has the advantages of short planning trajectory, fast travel speed, stable performance and so on. It can realize the smooth obstacle avoidance of mobile robots in the multi-obstacles environment.
KW - convolutional neural network
KW - dynamic obstacle avoidance
KW - mobile robot
KW - reinforcement learning
KW - soft actor-critic
UR - http://www.scopus.com/inward/record.url?scp=85150471556&partnerID=8YFLogxK
U2 - 10.13695/j.cnki.12-1222/o3.2023.01.014
DO - 10.13695/j.cnki.12-1222/o3.2023.01.014
M3 - 文章
AN - SCOPUS:85150471556
SN - 1005-6734
VL - 31
SP - 92
EP - 99
JO - Zhongguo Guanxing Jishu Xuebao/Journal of Chinese Inertial Technology
JF - Zhongguo Guanxing Jishu Xuebao/Journal of Chinese Inertial Technology
IS - 1
ER -