基于改进强化学习的移动机器人动态避障方法

Jianhua Xu; Kangkang Shao; Jiahui Wang; Xuecong Liu

doi:10.13695/j.cnki.12-1222/o3.2023.01.014

基于改进强化学习的移动机器人动态避障方法

Jianhua Xu, Kangkang Shao, Jiahui Wang, Xuecong Liu

自动化学院

Beijing Institute of Technology

科研成果: 期刊稿件 › 文章 › 同行评审

5 引用（Scopus）

摘要

Aiming to solve the problems of long planning trajectory, slow travel speed and poor robustness of mobile robot dynamic obstacle avoidance in unknown environment, a mobile robot dynamic obstacle avoidance method based on improved reinforcement learning is proposed. According to its own speed, target position and laser radar information, the mobile robot can directly obtain the action signal to achieve end-to-end control. Based on distance gradient guidance and angle gradient guidance, the mobile robot is optimized towards the end point and the convergence speed of the algorithm is accelerated. Combined with convolution neural network, high-quality features are extracted from multi-dimensional observation data to improve the effect of strategy training. The simulation results show that the training speed of the proposed method is increased by 40%, the track length is reduced by more than 2.69%, and the average line speed is increased by more than 11.87% in the multi-dynamic obstacle environment. Compared with the existing mainstream obstacle avoidance methods, the proposed method has the advantages of short planning trajectory, fast travel speed, stable performance and so on. It can realize the smooth obstacle avoidance of mobile robots in the multi-obstacles environment.

投稿的翻译标题	Mobile robot dynamic obstacle avoidance method based on improved reinforcement learning
源语言	繁体中文
页（从-至）	92-99
页数	8
期刊	Zhongguo Guanxing Jishu Xuebao/Journal of Chinese Inertial Technology
卷	31
期	1
DOI	https://doi.org/10.13695/j.cnki.12-1222/o3.2023.01.014
出版状态	已出版 - 1 1月 2023

关键词

convolutional neural network
dynamic obstacle avoidance
mobile robot
reinforcement learning
soft actor-critic

访问文件

10.13695/j.cnki.12-1222/o3.2023.01.014

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{fa7837686dc64da9b3d409314280ddc1,

title = "基于改进强化学习的移动机器人动态避障方法",

abstract = "Aiming to solve the problems of long planning trajectory, slow travel speed and poor robustness of mobile robot dynamic obstacle avoidance in unknown environment, a mobile robot dynamic obstacle avoidance method based on improved reinforcement learning is proposed. According to its own speed, target position and laser radar information, the mobile robot can directly obtain the action signal to achieve end-to-end control. Based on distance gradient guidance and angle gradient guidance, the mobile robot is optimized towards the end point and the convergence speed of the algorithm is accelerated. Combined with convolution neural network, high-quality features are extracted from multi-dimensional observation data to improve the effect of strategy training. The simulation results show that the training speed of the proposed method is increased by 40%, the track length is reduced by more than 2.69%, and the average line speed is increased by more than 11.87% in the multi-dynamic obstacle environment. Compared with the existing mainstream obstacle avoidance methods, the proposed method has the advantages of short planning trajectory, fast travel speed, stable performance and so on. It can realize the smooth obstacle avoidance of mobile robots in the multi-obstacles environment.",

keywords = "convolutional neural network, dynamic obstacle avoidance, mobile robot, reinforcement learning, soft actor-critic",

author = "Jianhua Xu and Kangkang Shao and Jiahui Wang and Xuecong Liu",

year = "2023",

month = jan,

day = "1",

doi = "10.13695/j.cnki.12-1222/o3.2023.01.014",

language = "繁体中文",

volume = "31",

pages = "92--99",

journal = "Zhongguo Guanxing Jishu Xuebao/Journal of Chinese Inertial Technology",

issn = "1005-6734",

publisher = "Editorial Department of Journal of Chinese Inertial Technology",

number = "1",

}

TY - JOUR

T1 - 基于改进强化学习的移动机器人动态避障方法

AU - Xu, Jianhua

AU - Shao, Kangkang

AU - Wang, Jiahui

AU - Liu, Xuecong

PY - 2023/1/1

Y1 - 2023/1/1

N2 - Aiming to solve the problems of long planning trajectory, slow travel speed and poor robustness of mobile robot dynamic obstacle avoidance in unknown environment, a mobile robot dynamic obstacle avoidance method based on improved reinforcement learning is proposed. According to its own speed, target position and laser radar information, the mobile robot can directly obtain the action signal to achieve end-to-end control. Based on distance gradient guidance and angle gradient guidance, the mobile robot is optimized towards the end point and the convergence speed of the algorithm is accelerated. Combined with convolution neural network, high-quality features are extracted from multi-dimensional observation data to improve the effect of strategy training. The simulation results show that the training speed of the proposed method is increased by 40%, the track length is reduced by more than 2.69%, and the average line speed is increased by more than 11.87% in the multi-dynamic obstacle environment. Compared with the existing mainstream obstacle avoidance methods, the proposed method has the advantages of short planning trajectory, fast travel speed, stable performance and so on. It can realize the smooth obstacle avoidance of mobile robots in the multi-obstacles environment.

AB - Aiming to solve the problems of long planning trajectory, slow travel speed and poor robustness of mobile robot dynamic obstacle avoidance in unknown environment, a mobile robot dynamic obstacle avoidance method based on improved reinforcement learning is proposed. According to its own speed, target position and laser radar information, the mobile robot can directly obtain the action signal to achieve end-to-end control. Based on distance gradient guidance and angle gradient guidance, the mobile robot is optimized towards the end point and the convergence speed of the algorithm is accelerated. Combined with convolution neural network, high-quality features are extracted from multi-dimensional observation data to improve the effect of strategy training. The simulation results show that the training speed of the proposed method is increased by 40%, the track length is reduced by more than 2.69%, and the average line speed is increased by more than 11.87% in the multi-dynamic obstacle environment. Compared with the existing mainstream obstacle avoidance methods, the proposed method has the advantages of short planning trajectory, fast travel speed, stable performance and so on. It can realize the smooth obstacle avoidance of mobile robots in the multi-obstacles environment.

KW - convolutional neural network

KW - dynamic obstacle avoidance

KW - mobile robot

KW - reinforcement learning

KW - soft actor-critic

UR - http://www.scopus.com/inward/record.url?scp=85150471556&partnerID=8YFLogxK

U2 - 10.13695/j.cnki.12-1222/o3.2023.01.014

DO - 10.13695/j.cnki.12-1222/o3.2023.01.014

M3 - 文章

AN - SCOPUS:85150471556

SN - 1005-6734

VL - 31

SP - 92

EP - 99

JO - Zhongguo Guanxing Jishu Xuebao/Journal of Chinese Inertial Technology

JF - Zhongguo Guanxing Jishu Xuebao/Journal of Chinese Inertial Technology

IS - 1

ER -

基于改进强化学习的移动机器人动态避障方法

摘要

关键词

访问文件

其它文件与链接

指纹

引用此