TY - GEN
T1 - Research on Navigation Algorithm of Unmanned Ground Vehicle Based on Imitation Learning and Curiosity Driven
AU - Liu, Shiqi
AU - Chen, Jiawei
AU - Zu, Bowen
AU - Zhou, Xuehua
AU - Zhou, Zhiguo
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
PY - 2022
Y1 - 2022
N2 - The application of deep reinforcement learning (DRL) to autonomous navigation of unmanned ground vehicles (UGVs) suffers from sparse rewards, which makes the trained model difficult to converge and hard to transfer to real vehicles. To address this, this paper proposes Double I-PPO, an autonomous navigation algorithm with effective exploratory learning, which designs pre-training behaviors based on imitation learning (IL) to guide the UGV toward positive states and introduces an intrinsic curiosity module (ICM) that generates intrinsic reward signals to encourage exploratory learning strategies. A training scene is built in Unity to evaluate the performance of the algorithm, and the learned policy is integrated into the motion planning stack of a ROS vehicle to extend testing to real-world scenes. Experiments show that in environments with random obstacles the method does not rely on prior map information; compared with similar DRL algorithms, it converges faster and achieves a navigation success rate above 85%.
AB - The application of deep reinforcement learning (DRL) to autonomous navigation of unmanned ground vehicles (UGVs) suffers from sparse rewards, which makes the trained model difficult to converge and hard to transfer to real vehicles. To address this, this paper proposes Double I-PPO, an autonomous navigation algorithm with effective exploratory learning, which designs pre-training behaviors based on imitation learning (IL) to guide the UGV toward positive states and introduces an intrinsic curiosity module (ICM) that generates intrinsic reward signals to encourage exploratory learning strategies. A training scene is built in Unity to evaluate the performance of the algorithm, and the learned policy is integrated into the motion planning stack of a ROS vehicle to extend testing to real-world scenes. Experiments show that in environments with random obstacles the method does not rely on prior map information; compared with similar DRL algorithms, it converges faster and achieves a navigation success rate above 85%.
KW - Deep reinforcement learning
KW - Navigation
KW - ROS
KW - Sparse reward
KW - Unity
KW - Unmanned ground vehicle
UR - http://www.scopus.com/inward/record.url?scp=85146654232&partnerID=8YFLogxK
U2 - 10.1007/978-981-19-9198-1_46
DO - 10.1007/978-981-19-9198-1_46
M3 - Conference contribution
AN - SCOPUS:85146654232
SN - 9789811991974
T3 - Communications in Computer and Information Science
SP - 609
EP - 621
BT - Methods and Applications for Modeling and Simulation of Complex Systems - 21st Asia Simulation Conference, AsiaSim 2022, Proceedings
A2 - Fan, Wenhui
A2 - Zhang, Lin
A2 - Li, Ni
A2 - Song, Xiao
PB - Springer Science and Business Media Deutschland GmbH
T2 - 21st Asia Simulation Conference, AsiaSim 2022
Y2 - 9 December 2022 through 11 December 2022
ER -