Research on Navigation Algorithm of Unmanned Ground Vehicle Based on Imitation Learning and Curiosity Driven

Shiqi Liu; Jiawei Chen; Bowen Zu; Xuehua Zhou; Zhiguo Zhou

doi:10.1007/978-981-19-9198-1_46

Research on Navigation Algorithm of Unmanned Ground Vehicle Based on Imitation Learning and Curiosity Driven

Shiqi Liu, Jiawei Chen, Bowen Zu, Xuehua Zhou, Zhiguo Zhou^*

^*Corresponding author for this work

School of Integrated Circuits and Electronics

Beijing Institute of Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

The application of deep reinforcement learning (DRL) for autonomous navigation of unmanned ground vehicle (UGV) has the problem of sparse rewards, which makes the trained algorithm model difficult to converge and cannot be transferred to real vehicles. In this regard, this paper proposes an effective exploratory learning autonomous navigation algorithm Double I-PPO, which designs pre-training behaviors based on imitation learning (IL) to guide UGV to try positive states, and introduces the intrinsic curiosity module (ICM) to generate intrinsic reward signals to encourage exploratory learning strategies. Build the training scene in Unity to evaluate the performance of the algorithm, and integrate the algorithm strategy into the motion planning stack of the ROS vehicle, so as to extend to the actual scene for testing. Experiments show that in the environment of random obstacles, the method does not need to rely on prior map information. Compared with similar DRL algorithms, the convergence speed is faster and the navigation success rate can reach more than 85%.

Original language	English
Title of host publication	Methods and Applications for Modeling and Simulation of Complex Systems - 21st Asia Simulation Conference, AsiaSim 2022, Proceedings
Editors	Wenhui Fan, Lin Zhang, Ni Li, Xiao Song
Publisher	Springer Science and Business Media Deutschland GmbH
Pages	609-621
Number of pages	13
ISBN (Print)	9789811991974
DOIs	https://doi.org/10.1007/978-981-19-9198-1_46
Publication status	Published - 2022
Event	21st Asia Simulation Conference, AsiaSim 2022 - Changsha, China Duration: 9 Dec 2022 → 11 Dec 2022

Publication series

Name	Communications in Computer and Information Science
Volume	1712 CCIS
ISSN (Print)	1865-0929
ISSN (Electronic)	1865-0937

Conference

Conference	21st Asia Simulation Conference, AsiaSim 2022
Country/Territory	China
City	Changsha
Period	9/12/22 → 11/12/22

Keywords

Deep reinforcement learning
Navigation
ROS
Spare reward
Unity
Unmanned ground vehicle

Access to Document

10.1007/978-981-19-9198-1_46

Cite this

Liu, S., Chen, J., Zu, B., Zhou, X., & Zhou, Z. (2022). Research on Navigation Algorithm of Unmanned Ground Vehicle Based on Imitation Learning and Curiosity Driven. In W. Fan, L. Zhang, N. Li, & X. Song (Eds.), Methods and Applications for Modeling and Simulation of Complex Systems - 21st Asia Simulation Conference, AsiaSim 2022, Proceedings (pp. 609-621). (Communications in Computer and Information Science; Vol. 1712 CCIS). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-981-19-9198-1_46

Liu, Shiqi ; Chen, Jiawei ; Zu, Bowen et al. / Research on Navigation Algorithm of Unmanned Ground Vehicle Based on Imitation Learning and Curiosity Driven. Methods and Applications for Modeling and Simulation of Complex Systems - 21st Asia Simulation Conference, AsiaSim 2022, Proceedings. editor / Wenhui Fan ; Lin Zhang ; Ni Li ; Xiao Song. Springer Science and Business Media Deutschland GmbH, 2022. pp. 609-621 (Communications in Computer and Information Science).

@inproceedings{14adaf0733664318959922add02f3d04,

title = "Research on Navigation Algorithm of Unmanned Ground Vehicle Based on Imitation Learning and Curiosity Driven",

abstract = "The application of deep reinforcement learning (DRL) for autonomous navigation of unmanned ground vehicle (UGV) has the problem of sparse rewards, which makes the trained algorithm model difficult to converge and cannot be transferred to real vehicles. In this regard, this paper proposes an effective exploratory learning autonomous navigation algorithm Double I-PPO, which designs pre-training behaviors based on imitation learning (IL) to guide UGV to try positive states, and introduces the intrinsic curiosity module (ICM) to generate intrinsic reward signals to encourage exploratory learning strategies. Build the training scene in Unity to evaluate the performance of the algorithm, and integrate the algorithm strategy into the motion planning stack of the ROS vehicle, so as to extend to the actual scene for testing. Experiments show that in the environment of random obstacles, the method does not need to rely on prior map information. Compared with similar DRL algorithms, the convergence speed is faster and the navigation success rate can reach more than 85%.",

keywords = "Deep reinforcement learning, Navigation, ROS, Spare reward, Unity, Unmanned ground vehicle",

author = "Shiqi Liu and Jiawei Chen and Bowen Zu and Xuehua Zhou and Zhiguo Zhou",

note = "Publisher Copyright: {\textcopyright} 2022, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.; 21st Asia Simulation Conference, AsiaSim 2022 ; Conference date: 09-12-2022 Through 11-12-2022",

year = "2022",

doi = "10.1007/978-981-19-9198-1_46",

language = "English",

isbn = "9789811991974",

series = "Communications in Computer and Information Science",

publisher = "Springer Science and Business Media Deutschland GmbH",

pages = "609--621",

editor = "Wenhui Fan and Lin Zhang and Ni Li and Xiao Song",

booktitle = "Methods and Applications for Modeling and Simulation of Complex Systems - 21st Asia Simulation Conference, AsiaSim 2022, Proceedings",

address = "Germany",

}

Liu, S, Chen, J, Zu, B, Zhou, X & Zhou, Z 2022, Research on Navigation Algorithm of Unmanned Ground Vehicle Based on Imitation Learning and Curiosity Driven. in W Fan, L Zhang, N Li & X Song (eds), Methods and Applications for Modeling and Simulation of Complex Systems - 21st Asia Simulation Conference, AsiaSim 2022, Proceedings. Communications in Computer and Information Science, vol. 1712 CCIS, Springer Science and Business Media Deutschland GmbH, pp. 609-621, 21st Asia Simulation Conference, AsiaSim 2022, Changsha, China, 9/12/22. https://doi.org/10.1007/978-981-19-9198-1_46

Research on Navigation Algorithm of Unmanned Ground Vehicle Based on Imitation Learning and Curiosity Driven. / Liu, Shiqi; Chen, Jiawei; Zu, Bowen et al.
Methods and Applications for Modeling and Simulation of Complex Systems - 21st Asia Simulation Conference, AsiaSim 2022, Proceedings. ed. / Wenhui Fan; Lin Zhang; Ni Li; Xiao Song. Springer Science and Business Media Deutschland GmbH, 2022. p. 609-621 (Communications in Computer and Information Science; Vol. 1712 CCIS).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Research on Navigation Algorithm of Unmanned Ground Vehicle Based on Imitation Learning and Curiosity Driven

AU - Liu, Shiqi

AU - Chen, Jiawei

AU - Zu, Bowen

AU - Zhou, Xuehua

AU - Zhou, Zhiguo

PY - 2022

Y1 - 2022

N2 - The application of deep reinforcement learning (DRL) for autonomous navigation of unmanned ground vehicle (UGV) has the problem of sparse rewards, which makes the trained algorithm model difficult to converge and cannot be transferred to real vehicles. In this regard, this paper proposes an effective exploratory learning autonomous navigation algorithm Double I-PPO, which designs pre-training behaviors based on imitation learning (IL) to guide UGV to try positive states, and introduces the intrinsic curiosity module (ICM) to generate intrinsic reward signals to encourage exploratory learning strategies. Build the training scene in Unity to evaluate the performance of the algorithm, and integrate the algorithm strategy into the motion planning stack of the ROS vehicle, so as to extend to the actual scene for testing. Experiments show that in the environment of random obstacles, the method does not need to rely on prior map information. Compared with similar DRL algorithms, the convergence speed is faster and the navigation success rate can reach more than 85%.

AB - The application of deep reinforcement learning (DRL) for autonomous navigation of unmanned ground vehicle (UGV) has the problem of sparse rewards, which makes the trained algorithm model difficult to converge and cannot be transferred to real vehicles. In this regard, this paper proposes an effective exploratory learning autonomous navigation algorithm Double I-PPO, which designs pre-training behaviors based on imitation learning (IL) to guide UGV to try positive states, and introduces the intrinsic curiosity module (ICM) to generate intrinsic reward signals to encourage exploratory learning strategies. Build the training scene in Unity to evaluate the performance of the algorithm, and integrate the algorithm strategy into the motion planning stack of the ROS vehicle, so as to extend to the actual scene for testing. Experiments show that in the environment of random obstacles, the method does not need to rely on prior map information. Compared with similar DRL algorithms, the convergence speed is faster and the navigation success rate can reach more than 85%.

KW - Deep reinforcement learning

KW - Navigation

KW - ROS

KW - Spare reward

KW - Unity

KW - Unmanned ground vehicle

UR - http://www.scopus.com/inward/record.url?scp=85146654232&partnerID=8YFLogxK

U2 - 10.1007/978-981-19-9198-1_46

DO - 10.1007/978-981-19-9198-1_46

M3 - Conference contribution

AN - SCOPUS:85146654232

SN - 9789811991974

T3 - Communications in Computer and Information Science

SP - 609

EP - 621

BT - Methods and Applications for Modeling and Simulation of Complex Systems - 21st Asia Simulation Conference, AsiaSim 2022, Proceedings

A2 - Fan, Wenhui

A2 - Zhang, Lin

A2 - Li, Ni

A2 - Song, Xiao

PB - Springer Science and Business Media Deutschland GmbH

T2 - 21st Asia Simulation Conference, AsiaSim 2022

Y2 - 9 December 2022 through 11 December 2022

ER -

Liu S, Chen J, Zu B, Zhou X, Zhou Z. Research on Navigation Algorithm of Unmanned Ground Vehicle Based on Imitation Learning and Curiosity Driven. In Fan W, Zhang L, Li N, Song X, editors, Methods and Applications for Modeling and Simulation of Complex Systems - 21st Asia Simulation Conference, AsiaSim 2022, Proceedings. Springer Science and Business Media Deutschland GmbH. 2022. p. 609-621. (Communications in Computer and Information Science). doi: 10.1007/978-981-19-9198-1_46

Research on Navigation Algorithm of Unmanned Ground Vehicle Based on Imitation Learning and Curiosity Driven

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this