Research on Navigation Algorithm of Unmanned Ground Vehicle Based on Imitation Learning and Curiosity Driven

Shiqi Liu, Jiawei Chen, Bowen Zu, Xuehua Zhou, Zhiguo Zhou*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The application of deep reinforcement learning (DRL) for autonomous navigation of unmanned ground vehicle (UGV) has the problem of sparse rewards, which makes the trained algorithm model difficult to converge and cannot be transferred to real vehicles. In this regard, this paper proposes an effective exploratory learning autonomous navigation algorithm Double I-PPO, which designs pre-training behaviors based on imitation learning (IL) to guide UGV to try positive states, and introduces the intrinsic curiosity module (ICM) to generate intrinsic reward signals to encourage exploratory learning strategies. Build the training scene in Unity to evaluate the performance of the algorithm, and integrate the algorithm strategy into the motion planning stack of the ROS vehicle, so as to extend to the actual scene for testing. Experiments show that in the environment of random obstacles, the method does not need to rely on prior map information. Compared with similar DRL algorithms, the convergence speed is faster and the navigation success rate can reach more than 85%.

Original languageEnglish
Title of host publicationMethods and Applications for Modeling and Simulation of Complex Systems - 21st Asia Simulation Conference, AsiaSim 2022, Proceedings
EditorsWenhui Fan, Lin Zhang, Ni Li, Xiao Song
PublisherSpringer Science and Business Media Deutschland GmbH
Pages609-621
Number of pages13
ISBN (Print)9789811991974
DOIs
Publication statusPublished - 2022
Event21st Asia Simulation Conference, AsiaSim 2022 - Changsha, China
Duration: 9 Dec 202211 Dec 2022

Publication series

NameCommunications in Computer and Information Science
Volume1712 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference21st Asia Simulation Conference, AsiaSim 2022
Country/TerritoryChina
CityChangsha
Period9/12/2211/12/22

Keywords

  • Deep reinforcement learning
  • Navigation
  • ROS
  • Spare reward
  • Unity
  • Unmanned ground vehicle

Fingerprint

Dive into the research topics of 'Research on Navigation Algorithm of Unmanned Ground Vehicle Based on Imitation Learning and Curiosity Driven'. Together they form a unique fingerprint.

Cite this