TY - GEN
T1 - Target-Driven Mapless Navigation for Self-Driving Car
AU - Wen, Mingxing
AU - He, Feiyu
AU - Yue, Yufeng
AU - Zhang, Jun
AU - Zhu, Hongrun
AU - Wang, Danwei
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - Self-driving cars have gained a lot of research interest in both academia and industry. However, the current solutions mainly rely on either human pre-defined rules or a precise high-resolution map, which are not feasible for the unknown environments, especially when there are some extreme situations not described in the driving rules. In this paper, a new reinforcement learning based method is proposed to address these issues. First, a pre-trained VAE (Variational AutoEncoder) is used to extract representative features from road images, then PPO (Proximal Policy Optimization) algorithm is implemented to learn target-driven navigation for the self-driving car to eliminate the dependence on the map and predefined rules. Second, to improve the learning efficiency, human driving experiences are introduced and how to effectively incorporate human driving experiences into reinforcement learning is also investigated. To evaluate the performance, this algorithm is implemented and deployed in CARLA simulation environments and extensive experiments have been conducted to select the effective strategy of reusing driving experiences. The results prove that our algorithm can successfully navigate in the urban environment without a map or any predefined rules. And by integrating human driving experiences, the learning efficiency has been dramatically improved, especially when using Ratio strategy.
AB - Self-driving cars have gained a lot of research interest in both academia and industry. However, the current solutions mainly rely on either human pre-defined rules or a precise high-resolution map, which are not feasible for the unknown environments, especially when there are some extreme situations not described in the driving rules. In this paper, a new reinforcement learning based method is proposed to address these issues. First, a pre-trained VAE (Variational AutoEncoder) is used to extract representative features from road images, then PPO (Proximal Policy Optimization) algorithm is implemented to learn target-driven navigation for the self-driving car to eliminate the dependence on the map and predefined rules. Second, to improve the learning efficiency, human driving experiences are introduced and how to effectively incorporate human driving experiences into reinforcement learning is also investigated. To evaluate the performance, this algorithm is implemented and deployed in CARLA simulation environments and extensive experiments have been conducted to select the effective strategy of reusing driving experiences. The results prove that our algorithm can successfully navigate in the urban environment without a map or any predefined rules. And by integrating human driving experiences, the learning efficiency has been dramatically improved, especially when using Ratio strategy.
UR - http://www.scopus.com/inward/record.url?scp=85124136768&partnerID=8YFLogxK
U2 - 10.1109/ICUS52573.2021.9641134
DO - 10.1109/ICUS52573.2021.9641134
M3 - Conference contribution
AN - SCOPUS:85124136768
T3 - Proceedings of 2021 IEEE International Conference on Unmanned Systems, ICUS 2021
SP - 505
EP - 511
BT - Proceedings of 2021 IEEE International Conference on Unmanned Systems, ICUS 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 IEEE International Conference on Unmanned Systems, ICUS 2021
Y2 - 15 October 2021 through 17 October 2021
ER -