Online imitation learning for self-driving simulation

Zhe Zhang, Sanyuan Zhao

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

The end-to-end autonomous driving policy has made great progress with the development of deep learning. The current methods are mainly divided into imitation learning and reinforcement learning. The method of imitation learning can quickly realize the one-to-one correspondence between states and actions, but is limited by the dataset and is prone to overfitting. Therefore, the current methods mainly focus on extracting more robust input state features and proposing a more generalized dataset. Reinforcement learning methods can obtain richer input states due to online training, but at the same time requires longer training time, so current methods mainly focus on reducing training time and designing appropriate rewards. In this paper, we propose an end-to-end temporal convolution model based on segmentation medium, which uses online imitation learning to obtain richer input states, train more robust policy networks. At the same time, to reduce the training time, we use our own designed segmentation medium to replace the raw sensor information as the input of the policy network. Experiments on the CARLA driving benchmarks show that our approach achieves satisfactory results and has excellent generalization ability.

源语言英语
主期刊名ICCSE 2021 - IEEE 16th International Conference on Computer Science and Education
出版商Institute of Electrical and Electronics Engineers Inc.
810-815
页数6
ISBN(电子版)9781665414685
DOI
出版状态已出版 - 17 8月 2021
活动16th IEEE International Conference on Computer Science and Education, ICCSE 2021 - Lancaster, 英国
期限: 17 8月 202121 8月 2021

出版系列

姓名ICCSE 2021 - IEEE 16th International Conference on Computer Science and Education

会议

会议16th IEEE International Conference on Computer Science and Education, ICCSE 2021
国家/地区英国
Lancaster
时期17/08/2121/08/21

指纹

探究 'Online imitation learning for self-driving simulation' 的科研主题。它们共同构成独一无二的指纹。

引用此