Online imitation learning for self-driving simulation

Zhe Zhang; Sanyuan Zhao

doi:10.1109/ICCSE51940.2021.9569543

Online imitation learning for self-driving simulation

Zhe Zhang, Sanyuan Zhao

计算机学院

Beijing Institute of Technology

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

摘要

The end-to-end autonomous driving policy has made great progress with the development of deep learning. The current methods are mainly divided into imitation learning and reinforcement learning. The method of imitation learning can quickly realize the one-to-one correspondence between states and actions, but is limited by the dataset and is prone to overfitting. Therefore, the current methods mainly focus on extracting more robust input state features and proposing a more generalized dataset. Reinforcement learning methods can obtain richer input states due to online training, but at the same time requires longer training time, so current methods mainly focus on reducing training time and designing appropriate rewards. In this paper, we propose an end-to-end temporal convolution model based on segmentation medium, which uses online imitation learning to obtain richer input states, train more robust policy networks. At the same time, to reduce the training time, we use our own designed segmentation medium to replace the raw sensor information as the input of the policy network. Experiments on the CARLA driving benchmarks show that our approach achieves satisfactory results and has excellent generalization ability.

源语言	英语
主期刊名	ICCSE 2021 - IEEE 16th International Conference on Computer Science and Education
出版商	Institute of Electrical and Electronics Engineers Inc.
页	810-815
页数	6
ISBN（电子版）	9781665414685
DOI	https://doi.org/10.1109/ICCSE51940.2021.9569543
出版状态	已出版 - 17 8月 2021
活动	16th IEEE International Conference on Computer Science and Education, ICCSE 2021 - Lancaster, 英国期限: 17 8月 2021 → 21 8月 2021

出版系列

姓名	ICCSE 2021 - IEEE 16th International Conference on Computer Science and Education

会议

会议	16th IEEE International Conference on Computer Science and Education, ICCSE 2021
国家/地区	英国
市	Lancaster
时期	17/08/21 → 21/08/21

访问文件

10.1109/ICCSE51940.2021.9569543

其它文件与链接

链接到 Scopus 的出版物

引用此

Zhang, Z., & Zhao, S. (2021). Online imitation learning for self-driving simulation. 在 ICCSE 2021 - IEEE 16th International Conference on Computer Science and Education (页码 810-815). (ICCSE 2021 - IEEE 16th International Conference on Computer Science and Education). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICCSE51940.2021.9569543

@inproceedings{71b69df029d448cd90bf7fae473a06e9,

title = "Online imitation learning for self-driving simulation",

abstract = "The end-to-end autonomous driving policy has made great progress with the development of deep learning. The current methods are mainly divided into imitation learning and reinforcement learning. The method of imitation learning can quickly realize the one-to-one correspondence between states and actions, but is limited by the dataset and is prone to overfitting. Therefore, the current methods mainly focus on extracting more robust input state features and proposing a more generalized dataset. Reinforcement learning methods can obtain richer input states due to online training, but at the same time requires longer training time, so current methods mainly focus on reducing training time and designing appropriate rewards. In this paper, we propose an end-to-end temporal convolution model based on segmentation medium, which uses online imitation learning to obtain richer input states, train more robust policy networks. At the same time, to reduce the training time, we use our own designed segmentation medium to replace the raw sensor information as the input of the policy network. Experiments on the CARLA driving benchmarks show that our approach achieves satisfactory results and has excellent generalization ability.",

keywords = "Autonomous driving, Online imitation learning, Segmentation medium",

author = "Zhe Zhang and Sanyuan Zhao",

note = "Publisher Copyright: {\textcopyright} 2021 IEEE.; 16th IEEE International Conference on Computer Science and Education, ICCSE 2021 ; Conference date: 17-08-2021 Through 21-08-2021",

year = "2021",

month = aug,

day = "17",

doi = "10.1109/ICCSE51940.2021.9569543",

language = "English",

series = "ICCSE 2021 - IEEE 16th International Conference on Computer Science and Education",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "810--815",

booktitle = "ICCSE 2021 - IEEE 16th International Conference on Computer Science and Education",

address = "United States",

}

Zhang, Z & Zhao, S 2021, Online imitation learning for self-driving simulation. 在 ICCSE 2021 - IEEE 16th International Conference on Computer Science and Education. ICCSE 2021 - IEEE 16th International Conference on Computer Science and Education, Institute of Electrical and Electronics Engineers Inc., 页码 810-815, 16th IEEE International Conference on Computer Science and Education, ICCSE 2021, Lancaster, 英国, 17/08/21. https://doi.org/10.1109/ICCSE51940.2021.9569543

Online imitation learning for self-driving simulation. / Zhang, Zhe; Zhao, Sanyuan.
ICCSE 2021 - IEEE 16th International Conference on Computer Science and Education. Institute of Electrical and Electronics Engineers Inc., 2021. 页码 810-815 (ICCSE 2021 - IEEE 16th International Conference on Computer Science and Education).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - Online imitation learning for self-driving simulation

AU - Zhang, Zhe

AU - Zhao, Sanyuan

PY - 2021/8/17

Y1 - 2021/8/17

N2 - The end-to-end autonomous driving policy has made great progress with the development of deep learning. The current methods are mainly divided into imitation learning and reinforcement learning. The method of imitation learning can quickly realize the one-to-one correspondence between states and actions, but is limited by the dataset and is prone to overfitting. Therefore, the current methods mainly focus on extracting more robust input state features and proposing a more generalized dataset. Reinforcement learning methods can obtain richer input states due to online training, but at the same time requires longer training time, so current methods mainly focus on reducing training time and designing appropriate rewards. In this paper, we propose an end-to-end temporal convolution model based on segmentation medium, which uses online imitation learning to obtain richer input states, train more robust policy networks. At the same time, to reduce the training time, we use our own designed segmentation medium to replace the raw sensor information as the input of the policy network. Experiments on the CARLA driving benchmarks show that our approach achieves satisfactory results and has excellent generalization ability.

AB - The end-to-end autonomous driving policy has made great progress with the development of deep learning. The current methods are mainly divided into imitation learning and reinforcement learning. The method of imitation learning can quickly realize the one-to-one correspondence between states and actions, but is limited by the dataset and is prone to overfitting. Therefore, the current methods mainly focus on extracting more robust input state features and proposing a more generalized dataset. Reinforcement learning methods can obtain richer input states due to online training, but at the same time requires longer training time, so current methods mainly focus on reducing training time and designing appropriate rewards. In this paper, we propose an end-to-end temporal convolution model based on segmentation medium, which uses online imitation learning to obtain richer input states, train more robust policy networks. At the same time, to reduce the training time, we use our own designed segmentation medium to replace the raw sensor information as the input of the policy network. Experiments on the CARLA driving benchmarks show that our approach achieves satisfactory results and has excellent generalization ability.

KW - Autonomous driving

KW - Online imitation learning

KW - Segmentation medium

UR - http://www.scopus.com/inward/record.url?scp=85118954584&partnerID=8YFLogxK

U2 - 10.1109/ICCSE51940.2021.9569543

DO - 10.1109/ICCSE51940.2021.9569543

M3 - Conference contribution

AN - SCOPUS:85118954584

T3 - ICCSE 2021 - IEEE 16th International Conference on Computer Science and Education

SP - 810

EP - 815

BT - ICCSE 2021 - IEEE 16th International Conference on Computer Science and Education

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 16th IEEE International Conference on Computer Science and Education, ICCSE 2021

Y2 - 17 August 2021 through 21 August 2021

ER -

Online imitation learning for self-driving simulation

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此