Autonomous Vehicles Roundup Strategy by Reinforcement Learning with Prediction Trajectory

Jiayang Ni, Rubing Ma, Hua Zhong, Bo Wang

科研成果: 书/报告/会议事项章节会议稿件同行评审

2 引用 (Scopus)

摘要

Autonomous vehicles are increasingly applied on many situations, but their autonomous decision-making ability needs to be improved. Multi-Agent Deep Deterministic Policy Gradient(MADDPG) adopts the method of centralized evaluation and decentralized execution, so that the autonomous vehicle can obtain the whole-field status information and make decisions through the companion information. In the process of autonomous vehicle training, we introduce artificial potential field, action guidance and other methods to alleviate the problem of sparse rewards. At the same time, we add a repulsion function to consider the relationship between team vehicles. Extended Kalman Filter(EKF) is also applied to predict the autonomous vehicle trajectory, changing the training network state input information. At the same time, secondary correction of the predicted autonomous vehicle trajectory is made to change the prediction range with the training time, and improve the training convergence speed while the speed of opposite agents increases. Simulation experiments show that the convergence speed and win rate of MADDPG algorithm based on trajectory prediction and artificial potential field is significantly improved, and it also has strong adaptability to various task scenarios.

源语言英语
主期刊名Proceedings of the 41st Chinese Control Conference, CCC 2022
编辑Zhijun Li, Jian Sun
出版商IEEE Computer Society
3370-3375
页数6
ISBN(电子版)9789887581536
DOI
出版状态已出版 - 2022
活动41st Chinese Control Conference, CCC 2022 - Hefei, 中国
期限: 25 7月 202227 7月 2022

出版系列

姓名Chinese Control Conference, CCC
2022-July
ISSN(印刷版)1934-1768
ISSN(电子版)2161-2927

会议

会议41st Chinese Control Conference, CCC 2022
国家/地区中国
Hefei
时期25/07/2227/07/22

指纹

探究 'Autonomous Vehicles Roundup Strategy by Reinforcement Learning with Prediction Trajectory' 的科研主题。它们共同构成独一无二的指纹。

引用此