智能网联汽车基于逆强化学习的轨迹规划优化机制研究

Haonan Peng, Minghuan Tang, Qiwen Zha, Cong Wang*, Weida Wang

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

5 引用 (Scopus)

摘要

Trajectory planning is one of the most significant technologies of autonomous connected vehicles. However, there are some problems in existing trajectory planning strategy, for example, weak real-time ability, difficult to calibrate weighting coefficients of optimization objectives and the poor interpretability for direct imitation learning method in the trajectory planning strategy. Therefore, an inverse reinforcement learning (IRL) method was proposed based on the maximum entropy principle in this paper. Learning the underlying optimization mechanism of driving trajectories from experienced drivers, the planning of lane-changing expert trajectories was achieved aligning with the human driving experience, laying a theoretical foundation for solving the real-time and interpretability problems of trajectory planning methods. Finally, taking general risk scenarios and high-risk scenarios as application cases respectively, the feasibility and effectiveness of the proposed trajectory planning method were validated through Matlab/Simulink simulations.

投稿的翻译标题Research on Inverse Reinforcement Learning-Based Trajectory Planning Optimization Mechanism for Autonomous Connected Vehicles
源语言繁体中文
页(从-至)820-831
页数12
期刊Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology
43
8
DOI
出版状态已出版 - 8月 2023

关键词

  • autonomous connected vehicles
  • inverse reinforcement learning (IRL) method
  • maximum entropy principle
  • trajectory planning

指纹

探究 '智能网联汽车基于逆强化学习的轨迹规划优化机制研究' 的科研主题。它们共同构成独一无二的指纹。

引用此