TY - JOUR
T1 - Merging planning in dense traffic scenarios using interactive safe reinforcement learning
AU - Hou, Xiaohui
AU - Gan, Minggang
AU - Wu, Wei
AU - Wang, Chenyu
AU - Ji, Yuan
AU - Zhao, Shiyue
N1 - Publisher Copyright:
© 2024
PY - 2024/4/22
Y1 - 2024/4/22
N2 - Autonomous navigation in dense traffic scenarios, such as on-ramp forced merging, still poses significant challenges for autonomous vehicles to prevent accidents and alleviate traffic congestion. This paper introduces a novel motion planning framework that combines Interactive Safe Reinforcement Learning (IntSRL) with Nonlinear Model Predictive Control (NMPC). This framework develops an interactive merging planning policy that accounts for the uncertainty of traffic participants, multi-objective optimization and heterogeneous vehicle interactions, in which the upper planner, i.e., IntSRL, furnishes the lower planner, NMPC, with global guidance path and velocity guidance. An Adaptive Safety Governor (ASG) module within IntSRL adjusts potentially unsafe actions by incorporating prior knowledge and driving experience. And a coupling evaluation mechanism for multi-objective optimization is embedded into reward shaping with integration of driving safety and strategy efficiency. We evaluate the proposed controller on various dense traffic scenarios using the proposed Heterogeneous Intelligent Driver Model (H-IDM) considering different driving styles and cooperative willingness of other vehicles. The test results indicate that the proposed method surpasses existing optimization-based and learning-based baselines in qualitative and quantitative measures.
AB - Autonomous navigation in dense traffic scenarios, such as on-ramp forced merging, still poses significant challenges for autonomous vehicles to prevent accidents and alleviate traffic congestion. This paper introduces a novel motion planning framework that combines Interactive Safe Reinforcement Learning (IntSRL) with Nonlinear Model Predictive Control (NMPC). This framework develops an interactive merging planning policy that accounts for the uncertainty of traffic participants, multi-objective optimization and heterogeneous vehicle interactions, in which the upper planner, i.e., IntSRL, furnishes the lower planner, NMPC, with global guidance path and velocity guidance. An Adaptive Safety Governor (ASG) module within IntSRL adjusts potentially unsafe actions by incorporating prior knowledge and driving experience. And a coupling evaluation mechanism for multi-objective optimization is embedded into reward shaping with integration of driving safety and strategy efficiency. We evaluate the proposed controller on various dense traffic scenarios using the proposed Heterogeneous Intelligent Driver Model (H-IDM) considering different driving styles and cooperative willingness of other vehicles. The test results indicate that the proposed method surpasses existing optimization-based and learning-based baselines in qualitative and quantitative measures.
KW - Autonomous driving
KW - Dense traffic scenario
KW - Motion planning
KW - Reinforcement learning
KW - Safety constraint
UR - http://www.scopus.com/inward/record.url?scp=85186531457&partnerID=8YFLogxK
U2 - 10.1016/j.knosys.2024.111548
DO - 10.1016/j.knosys.2024.111548
M3 - Article
AN - SCOPUS:85186531457
SN - 0950-7051
VL - 290
JO - Knowledge-Based Systems
JF - Knowledge-Based Systems
M1 - 111548
ER -