Risk-Aware Multi-Agent Safe Reinforcement Learning for Autonomous Vehicle On-Ramp Merging

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Ensuring safety while achieving better efficiency remain significant challenges for autonomous vehicles. Most current research treats safety metrics as components at the same level as efficiency, environmental friendliness, and comfort in the reward function, overlooking the fact that in driving behavior, safety should be a constraint that takes precedence over other components. Additionally, driving risk prediction is rarely considered, resulting in either overly aggressive or excessively conservative policies. We model on-ramp merging in mixed traffic as a multi-agent cooperative framework and propose a reinforcement learning algorithm based on primal-dual optimization. The technique features an upper-level decision system formulated as a partially observable constrained Markov decision process that incorporates risk prediction and lane-changing efficiency, complemented by a lower-level knowledge-enhanced risk prediction system. An action mask module is introduced to filter out unsafe and invalid actions. Simulation experiments demonstrate that this approach significantly reduces unsafe behaviors without compromising traffic efficiency, while the combination of risk prediction and action masking also reduces unsafe behaviors during training, offering a promising solution for autonomous driving sim-to-real applications.

Original languageEnglish
Title of host publicationIntelligent Transportation Engineering - Proceedings of the 10th International Conference on Intelligent Transportation Engineering, ICITE 2025
EditorsYanyan Chen
PublisherIOS Press BV
Pages73-85
Number of pages13
ISBN (Electronic)9781643686400
DOIs
Publication statusPublished - 8 Jan 2026
Event10th International Conference on Intelligent Transportation Engineering, ICITE 2025 - Beijing, China
Duration: 24 Oct 202526 Oct 2025

Publication series

NameAdvances in Transdisciplinary Engineering
Volume84
ISSN (Print)2352-751X
ISSN (Electronic)2352-7528

Conference

Conference10th International Conference on Intelligent Transportation Engineering, ICITE 2025
Country/TerritoryChina
CityBeijing
Period24/10/2526/10/25

Keywords

  • Action Masking
  • Autonomous driving
  • Constrained Reinforcement Learning
  • On-ramp Merging
  • Risk Prediction

Fingerprint

Dive into the research topics of 'Risk-Aware Multi-Agent Safe Reinforcement Learning for Autonomous Vehicle On-Ramp Merging'. Together they form a unique fingerprint.

Cite this