Skip to main navigation Skip to search Skip to main content

Safety-Constrained Reinforcement Learning Method for Urban Eco-Driving

  • Beijing Institute of Technology

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Reinforcement Learning (RL) is used for eco-driving in Intelligent and Connected Vehicles (ICVs), but traditional methods face a critical trade-off. Ensuring safety often requires high penalty terms, leading to overly conservative policies that severely compromise energy and traffic efficiency. To resolve this conflict, this paper introduces the Twin Delayed Deep Deterministic Policy Gradient with Safelayer (TD3+Safelayer), a novel framework that decouples safety from performance objectives. The Safelayer module maps the TD3 policy's actions into a verifiably safe action space. SUMO simulations demonstrate that TD3+Safelayer eliminates all collision events present in the baseline models. While achieving this high level of safety, it also reduces energy consumption by 17.4% and 30.7% compared to traditional TD3 and rule-based strategies, respectively.

Original languageEnglish
Title of host publication2025 9th CAA International Conference on Vehicular Control and Intelligence, CVCI 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331569068
DOIs
Publication statusPublished - 2025
Externally publishedYes
Event2025 9th CAA International Conference on Vehicular Control and Intelligence, CVCI 2025 - Qingdao, China
Duration: 24 Oct 202526 Oct 2025

Publication series

Name2025 9th CAA International Conference on Vehicular Control and Intelligence, CVCI 2025

Conference

Conference2025 9th CAA International Conference on Vehicular Control and Intelligence, CVCI 2025
Country/TerritoryChina
CityQingdao
Period24/10/2526/10/25

Keywords

  • Eco-driving
  • Safe Reinforcement Learning
  • Safelayer
  • Urban Driving

Fingerprint

Dive into the research topics of 'Safety-Constrained Reinforcement Learning Method for Urban Eco-Driving'. Together they form a unique fingerprint.

Cite this