跳到主要导航 跳到搜索 跳到主要内容

Safety-Constrained Reinforcement Learning Method for Urban Eco-Driving

  • Beijing Institute of Technology

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Reinforcement Learning (RL) is used for eco-driving in Intelligent and Connected Vehicles (ICVs), but traditional methods face a critical trade-off. Ensuring safety often requires high penalty terms, leading to overly conservative policies that severely compromise energy and traffic efficiency. To resolve this conflict, this paper introduces the Twin Delayed Deep Deterministic Policy Gradient with Safelayer (TD3+Safelayer), a novel framework that decouples safety from performance objectives. The Safelayer module maps the TD3 policy's actions into a verifiably safe action space. SUMO simulations demonstrate that TD3+Safelayer eliminates all collision events present in the baseline models. While achieving this high level of safety, it also reduces energy consumption by 17.4% and 30.7% compared to traditional TD3 and rule-based strategies, respectively.

源语言英语
主期刊名2025 9th CAA International Conference on Vehicular Control and Intelligence, CVCI 2025
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9798331569068
DOI
出版状态已出版 - 2025
已对外发布
活动2025 9th CAA International Conference on Vehicular Control and Intelligence, CVCI 2025 - Qingdao, 中国
期限: 24 10月 202526 10月 2025

出版系列

姓名2025 9th CAA International Conference on Vehicular Control and Intelligence, CVCI 2025

会议

会议2025 9th CAA International Conference on Vehicular Control and Intelligence, CVCI 2025
国家/地区中国
Qingdao
时期24/10/2526/10/25

指纹

探究 'Safety-Constrained Reinforcement Learning Method for Urban Eco-Driving' 的科研主题。它们共同构成独一无二的指纹。

引用此