TY - JOUR
T1 - Ecological Driving Framework of Hybrid Electric Vehicle Based on Heterogeneous Multi-Agent Deep Reinforcement Learning
AU - Peng, Jiankun
AU - Chen, Weiqi
AU - Fan, Yi
AU - He, Hongwen
AU - Wei, Zhongbao
AU - Ma, Chunye
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2024/3/1
Y1 - 2024/3/1
N2 - Hybrid electric vehicles (HEVs) have great potential to be discovered in terms of energy saving and emission reduction, and ecological driving provides theoretical guidance for giving full play to their advantages in real traffic scenarios. In order to implement an ecological driving strategy with the lowest cost throughout the life cycle in a car-following scenario, the safety and comfort, fuel economy, and battery health need to be considered, which is a complex nonlinear and multiobjective coupled optimization task. Therefore, a novel multi-agent deep deterministic policy gradient (MADDPG) based framework with two heterogeneous agents to optimize adaptive cruise control (ACC) and energy management strategy (EMS), respectively, is proposed, thereby decoupling optimization objectives of different domains. Because of the asynchronous of multi-agents, different learning rate schedules are analyzed to coordinate the learning process to optimize training results; an improvement on the prioritized experience replay (PER) technique is proposed, which improves the optimization performance of the original MADDPG method by more than 10%. Simulations under mixed driving cycles show that, on the premise of ensuring car-following performance, the overall driving cost, including fuel consumption and battery health degradation of the MADDPG-based method, can reach 93.88% of that of DP, and the proposed algorithm has good adaptability to different driving conditions.
AB - Hybrid electric vehicles (HEVs) have great potential to be discovered in terms of energy saving and emission reduction, and ecological driving provides theoretical guidance for giving full play to their advantages in real traffic scenarios. In order to implement an ecological driving strategy with the lowest cost throughout the life cycle in a car-following scenario, the safety and comfort, fuel economy, and battery health need to be considered, which is a complex nonlinear and multiobjective coupled optimization task. Therefore, a novel multi-agent deep deterministic policy gradient (MADDPG) based framework with two heterogeneous agents to optimize adaptive cruise control (ACC) and energy management strategy (EMS), respectively, is proposed, thereby decoupling optimization objectives of different domains. Because of the asynchronous of multi-agents, different learning rate schedules are analyzed to coordinate the learning process to optimize training results; an improvement on the prioritized experience replay (PER) technique is proposed, which improves the optimization performance of the original MADDPG method by more than 10%. Simulations under mixed driving cycles show that, on the premise of ensuring car-following performance, the overall driving cost, including fuel consumption and battery health degradation of the MADDPG-based method, can reach 93.88% of that of DP, and the proposed algorithm has good adaptability to different driving conditions.
KW - Adaptive cruise control (ACC)
KW - battery health
KW - deep deterministic policy gradient
KW - ecological driving
KW - energy management
KW - heterogeneous multi-agent
UR - http://www.scopus.com/inward/record.url?scp=85161035133&partnerID=8YFLogxK
U2 - 10.1109/TTE.2023.3278350
DO - 10.1109/TTE.2023.3278350
M3 - Article
AN - SCOPUS:85161035133
SN - 2332-7782
VL - 10
SP - 392
EP - 406
JO - IEEE Transactions on Transportation Electrification
JF - IEEE Transactions on Transportation Electrification
IS - 1
ER -