TY - JOUR
T1 - Battery-Aware Cooperative Merging Strategy of Connected Electric Vehicles Based on Reinforcement Learning with Hindsight Experience Replay
AU - Dong, Hanxuan
AU - Zhang, Hailong
AU - Ding, Fan
AU - Tan, Huachun
AU - Peng, Jiankun
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2022/9/1
Y1 - 2022/9/1
N2 - Cooperative control of connected and automated electric vehicles (CAEVs) offers a great potential for a safe, high-efficient, sustainable transportation system. Among them, coordinated on-ramp merging based on an on-ramp merging vehicle and a mainline facilitating is a hot spot to lighten the impact of shockwave on highway junctions. Reinforcement learning (RL) is a promising solution to address this problem for its strong adaptiveness and self-learning ability. The existing methods are dedicatedly designed to circumvent the sparse reward problem of long-term merging safety but restrict the optimization space and neglect individual costs, especially irreversible loss, such as battery aging. In this article, a hindsight experience replay RL coordinated freeway on-ramp merging frame is proposed to face the sparse reward problem in ramp merging problem and, meanwhile, co-optimize traffic efficiency, energy-saving, and battery health performance. Compared with the existing optimization-based strategy, it fundamentally avoids abundant expert knowledge and elaborate design for reward shaping and model construction while ensuring the nature of merging tasks without excessive simplification. Numerical experiments are conducted to verify the optimality, adaptability, and self-learning ability of the proposed strategy. With comparison experiment, the proposed strategy surpasses state-of-the-art RL methods more than 13% in overall index throughput with 12% energy consumption while easing individual battery aging during the merging process.
AB - Cooperative control of connected and automated electric vehicles (CAEVs) offers a great potential for a safe, high-efficient, sustainable transportation system. Among them, coordinated on-ramp merging based on an on-ramp merging vehicle and a mainline facilitating is a hot spot to lighten the impact of shockwave on highway junctions. Reinforcement learning (RL) is a promising solution to address this problem for its strong adaptiveness and self-learning ability. The existing methods are dedicatedly designed to circumvent the sparse reward problem of long-term merging safety but restrict the optimization space and neglect individual costs, especially irreversible loss, such as battery aging. In this article, a hindsight experience replay RL coordinated freeway on-ramp merging frame is proposed to face the sparse reward problem in ramp merging problem and, meanwhile, co-optimize traffic efficiency, energy-saving, and battery health performance. Compared with the existing optimization-based strategy, it fundamentally avoids abundant expert knowledge and elaborate design for reward shaping and model construction while ensuring the nature of merging tasks without excessive simplification. Numerical experiments are conducted to verify the optimality, adaptability, and self-learning ability of the proposed strategy. With comparison experiment, the proposed strategy surpasses state-of-the-art RL methods more than 13% in overall index throughput with 12% energy consumption while easing individual battery aging during the merging process.
KW - Battery health
KW - connected and automated vehicles (CAVs)
KW - coordinated on-ramp merging
KW - hindsight experience replay
KW - reinforcement learning (RL)
UR - http://www.scopus.com/inward/record.url?scp=85122056304&partnerID=8YFLogxK
U2 - 10.1109/TTE.2021.3138140
DO - 10.1109/TTE.2021.3138140
M3 - Article
AN - SCOPUS:85122056304
SN - 2332-7782
VL - 8
SP - 3725
EP - 3741
JO - IEEE Transactions on Transportation Electrification
JF - IEEE Transactions on Transportation Electrification
IS - 3
ER -