TY - JOUR
T1 - Energy management optimization for connected hybrid electric vehicle using offline reinforcement learning
AU - He, Hongwen
AU - Niu, Zegong
AU - Wang, Yong
AU - Huang, Ruchen
AU - Shou, Yiwen
N1 - Publisher Copyright:
© 2023
PY - 2023/11/25
Y1 - 2023/11/25
N2 - Energy management strategy (EMS) is critical to ensure the long-term energy economy of hybrid electric vehicles. The classical deep reinforcement learning algorithms exist many issues such as the safety constraint and the simulation-to-real gap, resulting in difficulties in applications to industrial tasks. Thus, this paper proposes a novel EMS and a policy updating method based on the offline deep reinforcement learning algorithm to address the energy optimization problem. Firstly, the batch-constrained deep Q-learning algorithm is applied to provide a solution training the control strategy based on the existing datasets without interaction with the environment. Secondly, the EMS updating method is proposed to improve the adaptability under complicated driving cycles. The Jensen–Shannon divergence is introduced to determine when offline control strategy updates in real time. Finally, the optimality and effectiveness of the proposed EMS are validated, and the real-time and adaptive performance of the proposed method is also verified. The results indicate that the proposed EMS can learn a superior policy from fixed data, and the proposed updating method can utilize the real-time data to update the offline policy approaching the online deep reinforcement learning-based strategy in energy consumption.
AB - Energy management strategy (EMS) is critical to ensure the long-term energy economy of hybrid electric vehicles. The classical deep reinforcement learning algorithms exist many issues such as the safety constraint and the simulation-to-real gap, resulting in difficulties in applications to industrial tasks. Thus, this paper proposes a novel EMS and a policy updating method based on the offline deep reinforcement learning algorithm to address the energy optimization problem. Firstly, the batch-constrained deep Q-learning algorithm is applied to provide a solution training the control strategy based on the existing datasets without interaction with the environment. Secondly, the EMS updating method is proposed to improve the adaptability under complicated driving cycles. The Jensen–Shannon divergence is introduced to determine when offline control strategy updates in real time. Finally, the optimality and effectiveness of the proposed EMS are validated, and the real-time and adaptive performance of the proposed method is also verified. The results indicate that the proposed EMS can learn a superior policy from fixed data, and the proposed updating method can utilize the real-time data to update the offline policy approaching the online deep reinforcement learning-based strategy in energy consumption.
KW - Energy management strategy (EMS)
KW - Hybrid electric vehicle (HEV)
KW - Jensen–Shannon divergence
KW - Offline deep reinforcement learning (DRL)
UR - http://www.scopus.com/inward/record.url?scp=85166184363&partnerID=8YFLogxK
U2 - 10.1016/j.est.2023.108517
DO - 10.1016/j.est.2023.108517
M3 - Article
AN - SCOPUS:85166184363
SN - 2352-152X
VL - 72
JO - Journal of Energy Storage
JF - Journal of Energy Storage
M1 - 108517
ER -