TY - JOUR
T1 - Heuristic Energy Management Strategy of Hybrid Electric Vehicle Based on Deep Reinforcement Learning with Accelerated Gradient Optimization
AU - Du, Guodong
AU - Zou, Yuan
AU - Zhang, Xudong
AU - Guo, Lingxiong
AU - Guo, Ningyuan
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2021/12/1
Y1 - 2021/12/1
N2 - In this article, a heuristic deep reinforcement learning (DRL) control strategy is proposed for the energy management of the series hybrid electric vehicle (SHEV). First, the powertrain model of the vehicle and the formulas of the energy management strategy (EMS) are introduced. Then, the complete control framework with a nested loop logic is constructed for the EMS. In this control framework, the heuristic experience replay (HER) is proposed to achieve more reasonable experience sampling and improve training efficiency. Besides, the adaptive moment estimation optimization method with the Nesterov accelerated gradient called NAG-Adam is presented to achieve a better optimization effect. Subsequently, the performance of the proposed control strategy is verified by the high-precision driving cycle. The simulation results show that the newly proposed method can achieve faster training speed and higher fuel economy compared to the existing DRL methods and is close to the global optimum. Finally, the adaptability, stability, and robustness of the proposed method are verified by applying different driving cycles.
AB - In this article, a heuristic deep reinforcement learning (DRL) control strategy is proposed for the energy management of the series hybrid electric vehicle (SHEV). First, the powertrain model of the vehicle and the formulas of the energy management strategy (EMS) are introduced. Then, the complete control framework with a nested loop logic is constructed for the EMS. In this control framework, the heuristic experience replay (HER) is proposed to achieve more reasonable experience sampling and improve training efficiency. Besides, the adaptive moment estimation optimization method with the Nesterov accelerated gradient called NAG-Adam is presented to achieve a better optimization effect. Subsequently, the performance of the proposed control strategy is verified by the high-precision driving cycle. The simulation results show that the newly proposed method can achieve faster training speed and higher fuel economy compared to the existing DRL methods and is close to the global optimum. Finally, the adaptability, stability, and robustness of the proposed method are verified by applying different driving cycles.
KW - Energy management control
KW - Nesterov accelerated gradient (NAG)
KW - heuristic experience replay (HER)
KW - nested loop logic
KW - series hybrid electric tracked vehicle
UR - http://www.scopus.com/inward/record.url?scp=85112199516&partnerID=8YFLogxK
U2 - 10.1109/TTE.2021.3088853
DO - 10.1109/TTE.2021.3088853
M3 - Article
AN - SCOPUS:85112199516
SN - 2332-7782
VL - 7
SP - 2194
EP - 2208
JO - IEEE Transactions on Transportation Electrification
JF - IEEE Transactions on Transportation Electrification
IS - 4
ER -