TY - JOUR
T1 - Continuous Reinforcement Learning-Based Energy Management Strategy for Hybrid Electric-Tracked Vehicles
AU - Han, Ruoyan
AU - Lian, Renzong
AU - He, Hongwen
AU - Han, Xuefeng
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2023/2/1
Y1 - 2023/2/1
N2 - The hybrid electric-tracked vehicles (HETVs) are usually used in both agricultural and industrial applications, while the optimal energy management is critical to fully exploit the potential of HETVs. In this article, the influence of HETVs' steering resistance on the energy distribution is specially considered to model the dynamic demand accurately. Further, a multi-state energy management strategy (EMS) based on deep deterministic policy gradient (DDPG) is proposed for a series HETV in the continuous space. A multidimensional matrix framework is proposed to extract the parameters of the actor network from a trained DDPG-based EMS. Hardware-in-the-loop (HiL) experiment is conducted to validate the real-time tractability of the proposed strategy. Results suggest that the DDPG-based strategy improves the fuel economy remarkably by 13.1% and shows a more robust performance, compared with the double deep $Q$ -learning-based strategy. Though the proposed strategy is trained based on the fixed state of charge (SOC), it still exhibits a strong adaptability to the uncertainty of initial SOCs.
AB - The hybrid electric-tracked vehicles (HETVs) are usually used in both agricultural and industrial applications, while the optimal energy management is critical to fully exploit the potential of HETVs. In this article, the influence of HETVs' steering resistance on the energy distribution is specially considered to model the dynamic demand accurately. Further, a multi-state energy management strategy (EMS) based on deep deterministic policy gradient (DDPG) is proposed for a series HETV in the continuous space. A multidimensional matrix framework is proposed to extract the parameters of the actor network from a trained DDPG-based EMS. Hardware-in-the-loop (HiL) experiment is conducted to validate the real-time tractability of the proposed strategy. Results suggest that the DDPG-based strategy improves the fuel economy remarkably by 13.1% and shows a more robust performance, compared with the double deep $Q$ -learning-based strategy. Though the proposed strategy is trained based on the fixed state of charge (SOC), it still exhibits a strong adaptability to the uncertainty of initial SOCs.
KW - Energy management strategy (EMS)
KW - hardware-in-the-loop (HiL)
KW - hybrid electric vehicle (HEV)
KW - machine learning
KW - vehicle dynamics
UR - http://www.scopus.com/inward/record.url?scp=85121784018&partnerID=8YFLogxK
U2 - 10.1109/JESTPE.2021.3135059
DO - 10.1109/JESTPE.2021.3135059
M3 - Article
AN - SCOPUS:85121784018
SN - 2168-6777
VL - 11
SP - 19
EP - 31
JO - IEEE Journal of Emerging and Selected Topics in Power Electronics
JF - IEEE Journal of Emerging and Selected Topics in Power Electronics
IS - 1
ER -