TY - JOUR
T1 - A reinforcement learning-based energy management strategy for a battery–ultracapacitor electric vehicle considering temperature effects
AU - Wang, Chun
AU - Liu, Rui
AU - Tang, Aihua
AU - Zhang, Zhigang
AU - Liu, Pu
N1 - Publisher Copyright:
© 2023 John Wiley & Sons Ltd.
PY - 2023/10
Y1 - 2023/10
N2 - The design of energy management strategy (EMS) plays a vital role in the power performance and economy of battery–ultracapacitor for electric vehicles. A reinforcement learning (RL)-based EMS is proposed to obtain an optimal power allocation strategy for battery–ultracapacitor electric vehicle, and its robustness is verified at different temperatures. First of all, the dynamic characteristic experiments of the battery and ultracapacitor were performed at 10°C, 25°C, and 40°C to obtain mechanism characteristics at different temperatures. Secondly, a genetic algorithm is selected to identify the parameters of the battery and ultracapacitor model. Next, the RL-based strategy takes the minimum energy loss of the hybrid energy storage system as the reward function and solves the optimal policy based on Markov theory. The simulation results show that the economy of the RL-based strategy correspondingly improved by 3.05%, 3.20%, and 3.15% at different temperatures in comparison with the fuzzy-based strategy, and the economic gap between the RL-based strategy and the DP-based strategy is further narrowed down to 7.30%, 3.88%, and 8.40% at different temperatures, respectively. Finally, the proposed strategy is validated under different driving conditions, which indicate that the RL-based strategy can effectively reduce energy consumption and has good robustness at different temperatures.
AB - The design of energy management strategy (EMS) plays a vital role in the power performance and economy of battery–ultracapacitor for electric vehicles. A reinforcement learning (RL)-based EMS is proposed to obtain an optimal power allocation strategy for battery–ultracapacitor electric vehicle, and its robustness is verified at different temperatures. First of all, the dynamic characteristic experiments of the battery and ultracapacitor were performed at 10°C, 25°C, and 40°C to obtain mechanism characteristics at different temperatures. Secondly, a genetic algorithm is selected to identify the parameters of the battery and ultracapacitor model. Next, the RL-based strategy takes the minimum energy loss of the hybrid energy storage system as the reward function and solves the optimal policy based on Markov theory. The simulation results show that the economy of the RL-based strategy correspondingly improved by 3.05%, 3.20%, and 3.15% at different temperatures in comparison with the fuzzy-based strategy, and the economic gap between the RL-based strategy and the DP-based strategy is further narrowed down to 7.30%, 3.88%, and 8.40% at different temperatures, respectively. Finally, the proposed strategy is validated under different driving conditions, which indicate that the RL-based strategy can effectively reduce energy consumption and has good robustness at different temperatures.
KW - energy management strategy
KW - parameter identification
KW - reinforcement learning
KW - temperature effect
UR - http://www.scopus.com/inward/record.url?scp=85159051201&partnerID=8YFLogxK
U2 - 10.1002/cta.3656
DO - 10.1002/cta.3656
M3 - Article
AN - SCOPUS:85159051201
SN - 0098-9886
VL - 51
SP - 4690
EP - 4710
JO - International Journal of Circuit Theory and Applications
JF - International Journal of Circuit Theory and Applications
IS - 10
ER -