TY - JOUR
T1 - Integrated thermal-energy management for electric vehicles in high-temperature conditions using hierarchical reinforcement learning
AU - Guo, Xin
AU - Peng, Jiankun
AU - He, Hongwen
AU - Wu, Changcheng
AU - Zhang, Hailong
AU - Ma, Chunye
N1 - Publisher Copyright:
© 2025 Elsevier Ltd
PY - 2025/6/1
Y1 - 2025/6/1
N2 - The integrated thermal management system (TMS) expends a considerable quantity of energy in high-temperature environments to maintain the battery and motor systems within a reasonable temperature range, while also affecting the overall vehicle's safety and stable operation. In light of these considerations, a hierarchical reinforcement learning approach for integrated TMS is proposed. Given the low-frequency operation of the thermal system, the study employs the Deep Deterministic Policy Gradient (DDPG) with experience replay technical (DDPG-E) to investigate the battery-motor TMS control strategy under car-following behavior. This approach transforms the trade-off between high-frequency planning and low-frequency execution into a Markov Decision Process (MDP), thereby enabling coordinated optimization of multiple objectives. Specifically, in the pre-optimization layer, based on DDPG-E the heat generation optimization (HGO-DDPG-E) is introduced as a multi-objective optimization criterion to achieve “active load reduction” for the battery-motor system. Subsequently, the battery-motor temperature difference and energy consumption of TMS ancillary components are employed as constraints at the integrated control layer for all TMS components, based on the pre-optimization results. The results of the simulation demonstrate that the proposed method achieves an optimization of 15.3% in heat generation and a 14.1% reduction in TMS energy consumption.
AB - The integrated thermal management system (TMS) expends a considerable quantity of energy in high-temperature environments to maintain the battery and motor systems within a reasonable temperature range, while also affecting the overall vehicle's safety and stable operation. In light of these considerations, a hierarchical reinforcement learning approach for integrated TMS is proposed. Given the low-frequency operation of the thermal system, the study employs the Deep Deterministic Policy Gradient (DDPG) with experience replay technical (DDPG-E) to investigate the battery-motor TMS control strategy under car-following behavior. This approach transforms the trade-off between high-frequency planning and low-frequency execution into a Markov Decision Process (MDP), thereby enabling coordinated optimization of multiple objectives. Specifically, in the pre-optimization layer, based on DDPG-E the heat generation optimization (HGO-DDPG-E) is introduced as a multi-objective optimization criterion to achieve “active load reduction” for the battery-motor system. Subsequently, the battery-motor temperature difference and energy consumption of TMS ancillary components are employed as constraints at the integrated control layer for all TMS components, based on the pre-optimization results. The results of the simulation demonstrate that the proposed method achieves an optimization of 15.3% in heat generation and a 14.1% reduction in TMS energy consumption.
KW - Deep deterministic policy gradient
KW - Electrical vehicles
KW - Energy consumption optimization
KW - Thermal management
UR - http://www.scopus.com/inward/record.url?scp=86000670866&partnerID=8YFLogxK
U2 - 10.1016/j.eswa.2025.127221
DO - 10.1016/j.eswa.2025.127221
M3 - Article
AN - SCOPUS:86000670866
SN - 0957-4174
VL - 276
JO - Expert Systems with Applications
JF - Expert Systems with Applications
M1 - 127221
ER -