TY - JOUR
T1 - Real-time adaptive energy management strategy based on multi-agent collaborative decision-time planning for off-road hybrid electric vehicles
AU - Ma, Xiaokang
AU - Liu, Hui
AU - Han, Lijin
AU - Yang, Ningkang
AU - Guo, Congshuai
N1 - Publisher Copyright:
© 2026 Elsevier Ltd.
PY - 2026/7/1
Y1 - 2026/7/1
N2 - Real-time energy management for off-road hybrid electric vehicles (HEVs) poses significant challenges under variable and unknown driving conditions. Spurred by this challenge, this paper proposes a real-time adaptive energy management strategy (EMS) based on a model-based reinforcement learning (MBRL). Within the MBRL framework, a reinforcement learning (RL) oriented environment model is first constructed, consisting of a virtual vehicle model that represents a deterministic powertrain and an online Markov chain (MC) model to reflect dynamic driving conditions. Notably, both models support continuous online updates. Secondly, a novel multi-agent collaborative decision-time planning (DTP) algorithm is introduced. Unlike conventional RL methods that require learning a complete driving cycle, it learns the optimal action for each actual encountered vehicle state within the environment model. Moreover, its internal multi-agent collaborative mechanism combines a series of agent strategies, trained to converge under typical driving cycles, ensuring real-time performance. Simulation results under unknown off-road conditions demonstrate that the proposed strategy incurs only a 2.4% increase in fuel consumption and a 2.5% increase in state of health (SOH) compared to dynamic programming (DP), while maintaining a highly similar state of charge (SOC) trajectory. Simultaneously, the single-step computation time is merely 10.7 ms. It also significantly outperforms model-free Q-learning (MFQL) and rule-based strategies. Moreover, its consistent performance across three additional unknown driving cycles confirms its strong robustness. Finally, the effectiveness of the proposed strategy is validated on a hardware-in-the-loop (HIL) platform.
AB - Real-time energy management for off-road hybrid electric vehicles (HEVs) poses significant challenges under variable and unknown driving conditions. Spurred by this challenge, this paper proposes a real-time adaptive energy management strategy (EMS) based on a model-based reinforcement learning (MBRL). Within the MBRL framework, a reinforcement learning (RL) oriented environment model is first constructed, consisting of a virtual vehicle model that represents a deterministic powertrain and an online Markov chain (MC) model to reflect dynamic driving conditions. Notably, both models support continuous online updates. Secondly, a novel multi-agent collaborative decision-time planning (DTP) algorithm is introduced. Unlike conventional RL methods that require learning a complete driving cycle, it learns the optimal action for each actual encountered vehicle state within the environment model. Moreover, its internal multi-agent collaborative mechanism combines a series of agent strategies, trained to converge under typical driving cycles, ensuring real-time performance. Simulation results under unknown off-road conditions demonstrate that the proposed strategy incurs only a 2.4% increase in fuel consumption and a 2.5% increase in state of health (SOH) compared to dynamic programming (DP), while maintaining a highly similar state of charge (SOC) trajectory. Simultaneously, the single-step computation time is merely 10.7 ms. It also significantly outperforms model-free Q-learning (MFQL) and rule-based strategies. Moreover, its consistent performance across three additional unknown driving cycles confirms its strong robustness. Finally, the effectiveness of the proposed strategy is validated on a hardware-in-the-loop (HIL) platform.
KW - Model-based reinforcement learning
KW - Multi-agent collaborative decision-time planning
KW - Off-road hybrid electric vehicle
KW - Real-time energy management
UR - https://www.scopus.com/pages/publications/105033716930
U2 - 10.1016/j.engappai.2026.114598
DO - 10.1016/j.engappai.2026.114598
M3 - Article
AN - SCOPUS:105033716930
SN - 0952-1976
VL - 175
JO - Engineering Applications of Artificial Intelligence
JF - Engineering Applications of Artificial Intelligence
M1 - 114598
ER -