Skip to main navigation Skip to search Skip to main content

Online Updating Energy Management Strategy Based on Deep Reinforcement Learning with Accelerated Training for Hybrid Electric Tracked Vehicles

  • Bin Zhang
  • , Yuan Zou*
  • , Xudong Zhang
  • , Guodong Du
  • , Feixiang Jiao
  • , Ningyuan Guo
  • *Corresponding author for this work
  • Beijing Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

An online updating energy management strategy (EMS) based on deep reinforcement learning (DRL) with accelerated training is proposed to further reduce fuel consumption and improve the adaptability of the algorithm. The online frame continuously updates neural network parameters every predetermined time. First, the mathematical model of a series hybrid electric tracked vehicle (SHETV) is established, and the accuracy of the model is verified by collecting data from the real vehicle. Second, an intelligent EMS is developed by combining deep deterministic policy gradient (DDPG) and prioritized experience replay (PER). DDPG can improve the control effect and accelerate the training process by eliminating the discretization of variables. The addition of PER can further improve fuel economy and SOC performance and shorten the training time. Third, an online updating framework based on DDPG-PER is proposed to make EMS better adapt to complex driving conditions. Finally, a software-in-the-loop simulation is built, and the effectiveness of the proposed online updating EMS is verified by comparison with other state-of-the-art algorithms. Simulation results show that the training process of the off-line EMS is greatly accelerated, which provides a potential for online application. Meanwhile, the fuel economy of the online updating EMS reached 93.9% of the benchmark DP.

Original languageEnglish
Pages (from-to)3289-3306
Number of pages18
JournalIEEE Transactions on Transportation Electrification
Volume8
Issue number3
DOIs
Publication statusPublished - 1 Sept 2022

Keywords

  • Accelerated training
  • deep deterministic policy gradient (DDPG)
  • energy management strategy (EMS)
  • online updating framework
  • software-in-the-loop simulation

Fingerprint

Dive into the research topics of 'Online Updating Energy Management Strategy Based on Deep Reinforcement Learning with Accelerated Training for Hybrid Electric Tracked Vehicles'. Together they form a unique fingerprint.

Cite this