基于归一化优势函数的强化学习混合动力履带车辆能量管理

Translated title of the contribution: Energy Management of Hybrid Tracked Vehicle Based on Reinforcement Learning with Normalized Advantage Function

Yuan Zou, Bin Zhang, Xudong Zhang, Zhiying Zhao, Tieyu Kang, Yufeng Guo, Zhe Wu

Research output: Contribution to journalArticlepeer-review

7 Citations (Scopus)

Abstract

The energy management strategy based on reinforcement learning encounters the problem of "dimension disaster"when dealing with high-dimensional problems because of the discretization of state and control variables. For this problem, a new energy management algorithm based on deep reinforcement learning with normalized advantage function is proposed, where two deep neural networks with normalized advantage function are used to realize the continuous control of energy and eliminate the discretization of state and control variables. Based on the modeling of powertrain of a series hybrid tracked vehicle, the framework of the proposed deep reinforcement learning algorithm was built and the parameter update process was completed for the series hybrid tracked vehicle. The simulated results show that the proposed algorithm can output more refined control quantity and less output fluctuation. Compared with the deep Q-learning algorithm, the proposed algorithm improves the fuel economy of series hybrid tracked vehicle by 3.96%. In addition, the adaptability of the proposed algorithm and the optimized effect in real-time control environment are verified by the hardware-in-the-loop simulation.

Translated title of the contributionEnergy Management of Hybrid Tracked Vehicle Based on Reinforcement Learning with Normalized Advantage Function
Original languageChinese (Traditional)
Pages (from-to)2159-2169
Number of pages11
JournalBinggong Xuebao/Acta Armamentarii
Volume42
Issue number10
DOIs
Publication statusPublished - Oct 2021

Fingerprint

Dive into the research topics of 'Energy Management of Hybrid Tracked Vehicle Based on Reinforcement Learning with Normalized Advantage Function'. Together they form a unique fingerprint.

Cite this