Tackling SOC long-term dynamic for energy management of hybrid electric buses via adaptive policy optimization

Hailong Zhang, Jiankun Peng*, Huachun Tan, Hanxuan Dong, Fan Ding, Bin Ran

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

23 引用 (Scopus)

摘要

Plug-in hybrid electric buses (PHEBs) have the potential to satisfy both the fuel efficiency and the driving-mileage under complex urban traffic conditions. However, the optimal charge and discharge management is still a pivotal challenge of energy management for the inherent uncertainty in driving conditions. The common reference state-of-charge (SOC) profile based methods are limited by the adaptiveness which restricts the economic performance of on-line energy management systems. Promisingly, reinforcement learning based energy management strategies exhibited the significant self-learning ability. However, for PHEBs, the sparse rewards by the long-term SOC shortage make the strategies easily trick into the local optimal solution. The work presented in this paper concentrates on combining battery power reduction in the form of conditional entropy into reinforcement learning based energy management strategy. The proposed method named adaptive policy optimization (APO) introduces a novel advantage function to evaluate energy-saving performance considering long-term SOC dynamic, and a Bayesian neural network based SOC shortage probability estimator is utilized to optimize the energy management strategy parameterized by a deep neural network. Several experiments in a standard driving cycle demonstrate the optimality, self-learning ability and convergence of the APO. Moreover, the adaptability and robust performance get validated over the real bus trajectories data. With the comprehensive experiments in this paper, the proposed model exhibits enhanced fuel economy and more suitable SOC planning in comparison with the existing energy management strategies. The results indicate that APO respectively outperforms the compared online strategies by 9.8% and 2.6% and reaches 98% energy-saving rate of the offline global optimum.

源语言英语
文章编号115031
期刊Applied Energy
269
DOI
出版状态已出版 - 1 7月 2020
已对外发布

指纹

探究 'Tackling SOC long-term dynamic for energy management of hybrid electric buses via adaptive policy optimization' 的科研主题。它们共同构成独一无二的指纹。

引用此