Reinforcement Learning Energy Management for Hybrid Electric Tracked Vehicle with Deep Deterministic Policy Gradient

Bin Zhang; Jinlong Wu; Yuan Zou; Xudong Zhang

doi:10.1007/978-981-16-2090-4_53

Reinforcement Learning Energy Management for Hybrid Electric Tracked Vehicle with Deep Deterministic Policy Gradient

Bin Zhang^*, Jinlong Wu, Yuan Zou, Xudong Zhang

^*Corresponding author for this work

School of Mechanical Engineering

Beijing Institute of Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Citation (Scopus)

Abstract

Reinforcement learning (RL) has been applied to energy management of hybrid electric vehicles to synthesize the system efficiency and adaptability. However, the existing RL-based energy management strategies still suffer the “curse of dimensionality” due to the discretization of the state and control action variables. To cure this disadvantage, a continuous RL-based energy management adopting deep deterministic policy gradient (DDPG) is proposed and applied to a series hybrid electric tracked vehicle. First, DDPG-based energy management strategy is put forward, where two sets of neural networks are adopted to parameterize strategy and approximate the action-value function respectively to eliminate the discretization. In addition, an online updating framework of energy management is carried out to increase the adaptability of the energy management strategy. The simulation results show that the fuel consumption of the online updating strategy is 5.9% lower than that of the stationary strategy, and is close to that of dynamic programming benchmark strategy. Besides, the computational burden is significantly reduced and can be implemented in real-time.

Original language	English
Title of host publication	Proceedings of China SAE Congress 2020
Subtitle of host publication	Selected Papers
Publisher	Springer Science and Business Media Deutschland GmbH
Pages	879-893
Number of pages	15
ISBN (Print)	9789811620898
DOIs	https://doi.org/10.1007/978-981-16-2090-4_53
Publication status	Published - 2022
Event	China SAE Congress, 2020 - Shanghai, China Duration: 27 Oct 2020 → 29 Oct 2020

Publication series

Name	Lecture Notes in Electrical Engineering
Volume	769
ISSN (Print)	1876-1100
ISSN (Electronic)	1876-1119

Conference

Conference	China SAE Congress, 2020
Country/Territory	China
City	Shanghai
Period	27/10/20 → 29/10/20

Keywords

Deep reinforcement learning
Energy management
Hybrid electric tracked vehicle

Access to Document

10.1007/978-981-16-2090-4_53

Cite this

Zhang, B., Wu, J., Zou, Y., & Zhang, X. (2022). Reinforcement Learning Energy Management for Hybrid Electric Tracked Vehicle with Deep Deterministic Policy Gradient. In Proceedings of China SAE Congress 2020: Selected Papers (pp. 879-893). (Lecture Notes in Electrical Engineering; Vol. 769). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-981-16-2090-4_53

@inproceedings{b864c644a31146be97aa8ea589efda3a,

title = "Reinforcement Learning Energy Management for Hybrid Electric Tracked Vehicle with Deep Deterministic Policy Gradient",

abstract = "Reinforcement learning (RL) has been applied to energy management of hybrid electric vehicles to synthesize the system efficiency and adaptability. However, the existing RL-based energy management strategies still suffer the “curse of dimensionality” due to the discretization of the state and control action variables. To cure this disadvantage, a continuous RL-based energy management adopting deep deterministic policy gradient (DDPG) is proposed and applied to a series hybrid electric tracked vehicle. First, DDPG-based energy management strategy is put forward, where two sets of neural networks are adopted to parameterize strategy and approximate the action-value function respectively to eliminate the discretization. In addition, an online updating framework of energy management is carried out to increase the adaptability of the energy management strategy. The simulation results show that the fuel consumption of the online updating strategy is 5.9% lower than that of the stationary strategy, and is close to that of dynamic programming benchmark strategy. Besides, the computational burden is significantly reduced and can be implemented in real-time.",

keywords = "Deep reinforcement learning, Energy management, Hybrid electric tracked vehicle",

author = "Bin Zhang and Jinlong Wu and Yuan Zou and Xudong Zhang",

note = "Publisher Copyright: {\textcopyright} 2022, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.; China SAE Congress, 2020 ; Conference date: 27-10-2020 Through 29-10-2020",

year = "2022",

doi = "10.1007/978-981-16-2090-4_53",

language = "English",

isbn = "9789811620898",

series = "Lecture Notes in Electrical Engineering",

publisher = "Springer Science and Business Media Deutschland GmbH",

pages = "879--893",

booktitle = "Proceedings of China SAE Congress 2020",

address = "Germany",

}

Zhang, B, Wu, J, Zou, Y & Zhang, X 2022, Reinforcement Learning Energy Management for Hybrid Electric Tracked Vehicle with Deep Deterministic Policy Gradient. in Proceedings of China SAE Congress 2020: Selected Papers. Lecture Notes in Electrical Engineering, vol. 769, Springer Science and Business Media Deutschland GmbH, pp. 879-893, China SAE Congress, 2020, Shanghai, China, 27/10/20. https://doi.org/10.1007/978-981-16-2090-4_53

Reinforcement Learning Energy Management for Hybrid Electric Tracked Vehicle with Deep Deterministic Policy Gradient. / Zhang, Bin; Wu, Jinlong; Zou, Yuan et al.
Proceedings of China SAE Congress 2020: Selected Papers. Springer Science and Business Media Deutschland GmbH, 2022. p. 879-893 (Lecture Notes in Electrical Engineering; Vol. 769).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Reinforcement Learning Energy Management for Hybrid Electric Tracked Vehicle with Deep Deterministic Policy Gradient

AU - Zhang, Bin

AU - Wu, Jinlong

AU - Zou, Yuan

AU - Zhang, Xudong

PY - 2022

Y1 - 2022

N2 - Reinforcement learning (RL) has been applied to energy management of hybrid electric vehicles to synthesize the system efficiency and adaptability. However, the existing RL-based energy management strategies still suffer the “curse of dimensionality” due to the discretization of the state and control action variables. To cure this disadvantage, a continuous RL-based energy management adopting deep deterministic policy gradient (DDPG) is proposed and applied to a series hybrid electric tracked vehicle. First, DDPG-based energy management strategy is put forward, where two sets of neural networks are adopted to parameterize strategy and approximate the action-value function respectively to eliminate the discretization. In addition, an online updating framework of energy management is carried out to increase the adaptability of the energy management strategy. The simulation results show that the fuel consumption of the online updating strategy is 5.9% lower than that of the stationary strategy, and is close to that of dynamic programming benchmark strategy. Besides, the computational burden is significantly reduced and can be implemented in real-time.

AB - Reinforcement learning (RL) has been applied to energy management of hybrid electric vehicles to synthesize the system efficiency and adaptability. However, the existing RL-based energy management strategies still suffer the “curse of dimensionality” due to the discretization of the state and control action variables. To cure this disadvantage, a continuous RL-based energy management adopting deep deterministic policy gradient (DDPG) is proposed and applied to a series hybrid electric tracked vehicle. First, DDPG-based energy management strategy is put forward, where two sets of neural networks are adopted to parameterize strategy and approximate the action-value function respectively to eliminate the discretization. In addition, an online updating framework of energy management is carried out to increase the adaptability of the energy management strategy. The simulation results show that the fuel consumption of the online updating strategy is 5.9% lower than that of the stationary strategy, and is close to that of dynamic programming benchmark strategy. Besides, the computational burden is significantly reduced and can be implemented in real-time.

KW - Deep reinforcement learning

KW - Energy management

KW - Hybrid electric tracked vehicle

UR - http://www.scopus.com/inward/record.url?scp=85124001816&partnerID=8YFLogxK

U2 - 10.1007/978-981-16-2090-4_53

DO - 10.1007/978-981-16-2090-4_53

M3 - Conference contribution

AN - SCOPUS:85124001816

SN - 9789811620898

T3 - Lecture Notes in Electrical Engineering

SP - 879

EP - 893

BT - Proceedings of China SAE Congress 2020

PB - Springer Science and Business Media Deutschland GmbH

T2 - China SAE Congress, 2020

Y2 - 27 October 2020 through 29 October 2020

ER -

Reinforcement Learning Energy Management for Hybrid Electric Tracked Vehicle with Deep Deterministic Policy Gradient

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this