Deep reinforce learning for joint optimization of condition-based maintenance and spare ordering

Shengang Hao, Jun Zheng, Jie Yang, Haipeng Sun, Quanxin Zhang, Li Zhang*, Nan Jiang, Yuanzhang Li

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

13 Citations (Scopus)

Abstract

Condition-based maintenance (CBM) policies can avoid premature or overdue maintenance and reduce system failures and maintenance costs. However, most existing CBM studies cannot cope with the curse of dimensionality in multi-component complex systems, and only a few consider maintenance resource constraints when searching for the optimal maintenance policy, which makes them hard to apply in practice. This paper studies the joint optimization of the CBM policy and the spare-component inventory for a multi-component system with large state and action spaces. We model the problem as a Markov decision process and propose an improved deep reinforcement learning algorithm based on a stochastic policy and the actor-critic framework. In this algorithm, factorization decomposes the system action into a linear combination of each component's action. Experimental results show that the proposed algorithm achieves better time performance and lower system cost than benchmark algorithms: its training time is only 28.5% and 9.12% of that of the PPO and DQN algorithms, while the corresponding system cost is reduced by 17.39% and 27.95%, respectively. Moreover, the algorithm scales well and is suitable for solving Markov decision problems with large state and action spaces.
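To make the factorization idea in the abstract concrete, below is a minimal, illustrative PyTorch sketch of a stochastic policy whose joint maintenance action factorizes across components, so the network grows linearly rather than exponentially with the number of components. All names and sizes (FactoredActor, n_components, n_actions, the three per-component actions) are assumptions for illustration and do not reproduce the authors' published architecture.

```python
import torch
import torch.nn as nn

class FactoredActor(nn.Module):
    """Stochastic policy whose joint action factorizes across components.

    Instead of one output head over the exponentially large joint action
    space (n_actions ** n_components), a shared encoder feeds one small
    head per component, so the policy scales linearly with the number of
    components. Illustrative sketch only, not the paper's architecture.
    """

    def __init__(self, state_dim: int, n_components: int, n_actions: int, hidden: int = 128):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Linear(state_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
        )
        # One categorical head per component (e.g. do nothing / repair / replace).
        self.heads = nn.ModuleList(
            [nn.Linear(hidden, n_actions) for _ in range(n_components)]
        )

    def forward(self, state: torch.Tensor):
        h = self.encoder(state)
        # Per-component categorical distributions over maintenance actions.
        return [torch.distributions.Categorical(logits=head(h)) for head in self.heads]


if __name__ == "__main__":
    n_components, n_actions = 8, 3
    actor = FactoredActor(state_dim=n_components, n_components=n_components, n_actions=n_actions)
    state = torch.rand(1, n_components)  # e.g. normalized degradation levels
    dists = actor(state)
    joint_action = torch.stack([d.sample() for d in dists], dim=-1)
    # The joint log-probability is the sum of per-component log-probs;
    # this sum is the factorization that keeps the action space tractable.
    log_prob = torch.stack([d.log_prob(a) for d, a in zip(dists, joint_action.unbind(-1))]).sum()
    print(joint_action, log_prob.item())
```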

Original language: English
Pages (from-to): 85-100
Number of pages: 16
Journal: Information Sciences
Volume: 634
DOIs: 10.1016/j.ins.2023.03.064
Publication status: Published - Jul 2023

Keywords

  • Actor-critic framework
  • Condition-based maintenance
  • Deep reinforcement learning
  • Markov decision process
  • Stochastic policy

Cite this

Hao, S., Zheng, J., Yang, J., Sun, H., Zhang, Q., Zhang, L., Jiang, N., & Li, Y. (2023). Deep reinforce learning for joint optimization of condition-based maintenance and spare ordering. Information Sciences, 634, 85-100. https://doi.org/10.1016/j.ins.2023.03.064