TY - JOUR
T1 - Dynamic scheduling for flexible job-shop with reconfigurable manufacturing cells considering dynamic job arrivals based on deep reinforcement learning
AU - Zheng, Liang
AU - Chen, Xiaodi
AU - Zhuang, Cunbo
AU - Liu, Jianhua
AU - Zhang, Yongyang
AU - Lai, Lijuan
N1 - Publisher Copyright:
© 2025 Informa UK Limited, trading as Taylor & Francis Group.
PY - 2025
Y1 - 2025
N2 - In smart manufacturing, uncertainties in discrete manufacturing processes are increasing, making efficient dynamic scheduling a critical challenge. This paper addresses the Dynamic Flexible Job Shop Scheduling Problem with Reconfigurable Manufacturing Cells (DFJSP-RMC), considering dynamic job arrivals and aiming to minimise makespan, tardiness, and operation instability. To tackle this problem, a Noisy Dueling Double DQN with Prioritized Experience Replay (ND3QNP) algorithm is proposed. First, the DFJSP-RMC is formulated as a mixed-integer programming problem and transformed into a Markov decision process. Next, 28 state features are designed based on job attributes, reconfigurable manufacturing cell characteristics, and dynamic factors, while 33 actions are derived from heuristic dispatching rules. A reward function is then constructed by integrating makespan, tardiness, and instability. To mitigate overestimation in the original DQN, double Q-learning, a duelling network, and a noisy network are incorporated. Additionally, prioritised experience replay is employed to enhance learning efficiency. Furthermore, an operation insertion strategy is introduced, significantly improving the algorithm's optimisation performance. Finally, experimental results demonstrate that the proposed ND3QNP algorithm outperforms classical dispatching rules and state-of-the-art deep reinforcement learning approaches in terms of makespan, total tardiness, and operation instability, showcasing its robustness in dynamic and reconfigurable manufacturing environments.
AB - In smart manufacturing, uncertainties in discrete manufacturing processes are increasing, making efficient dynamic scheduling a critical challenge. This paper addresses the Dynamic Flexible Job Shop Scheduling Problem with Reconfigurable Manufacturing Cells (DFJSP-RMC), considering dynamic job arrivals and aiming to minimise makespan, tardiness, and operation instability. To tackle this problem, a Noisy Dueling Double DQN with Prioritized Experience Replay (ND3QNP) algorithm is proposed. First, the DFJSP-RMC is formulated as a mixed-integer programming problem and transformed into a Markov decision process. Next, 28 state features are designed based on job attributes, reconfigurable manufacturing cell characteristics, and dynamic factors, while 33 actions are derived from heuristic dispatching rules. A reward function is then constructed by integrating makespan, tardiness, and instability. To mitigate overestimation in the original DQN, double Q-learning, a duelling network, and a noisy network are incorporated. Additionally, prioritised experience replay is employed to enhance learning efficiency. Furthermore, an operation insertion strategy is introduced, significantly improving the algorithm's optimisation performance. Finally, experimental results demonstrate that the proposed ND3QNP algorithm outperforms classical dispatching rules and state-of-the-art deep reinforcement learning approaches in terms of makespan, total tardiness, and operation instability, showcasing its robustness in dynamic and reconfigurable manufacturing environments.
KW - deep reinforcement learning
KW - Dynamic flexible job-shop scheduling
KW - dynamic job arrival
KW - Noisy Dueling Double DQN
KW - reconfigurable manufacturing cells
UR - http://www.scopus.com/inward/record.url?scp=105004268716&partnerID=8YFLogxK
U2 - 10.1080/00207543.2025.2497961
DO - 10.1080/00207543.2025.2497961
M3 - Article
AN - SCOPUS:105004268716
SN - 0020-7543
JO - International Journal of Production Research
JF - International Journal of Production Research
ER -