TY - GEN
T1 - A deep reinforcement learning based approach for dynamic job shop scheduling considering variable processing time
AU - Yang, Shuai
AU - Guo, Hongwei
AU - Huang, Jiaqi
AU - Han, Kexian
N1 - Publisher Copyright:
© 2024 Copyright held by the owner/author(s).
PY - 2024/10/4
Y1 - 2024/10/4
N2 - The job shop scheduling problem is a common challenge in intelligent manufacturing. In a real workshop environment, parameters like processing time often change dynamically. Scheduling strategies must be adjusted flexibly and quickly to match the current state. However, traditional methods only find the optimal solution for a specific instance. When environment parameters change, recalculations are required, leading to high time costs. To address these problems, a scheduling method called S2S-AC, based on deep reinforcement learning, is proposed to efficiently solve the dynamic job shop scheduling problem with variable processing times. In the proposed method, JSSP is modeled as a sequential decision-making problem using a Markov Decision Process. A new state set model, including four static states and two dynamic states, is designed with operations as the action set. An end-to-end framework that combines the Pointer Network model with the A2C algorithm is used to construct the DRL network, which is trained with multiple samples. The trained network directly outputs the scheduling strategy for new instances without requiring retraining. In static experiments, the effectiveness of S2S-AC is verified by comparing its solution results with those of SPT, LPT, MTWR, and the genetic algorithm on benchmark instances. In dynamic experiments, S2S-AC achieved the best solution results in all randomly generated test instances based on instance ft10, compared to the above methods, with relatively short solution times.
AB - The job shop scheduling problem is a common challenge in intelligent manufacturing. In a real workshop environment, parameters like processing time often change dynamically. Scheduling strategies must be adjusted flexibly and quickly to match the current state. However, traditional methods only find the optimal solution for a specific instance. When environment parameters change, recalculations are required, leading to high time costs. To address these problems, a scheduling method called S2S-AC, based on deep reinforcement learning, is proposed to efficiently solve the dynamic job shop scheduling problem with variable processing times. In the proposed method, JSSP is modeled as a sequential decision-making problem using a Markov Decision Process. A new state set model, including four static states and two dynamic states, is designed with operations as the action set. An end-to-end framework that combines the Pointer Network model with the A2C algorithm is used to construct the DRL network, which is trained with multiple samples. The trained network directly outputs the scheduling strategy for new instances without requiring retraining. In static experiments, the effectiveness of S2S-AC is verified by comparing its solution results with those of SPT, LPT, MTWR, and the genetic algorithm on benchmark instances. In dynamic experiments, S2S-AC achieved the best solution results in all randomly generated test instances based on instance ft10, compared to the above methods, with relatively short solution times.
KW - Deep reinforcement learning
KW - Dynamic scheduling
KW - Job shop scheduling
KW - Pointer network
UR - http://www.scopus.com/inward/record.url?scp=85212587515&partnerID=8YFLogxK
U2 - 10.1145/3690931.3690993
DO - 10.1145/3690931.3690993
M3 - Conference contribution
AN - SCOPUS:85212587515
T3 - ACM International Conference Proceeding Series
SP - 368
EP - 374
BT - Proceedings of 2024 4th International Conference on Artificial Intelligence, Automation and High Performance Computing, AIAHPC 2024
PB - Association for Computing Machinery
T2 - 4th International Conference on Artificial Intelligence, Automation and High Performance Computing, AIAHPC 2024
Y2 - 19 July 2024 through 21 July 2024
ER -