TY - GEN
T1 - A Reinforcement Learning Approach for Integrated Scheduling in Automated Container Terminals
AU - Zhang, Zhanluo
AU - Zhuang, Zilong
AU - Qin, Wei
AU - Fang, Huaijin
AU - Lan, Shulin
AU - Yang, Chen
AU - Tian, Yu
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Automated container terminals are complex systems with multiple interactions and high dynamic characteristics. Integrated scheduling is expected to improve the overall efficiency. However, traditional optimization approaches such as mathematical models and meta-heuristic algorithms failed to tackle high dynamics. A reinforcement learning approach based on the scheduling network method is presented in this paper. Network-based heuristic rules are introduced into the action space, and a novel state definition that integrates local and global information about the scheduling problem is proposed. Group training and group validating strategies are adopted to test the generalization ability. Numerical experiment results reveal that the proposed approach converges to a high level and maintains good performance on unseen instances. Compared to the selected heuristic rules, the proposed method achieves 2.37% and 6.06% better results on training and test instances, respectively.
AB - Automated container terminals are complex systems with multiple interactions and high dynamic characteristics. Integrated scheduling is expected to improve the overall efficiency. However, traditional optimization approaches such as mathematical models and meta-heuristic algorithms failed to tackle high dynamics. A reinforcement learning approach based on the scheduling network method is presented in this paper. Network-based heuristic rules are introduced into the action space, and a novel state definition that integrates local and global information about the scheduling problem is proposed. Group training and group validating strategies are adopted to test the generalization ability. Numerical experiment results reveal that the proposed approach converges to a high level and maintains good performance on unseen instances. Compared to the selected heuristic rules, the proposed method achieves 2.37% and 6.06% better results on training and test instances, respectively.
KW - Automated container terminal
KW - integrated scheduling
KW - network scheduling
KW - reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85146339145&partnerID=8YFLogxK
U2 - 10.1109/IEEM55944.2022.9989692
DO - 10.1109/IEEM55944.2022.9989692
M3 - Conference contribution
AN - SCOPUS:85146339145
T3 - IEEE International Conference on Industrial Engineering and Engineering Management
SP - 1182
EP - 1186
BT - IEEE International Conference on Industrial Engineering and Engineering Management, IEEM 2022
PB - IEEE Computer Society
T2 - 2022 IEEE International Conference on Industrial Engineering and Engineering Management, IEEM 2022
Y2 - 7 December 2022 through 10 December 2022
ER -