TY - GEN
T1 - Learning to Solve Multi-AGV Scheduling Problem with Pod Repositioning Optimization in RMFS
AU - Zhou, Xuan
AU - Shi, Xiang
AU - Chu, Wenqing
AU - Jiang, Jingchen
AU - Zhang, Lele
AU - Deng, Fang
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - This paper studies a novel non-deterministic polynomial-hard (NP-hard) warehouse optimization problem, the multi-automated guided vehicle (multi-AGV) scheduling problem with pod repositioning optimization (MAS-PR), in a robotic mobile fulfillment system (RMFS). A major challenge in MAS-PR is that the storage assignment of pods and the task allocation decision of AGVs are closely coupled. To address this problem, an end-to-end Problem-Specific Mask-based Deep Reinforcement Learning (PSMDRL) method is proposed in this paper. In PSMDRL, we transform the MAS-PR into a Markov Decision Process (MDP) model, and then use an attention-based network model to learn the efficient allocation and retrieval policy. A mask strategy with problem-specific knowledge and a model structure with decoder embedding are designed to further improve the efficiency and effectiveness. Experimental results demonstrate that the proposed PSMDRL is more effective and efficient than the representative algorithms in solving the MAS-PR problem.
AB - This paper studies a novel non-deterministic polynomial-hard (NP-hard) warehouse optimization problem, the multi-automated guided vehicle (multi-AGV) scheduling problem with pod repositioning optimization (MAS-PR), in a robotic mobile fulfillment system (RMFS). A major challenge in MAS-PR is that the storage assignment of pods and the task allocation decision of AGVs are closely coupled. To address this problem, an end-to-end Problem-Specific Mask-based Deep Reinforcement Learning (PSMDRL) method is proposed in this paper. In PSMDRL, we transform the MAS-PR into a Markov Decision Process (MDP) model, and then use an attention-based network model to learn the efficient allocation and retrieval policy. A mask strategy with problem-specific knowledge and a model structure with decoder embedding are designed to further improve the efficiency and effectiveness. Experimental results demonstrate that the proposed PSMDRL is more effective and efficient than the representative algorithms in solving the MAS-PR problem.
UR - http://www.scopus.com/inward/record.url?scp=85195791329&partnerID=8YFLogxK
U2 - 10.1109/ICIT58233.2024.10541015
DO - 10.1109/ICIT58233.2024.10541015
M3 - Conference contribution
AN - SCOPUS:85195791329
T3 - Proceedings of the IEEE International Conference on Industrial Technology
BT - ICIT 2024 - 2024 25th International Conference on Industrial Technology
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 25th IEEE International Conference on Industrial Technology, ICIT 2024
Y2 - 25 March 2024 through 27 March 2024
ER -