Learning to Solve Multi-AGV Scheduling Problem with Pod Repositioning Optimization in RMFS

Xuan Zhou, Xiang Shi, Wenqing Chu, Jingchen Jiang, Lele Zhang, Fang Deng*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

This paper studies a novel non-deterministic polynomial-hard (NP-hard) warehouse optimization problem, the multi-automated guided vehicle (multi-AGV) scheduling problem with pod repositioning optimization (MAS-PR), in a robotic mobile fulfillment system (RMFS). A major challenge in MAS-PR is that the storage assignment of pods and the task allocation decision of AGVs are closely coupled. To address this problem, an end-to-end Problem-Specific Mask-based Deep Reinforcement Learning (PSMDRL) method is proposed in this paper. In PSMDRL, we transform the MAS-PR into a Markov Decision Process (MDP) model, and then use an attention-based network model to learn the efficient allocation and retrieval policy. A mask strategy with problem-specific knowledge and a model structure with decoder embedding are designed to further improve the efficiency and effectiveness. Experimental results demonstrate that the proposed PSMDRL is more effective and efficient than the representative algorithms in solving the MAS-PR problem.

Original languageEnglish
Title of host publicationICIT 2024 - 2024 25th International Conference on Industrial Technology
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350340266
DOIs
Publication statusPublished - 2024
Event25th IEEE International Conference on Industrial Technology, ICIT 2024 - Bristol, United Kingdom
Duration: 25 Mar 202427 Mar 2024

Publication series

NameProceedings of the IEEE International Conference on Industrial Technology
ISSN (Print)2641-0184
ISSN (Electronic)2643-2978

Conference

Conference25th IEEE International Conference on Industrial Technology, ICIT 2024
Country/TerritoryUnited Kingdom
CityBristol
Period25/03/2427/03/24

Fingerprint

Dive into the research topics of 'Learning to Solve Multi-AGV Scheduling Problem with Pod Repositioning Optimization in RMFS'. Together they form a unique fingerprint.

Cite this