TY - GEN
T1 - Learning to Solve Pod Retrieval as Sequential Decision Making Problem
AU - Fan, Yunfeng
AU - Deng, Fang
AU - Shi, Xiang
AU - Yang, Jing
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - The problem of pod retrieval in Robotic Mobile Fulfilment System (RMFS) is a key problem to improve the order picking efficiency. In such system, each robot needs to complete a set of retrieval requests, including bringing each pod from a retrieval location to a picking station and return the pod to a storage location. The objective is to minimize the total cost for each robot with all retrieval requests completed. In the previous literature, the problem was viewed as a static combinatorial optimization problem, which was commonly solved by heuristic methods. This kind of approachs often face with computational efficiency problems and are hard to satisfy the real-time requirement in complex real scenes. In this paper, we formulate the problem as a Markov Decision Process, a kind of Sequential Decision Making Problem, and then using Transformer with reinforcement learning to learn an efficient retrieval policy. The effectiveness of the method is verified by experiments.
AB - The problem of pod retrieval in Robotic Mobile Fulfilment System (RMFS) is a key problem to improve the order picking efficiency. In such system, each robot needs to complete a set of retrieval requests, including bringing each pod from a retrieval location to a picking station and return the pod to a storage location. The objective is to minimize the total cost for each robot with all retrieval requests completed. In the previous literature, the problem was viewed as a static combinatorial optimization problem, which was commonly solved by heuristic methods. This kind of approachs often face with computational efficiency problems and are hard to satisfy the real-time requirement in complex real scenes. In this paper, we formulate the problem as a Markov Decision Process, a kind of Sequential Decision Making Problem, and then using Transformer with reinforcement learning to learn an efficient retrieval policy. The effectiveness of the method is verified by experiments.
UR - http://www.scopus.com/inward/record.url?scp=85135825805&partnerID=8YFLogxK
U2 - 10.1109/ICCA54724.2022.9831817
DO - 10.1109/ICCA54724.2022.9831817
M3 - Conference contribution
AN - SCOPUS:85135825805
T3 - IEEE International Conference on Control and Automation, ICCA
SP - 220
EP - 224
BT - 2022 IEEE 17th International Conference on Control and Automation, ICCA 2022
PB - IEEE Computer Society
T2 - 17th IEEE International Conference on Control and Automation, ICCA 2022
Y2 - 27 June 2022 through 30 June 2022
ER -