Learning to Solve Pod Retrieval as Sequential Decision Making Problem

Yunfeng Fan, Fang Deng, Xiang Shi, Jing Yang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

The problem of pod retrieval in Robotic Mobile Fulfilment System (RMFS) is a key problem to improve the order picking efficiency. In such system, each robot needs to complete a set of retrieval requests, including bringing each pod from a retrieval location to a picking station and return the pod to a storage location. The objective is to minimize the total cost for each robot with all retrieval requests completed. In the previous literature, the problem was viewed as a static combinatorial optimization problem, which was commonly solved by heuristic methods. This kind of approachs often face with computational efficiency problems and are hard to satisfy the real-time requirement in complex real scenes. In this paper, we formulate the problem as a Markov Decision Process, a kind of Sequential Decision Making Problem, and then using Transformer with reinforcement learning to learn an efficient retrieval policy. The effectiveness of the method is verified by experiments.

Original languageEnglish
Title of host publication2022 IEEE 17th International Conference on Control and Automation, ICCA 2022
PublisherIEEE Computer Society
Pages220-224
Number of pages5
ISBN (Electronic)9781665495721
DOIs
Publication statusPublished - 2022
Event17th IEEE International Conference on Control and Automation, ICCA 2022 - Naples, Italy
Duration: 27 Jun 202230 Jun 2022

Publication series

NameIEEE International Conference on Control and Automation, ICCA
Volume2022-June
ISSN (Print)1948-3449
ISSN (Electronic)1948-3457

Conference

Conference17th IEEE International Conference on Control and Automation, ICCA 2022
Country/TerritoryItaly
CityNaples
Period27/06/2230/06/22

Fingerprint

Dive into the research topics of 'Learning to Solve Pod Retrieval as Sequential Decision Making Problem'. Together they form a unique fingerprint.

Cite this