Learning to Solve Pod Retrieval as Sequential Decision Making Problem

Yunfeng Fan; Fang Deng; Xiang Shi; Jing Yang

doi:10.1109/ICCA54724.2022.9831817

Learning to Solve Pod Retrieval as Sequential Decision Making Problem

Yunfeng Fan, Fang Deng, Xiang Shi, Jing Yang

School of Automation

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Citation (Scopus)

Abstract

The problem of pod retrieval in Robotic Mobile Fulfilment System (RMFS) is a key problem to improve the order picking efficiency. In such system, each robot needs to complete a set of retrieval requests, including bringing each pod from a retrieval location to a picking station and return the pod to a storage location. The objective is to minimize the total cost for each robot with all retrieval requests completed. In the previous literature, the problem was viewed as a static combinatorial optimization problem, which was commonly solved by heuristic methods. This kind of approachs often face with computational efficiency problems and are hard to satisfy the real-time requirement in complex real scenes. In this paper, we formulate the problem as a Markov Decision Process, a kind of Sequential Decision Making Problem, and then using Transformer with reinforcement learning to learn an efficient retrieval policy. The effectiveness of the method is verified by experiments.

Original language	English
Title of host publication	2022 IEEE 17th International Conference on Control and Automation, ICCA 2022
Publisher	IEEE Computer Society
Pages	220-224
Number of pages	5
ISBN (Electronic)	9781665495721
DOIs	https://doi.org/10.1109/ICCA54724.2022.9831817
Publication status	Published - 2022
Event	17th IEEE International Conference on Control and Automation, ICCA 2022 - Naples, Italy Duration: 27 Jun 2022 → 30 Jun 2022

Publication series

Name	IEEE International Conference on Control and Automation, ICCA
Volume	2022-June
ISSN (Print)	1948-3449
ISSN (Electronic)	1948-3457

Conference

Conference	17th IEEE International Conference on Control and Automation, ICCA 2022
Country/Territory	Italy
City	Naples
Period	27/06/22 → 30/06/22

Access to Document

10.1109/ICCA54724.2022.9831817

Cite this

@inproceedings{ba3da196908d4c2abbfe015602d48cf3,

title = "Learning to Solve Pod Retrieval as Sequential Decision Making Problem",

abstract = "The problem of pod retrieval in Robotic Mobile Fulfilment System (RMFS) is a key problem to improve the order picking efficiency. In such system, each robot needs to complete a set of retrieval requests, including bringing each pod from a retrieval location to a picking station and return the pod to a storage location. The objective is to minimize the total cost for each robot with all retrieval requests completed. In the previous literature, the problem was viewed as a static combinatorial optimization problem, which was commonly solved by heuristic methods. This kind of approachs often face with computational efficiency problems and are hard to satisfy the real-time requirement in complex real scenes. In this paper, we formulate the problem as a Markov Decision Process, a kind of Sequential Decision Making Problem, and then using Transformer with reinforcement learning to learn an efficient retrieval policy. The effectiveness of the method is verified by experiments.",

author = "Yunfeng Fan and Fang Deng and Xiang Shi and Jing Yang",

note = "Publisher Copyright: {\textcopyright} 2022 IEEE.; 17th IEEE International Conference on Control and Automation, ICCA 2022 ; Conference date: 27-06-2022 Through 30-06-2022",

year = "2022",

doi = "10.1109/ICCA54724.2022.9831817",

language = "English",

series = "IEEE International Conference on Control and Automation, ICCA",

publisher = "IEEE Computer Society",

pages = "220--224",

booktitle = "2022 IEEE 17th International Conference on Control and Automation, ICCA 2022",

address = "United States",

}

Fan, Y, Deng, F, Shi, X & Yang, J 2022, Learning to Solve Pod Retrieval as Sequential Decision Making Problem. in 2022 IEEE 17th International Conference on Control and Automation, ICCA 2022. IEEE International Conference on Control and Automation, ICCA, vol. 2022-June, IEEE Computer Society, pp. 220-224, 17th IEEE International Conference on Control and Automation, ICCA 2022, Naples, Italy, 27/06/22. https://doi.org/10.1109/ICCA54724.2022.9831817

Learning to Solve Pod Retrieval as Sequential Decision Making Problem. / Fan, Yunfeng; Deng, Fang; Shi, Xiang et al.
2022 IEEE 17th International Conference on Control and Automation, ICCA 2022. IEEE Computer Society, 2022. p. 220-224 (IEEE International Conference on Control and Automation, ICCA; Vol. 2022-June).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Learning to Solve Pod Retrieval as Sequential Decision Making Problem

AU - Fan, Yunfeng

AU - Deng, Fang

AU - Shi, Xiang

AU - Yang, Jing

PY - 2022

Y1 - 2022

N2 - The problem of pod retrieval in Robotic Mobile Fulfilment System (RMFS) is a key problem to improve the order picking efficiency. In such system, each robot needs to complete a set of retrieval requests, including bringing each pod from a retrieval location to a picking station and return the pod to a storage location. The objective is to minimize the total cost for each robot with all retrieval requests completed. In the previous literature, the problem was viewed as a static combinatorial optimization problem, which was commonly solved by heuristic methods. This kind of approachs often face with computational efficiency problems and are hard to satisfy the real-time requirement in complex real scenes. In this paper, we formulate the problem as a Markov Decision Process, a kind of Sequential Decision Making Problem, and then using Transformer with reinforcement learning to learn an efficient retrieval policy. The effectiveness of the method is verified by experiments.

AB - The problem of pod retrieval in Robotic Mobile Fulfilment System (RMFS) is a key problem to improve the order picking efficiency. In such system, each robot needs to complete a set of retrieval requests, including bringing each pod from a retrieval location to a picking station and return the pod to a storage location. The objective is to minimize the total cost for each robot with all retrieval requests completed. In the previous literature, the problem was viewed as a static combinatorial optimization problem, which was commonly solved by heuristic methods. This kind of approachs often face with computational efficiency problems and are hard to satisfy the real-time requirement in complex real scenes. In this paper, we formulate the problem as a Markov Decision Process, a kind of Sequential Decision Making Problem, and then using Transformer with reinforcement learning to learn an efficient retrieval policy. The effectiveness of the method is verified by experiments.

UR - http://www.scopus.com/inward/record.url?scp=85135825805&partnerID=8YFLogxK

U2 - 10.1109/ICCA54724.2022.9831817

DO - 10.1109/ICCA54724.2022.9831817

M3 - Conference contribution

AN - SCOPUS:85135825805

T3 - IEEE International Conference on Control and Automation, ICCA

SP - 220

EP - 224

BT - 2022 IEEE 17th International Conference on Control and Automation, ICCA 2022

PB - IEEE Computer Society

T2 - 17th IEEE International Conference on Control and Automation, ICCA 2022

Y2 - 27 June 2022 through 30 June 2022

ER -

Learning to Solve Pod Retrieval as Sequential Decision Making Problem

Abstract

Publication series

Conference

Access to Document

Other files and links

Fingerprint

Cite this