TY - JOUR
T1 - Real-time AGV scheduling optimisation method with deep reinforcement learning for energy-efficiency in the container terminal yard
AU - Gong, Lin
AU - Huang, Zijie
AU - Xiang, Xi
AU - Liu, Xin
N1 - Publisher Copyright:
© 2024 Informa UK Limited, trading as Taylor & Francis Group.
PY - 2024
Y1 - 2024
N2 - The increasing vessel size and automation level have shifted the productivity bottleneck of automated container terminals from the terminal side to the yard side. Operating an automated container terminal (ACT) yard with a big number of automated guided vehicles (AGV) is challenging due to the complexity and dynamics of the system, severely affecting the operational efficiency and energy use efficiency. In this paper, a hybrid multi-AGV scheduling algorithm is proposed to minimise the energy consumption and the total makespan of AGVs in an ACT yard. This framework first models the AGV scheduling process as a Markov decision process (MDP). Furthermore, a novel scheduling algorithm called MDAS is proposed based on multi-agent deep deterministic policy gradient (MADDPG) to facilitate online real-time scheduling decision-making. Finally, simulation experiments show that the proposed method can effectively enhance the operational efficiency and energy use performance of AGVs in ACT yards of various scales by comparing with benchmarking methods.
AB - The increasing vessel size and automation level have shifted the productivity bottleneck of automated container terminals from the terminal side to the yard side. Operating an automated container terminal (ACT) yard with a big number of automated guided vehicles (AGV) is challenging due to the complexity and dynamics of the system, severely affecting the operational efficiency and energy use efficiency. In this paper, a hybrid multi-AGV scheduling algorithm is proposed to minimise the energy consumption and the total makespan of AGVs in an ACT yard. This framework first models the AGV scheduling process as a Markov decision process (MDP). Furthermore, a novel scheduling algorithm called MDAS is proposed based on multi-agent deep deterministic policy gradient (MADDPG) to facilitate online real-time scheduling decision-making. Finally, simulation experiments show that the proposed method can effectively enhance the operational efficiency and energy use performance of AGVs in ACT yards of various scales by comparing with benchmarking methods.
KW - AGV real-time scheduling
KW - actor-critic networks
KW - container terminal yard
KW - deep reinforcement learning
KW - multi-agent systems
UR - http://www.scopus.com/inward/record.url?scp=85188844848&partnerID=8YFLogxK
U2 - 10.1080/00207543.2024.2325583
DO - 10.1080/00207543.2024.2325583
M3 - Article
AN - SCOPUS:85188844848
SN - 0020-7543
VL - 62
SP - 7722
EP - 7742
JO - International Journal of Production Research
JF - International Journal of Production Research
IS - 21
ER -