Real-time AGV scheduling optimisation method with deep reinforcement learning for energy-efficiency in the container terminal yard

Lin Gong; Zijie Huang; Xi Xiang; Xin Liu

doi:10.1080/00207543.2024.2325583

Real-time AGV scheduling optimisation method with deep reinforcement learning for energy-efficiency in the container terminal yard

Lin Gong, Zijie Huang, Xi Xiang, Xin Liu^*

^*Corresponding author for this work

School of Mechanical Engineering

Beijing Institute of Technology

Research output: Contribution to journal › Article › peer-review

6 Citations (Scopus)

Abstract

The increasing vessel size and automation level have shifted the productivity bottleneck of automated container terminals from the terminal side to the yard side. Operating an automated container terminal (ACT) yard with a big number of automated guided vehicles (AGV) is challenging due to the complexity and dynamics of the system, severely affecting the operational efficiency and energy use efficiency. In this paper, a hybrid multi-AGV scheduling algorithm is proposed to minimise the energy consumption and the total makespan of AGVs in an ACT yard. This framework first models the AGV scheduling process as a Markov decision process (MDP). Furthermore, a novel scheduling algorithm called MDAS is proposed based on multi-agent deep deterministic policy gradient (MADDPG) to facilitate online real-time scheduling decision-making. Finally, simulation experiments show that the proposed method can effectively enhance the operational efficiency and energy use performance of AGVs in ACT yards of various scales by comparing with benchmarking methods.

Original language	English
Pages (from-to)	7722-7742
Number of pages	21
Journal	International Journal of Production Research
Volume	62
Issue number	21
DOIs	https://doi.org/10.1080/00207543.2024.2325583
Publication status	Published - 2024

Keywords

AGV real-time scheduling
actor-critic networks
container terminal yard
deep reinforcement learning
multi-agent systems

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.1080/00207543.2024.2325583

Cite this

Gong, L., Huang, Z., Xiang, X., & Liu, X. (2024). Real-time AGV scheduling optimisation method with deep reinforcement learning for energy-efficiency in the container terminal yard. International Journal of Production Research, 62(21), 7722-7742. https://doi.org/10.1080/00207543.2024.2325583

@article{385e647ccd104ceaa96ba136bcdc2e75,

title = "Real-time AGV scheduling optimisation method with deep reinforcement learning for energy-efficiency in the container terminal yard",

abstract = "The increasing vessel size and automation level have shifted the productivity bottleneck of automated container terminals from the terminal side to the yard side. Operating an automated container terminal (ACT) yard with a big number of automated guided vehicles (AGV) is challenging due to the complexity and dynamics of the system, severely affecting the operational efficiency and energy use efficiency. In this paper, a hybrid multi-AGV scheduling algorithm is proposed to minimise the energy consumption and the total makespan of AGVs in an ACT yard. This framework first models the AGV scheduling process as a Markov decision process (MDP). Furthermore, a novel scheduling algorithm called MDAS is proposed based on multi-agent deep deterministic policy gradient (MADDPG) to facilitate online real-time scheduling decision-making. Finally, simulation experiments show that the proposed method can effectively enhance the operational efficiency and energy use performance of AGVs in ACT yards of various scales by comparing with benchmarking methods.",

keywords = "AGV real-time scheduling, actor-critic networks, container terminal yard, deep reinforcement learning, multi-agent systems",

author = "Lin Gong and Zijie Huang and Xi Xiang and Xin Liu",

note = "Publisher Copyright: {\textcopyright} 2024 Informa UK Limited, trading as Taylor & Francis Group.",

year = "2024",

doi = "10.1080/00207543.2024.2325583",

language = "English",

volume = "62",

pages = "7722--7742",

journal = "International Journal of Production Research",

issn = "0020-7543",

publisher = "Taylor and Francis Ltd.",

number = "21",

}

TY - JOUR

T1 - Real-time AGV scheduling optimisation method with deep reinforcement learning for energy-efficiency in the container terminal yard

AU - Gong, Lin

AU - Huang, Zijie

AU - Xiang, Xi

AU - Liu, Xin

PY - 2024

Y1 - 2024

N2 - The increasing vessel size and automation level have shifted the productivity bottleneck of automated container terminals from the terminal side to the yard side. Operating an automated container terminal (ACT) yard with a big number of automated guided vehicles (AGV) is challenging due to the complexity and dynamics of the system, severely affecting the operational efficiency and energy use efficiency. In this paper, a hybrid multi-AGV scheduling algorithm is proposed to minimise the energy consumption and the total makespan of AGVs in an ACT yard. This framework first models the AGV scheduling process as a Markov decision process (MDP). Furthermore, a novel scheduling algorithm called MDAS is proposed based on multi-agent deep deterministic policy gradient (MADDPG) to facilitate online real-time scheduling decision-making. Finally, simulation experiments show that the proposed method can effectively enhance the operational efficiency and energy use performance of AGVs in ACT yards of various scales by comparing with benchmarking methods.

AB - The increasing vessel size and automation level have shifted the productivity bottleneck of automated container terminals from the terminal side to the yard side. Operating an automated container terminal (ACT) yard with a big number of automated guided vehicles (AGV) is challenging due to the complexity and dynamics of the system, severely affecting the operational efficiency and energy use efficiency. In this paper, a hybrid multi-AGV scheduling algorithm is proposed to minimise the energy consumption and the total makespan of AGVs in an ACT yard. This framework first models the AGV scheduling process as a Markov decision process (MDP). Furthermore, a novel scheduling algorithm called MDAS is proposed based on multi-agent deep deterministic policy gradient (MADDPG) to facilitate online real-time scheduling decision-making. Finally, simulation experiments show that the proposed method can effectively enhance the operational efficiency and energy use performance of AGVs in ACT yards of various scales by comparing with benchmarking methods.

KW - AGV real-time scheduling

KW - actor-critic networks

KW - container terminal yard

KW - deep reinforcement learning

KW - multi-agent systems

UR - http://www.scopus.com/inward/record.url?scp=85188844848&partnerID=8YFLogxK

U2 - 10.1080/00207543.2024.2325583

DO - 10.1080/00207543.2024.2325583

M3 - Article

AN - SCOPUS:85188844848

SN - 0020-7543

VL - 62

SP - 7722

EP - 7742

JO - International Journal of Production Research

JF - International Journal of Production Research

IS - 21

ER -

Real-time AGV scheduling optimisation method with deep reinforcement learning for energy-efficiency in the container terminal yard

Abstract

Keywords

UN SDGs

Access to Document

Other files and links

Fingerprint

Cite this