TY - GEN
T1 - A DQN-Based Cargo Loading Task Planning Method for Multiple Service Desks
AU - Zhou, Wenming
AU - Cao, Tianze
AU - Tian, Tongyu
AU - Li, Haolin
AU - Cao, Yilin
AU - Zhao, Sanyuan
AU - Zheng, Lin
AU - Sun, Lin
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Addressing the cargo loading task planning problem in environments with multiple service desks of heterogeneous efficiency, this paper proposes an intelligent optimization method based on Deep Q-Networks(DQN). Traditional logistics scheduling methods face challenges in high-dimensional state representation and local optima traps when handling the strong coupling relationships among service desk efficiency differences, cargo compatibility constraints, and dynamic task allocation. This study quantifies efficiency disparities by constructing a service desk-cargo efficiency matrix, designs a composite state space integrating remaining cargo quantities and service desk timelines, and innovatively introduces an adaptive reward function that combines time cost, load balancing, and constraint penalties. Experimental results demonstrate that the proposed method reduces the total completion time of parallel operations across multiple service desks by 8.2% compared to traditional heuristic algorithms, while adhering to transport vehicle capacity and cargo compatibility constraints. This approach provides a novel theoretical framework and technical implementation pathway for dynamic task scheduling in complex logistics scenarios.
AB - Addressing the cargo loading task planning problem in environments with multiple service desks of heterogeneous efficiency, this paper proposes an intelligent optimization method based on Deep Q-Networks(DQN). Traditional logistics scheduling methods face challenges in high-dimensional state representation and local optima traps when handling the strong coupling relationships among service desk efficiency differences, cargo compatibility constraints, and dynamic task allocation. This study quantifies efficiency disparities by constructing a service desk-cargo efficiency matrix, designs a composite state space integrating remaining cargo quantities and service desk timelines, and innovatively introduces an adaptive reward function that combines time cost, load balancing, and constraint penalties. Experimental results demonstrate that the proposed method reduces the total completion time of parallel operations across multiple service desks by 8.2% compared to traditional heuristic algorithms, while adhering to transport vehicle capacity and cargo compatibility constraints. This approach provides a novel theoretical framework and technical implementation pathway for dynamic task scheduling in complex logistics scenarios.
KW - cargo loading optimization
KW - Deep reinforcement learning
KW - multi-service desk scheduling
UR - https://www.scopus.com/pages/publications/105038010147
U2 - 10.1109/AICSIP65423.2025.11427212
DO - 10.1109/AICSIP65423.2025.11427212
M3 - Conference contribution
AN - SCOPUS:105038010147
T3 - 2025 IEEE 7th International Conference on Artificial Intelligence, Computer Science, and Information Processing, AICSIP 2025
BT - 2025 IEEE 7th International Conference on Artificial Intelligence, Computer Science, and Information Processing, AICSIP 2025
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 7th IEEE International Conference on Artificial Intelligence, Computer Science, and Information Processing, AICSIP 2025
Y2 - 25 July 2025 through 27 July 2025
ER -