TY - GEN
T1 - MAPF scenario modelling for human-robot collaboration
AU - Tian, Xuan
AU - Hu, Yaoguang
AU - Wang, Jingfei
AU - Yang, Xiaonan
AU - Yang, Jianxin
AU - Hu, Xiang
AU - Xu, Wenping
AU - Li, Mingyu
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - With the rapid development of artificial intelligence, the multi-agent pathfinding problem (MAPF) has emerged in recent years. It is regarded as an NP-Hard problem that involves coordinating the movements of multiple agents within the same environment to perform different tasks. In recent years, MAPF has found numerous applications in automated industrial scenarios, prompting the proposal of several learning-based methods to solve MAPF challenges. In fact, in industrial application scenarios, the completion of complex tasks can be hampered by the robots' limited observational abilities to perceive their environment. Therefore, it is necessary to combine the flexibility of humans with intelligent manufacturing to achieve high levels of flexibility, efficiency, and safety through human-machine collaboration. This study proposes a model for human-machine collaboration in MAPF scenarios, based on the characteristics of human-machine interactions. The effectiveness of a multi-agent reinforcement learning algorithm in addressing dynamic MAPF problems in human-machine collaboration is verified. Finally, building on the classical multi-agent reinforcement learning algorithm MAA2C, the A*-MAA2C algorithm is proposed. Ablation experiments have verified its superiority over the standard MAA2C algorithm.
AB - With the rapid development of artificial intelligence, the multi-agent pathfinding problem (MAPF) has emerged in recent years. It is regarded as an NP-Hard problem that involves coordinating the movements of multiple agents within the same environment to perform different tasks. In recent years, MAPF has found numerous applications in automated industrial scenarios, prompting the proposal of several learning-based methods to solve MAPF challenges. In fact, in industrial application scenarios, the completion of complex tasks can be hampered by the robots' limited observational abilities to perceive their environment. Therefore, it is necessary to combine the flexibility of humans with intelligent manufacturing to achieve high levels of flexibility, efficiency, and safety through human-machine collaboration. This study proposes a model for human-machine collaboration in MAPF scenarios, based on the characteristics of human-machine interactions. The effectiveness of a multi-agent reinforcement learning algorithm in addressing dynamic MAPF problems in human-machine collaboration is verified. Finally, building on the classical multi-agent reinforcement learning algorithm MAA2C, the A*-MAA2C algorithm is proposed. Ablation experiments have verified its superiority over the standard MAA2C algorithm.
KW - Human-Robot Collaboration
KW - MAPF
KW - Multi-Agent Reinforcement Learning
UR - https://www.scopus.com/pages/publications/105033886411
U2 - 10.1109/IEEM63636.2025.11357613
DO - 10.1109/IEEM63636.2025.11357613
M3 - Conference contribution
AN - SCOPUS:105033886411
T3 - IEEE International Conference on Industrial Engineering and Engineering Management
SP - 1356
EP - 1360
BT - IEEM 2025 - IEEE International Conference on Industrial Engineering and Engineering Management
PB - IEEE Computer Society
T2 - 2025 IEEE International Conference on Industrial Engineering and Engineering Management, IEEM 2025
Y2 - 7 December 2025 through 10 December 2025
ER -