Large-Scale Multirobot Task Planning Using Efficient Hierarchical Reinforcement Learning

  • Beijing Institute of Technology
  • Tsinghua University
  • Ltd.
  • Ltd.
  • Tongji University

Research output: Contribution to journal, article, peer-reviewed

Abstract

Multirobot task planning (MRTP) at scale in robotic mobile fulfillment systems (RMFS) remains a challenge due to the curse of dimensionality and complex dynamic properties. To address these challenges, we construct an end-to-end multirobot task planner that scales to large systems by learning hierarchical planning policies. In this planner, we design a centralized hierarchical temporal task planning framework to mitigate the curse of dimensionality while ensuring timely dynamic response. Following this framework, we propose a novel cycle-constrained asynchronous temporal graph to provide a foundation for modeling the system dynamics. Based on the graph representation, we formulate the MRTP problem as a semi-Markov decision process (SMDP) that focuses solely on critical interaction points to improve computational and sampling efficiency. The policies in the SMDP are parameterized via a hierarchical temporal attention network with temporal embedding layers to enhance spatio-temporal feature extraction. In addition, the decoder masks in this network naturally ensure that the generated actions strictly satisfy the required dynamic hard constraints. These hierarchical policies are jointly optimized using an efficient hierarchical REINFORCE method with a rollout counterfactual baseline. To further enhance generalization performance on unlearned instances while preventing catastrophic forgetting, we extend the method with region expansion curricula. Experiments demonstrate that our planner outperforms state-of-the-art methods on different MRTP instances across simulated and real-world RMFS. It successfully scales to instances with up to 200 robots and 1000 retrieval racks on unlearned maps while maintaining its performance advantages.
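
The abstract's claim that decoder masks guarantee hard-constraint satisfaction can be illustrated with a generic masked softmax. This is a minimal sketch of the general technique, not the paper's network: the function name `masked_softmax`, the example logits, and the feasibility flags are all hypothetical. Infeasible actions receive a logit of negative infinity, so their probability is exactly zero and they can never be sampled.

```python
import math


def masked_softmax(logits, feasible):
    """Softmax over action logits in which infeasible actions get probability 0.

    `logits` and `feasible` are hypothetical inputs: one score and one
    boolean feasibility flag per candidate action.
    """
    if not any(feasible):
        raise ValueError("at least one action must be feasible")
    # Infeasible actions are pushed to -inf, so exp() maps them to exactly 0.
    masked = [x if ok else -math.inf for x, ok in zip(logits, feasible)]
    peak = max(masked)  # finite, because at least one action is feasible
    exps = [math.exp(x - peak) for x in masked]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical example: the second action violates a constraint.
probs = masked_softmax([2.0, 1.0, 0.5], [True, False, True])
```

Because the mask is applied before normalization, the remaining probability mass is redistributed over feasible actions only, which is why no post hoc repair of sampled actions is needed in this kind of scheme.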

Original language: English
Pages (from-to): 2146-2165
Number of pages: 20
Journal: IEEE Transactions on Robotics
Volume: 42
DOI
Publication status: Published - 2026
Externally published