跳到主要导航 跳到搜索 跳到主要内容

Digital twin-driven multi-agent collaborative online optimization of production regulation for smart reconfigurable manufacturing systems with human-robot collaboration

  • Ming Huang
  • , Sihan Huang*
  • , Liang Gao
  • , Wei Dong
  • , Yonghui Zhang
  • , Xi Gu
  • , Zenggui Gao
  • *此作品的通讯作者
  • Beijing Institute of Technology
  • Ministry of Industry and Information Technology
  • Huazhong University of Science and Technology
  • Dalian University of Technology
  • Shanghai University

科研成果: 期刊稿件文章同行评审

摘要

In Industry 4.0 era, the synchronous development of high technologies and advanced manufacturing technologies significantly fill the gap between demand and production, where the smart reconfigurable manufacturing systems consisting of more and more robots is one of the best solutions to handle the dynamic, uncertain, and individual demands. The emerging Industry 5.0 provides a brand-new perspective for enhancing the flexibility of production by emphasizing human centricity resulting in popular human-robot collaboration. Accordingly, smart reconfigurable manufacturing systems with human-robot collaboration (SRMS-HRC) combining the advantages of Industry 4.0 and Industry 5.0 shows vigorous vitality for future production, which is featured by high-level flexibility and smartness simultaneously. However, the ultrahigh flexibility of SRMS-HRC brings in extra challenges for production organization, including changeable operation sequences, dynamic position assignment, and human-robot configuration. Meanwhile, the traditional offline optimization of planning-first-and-executing-later cannot satisfy the rapid responsiveness of SRMS-HRC. Therefore, a digital twin-driven multi-agent collaborative online optimization of production regulation for SRMS-HRC based on multi-objective deep reinforcement learning is proposed in this paper, where digital twin is adopted to enhance responsiveness for changes and deep reinforcement learning is used to generate online optimization schemes in a smarter way. A Markov decision process with order size-independent state representations and a rule-based multi-agent action set with explicit semantics is constructed. A normalized Tchebycheff reward aggregation method enables simultaneous optimization of enterprise-level and human-centered objectives. Furthermore, a multi-objective multi-agent twin delayed deep deterministic policy gradient (MO-MATD3) is developed to train collaborating agents. Numerical experiments and simulation scenario validations on industrial cases of varying scales demonstrate that the proposed approach effectively generates high-performance online optimization schemes, outperforming single-agent architectures, heuristic rules, and other deep reinforcement learning methods.

源语言英语
文章编号103322
期刊Robotics and Computer-Integrated Manufacturing
101
DOI
出版状态已出版 - 10月 2026
已对外发布

指纹

探究 'Digital twin-driven multi-agent collaborative online optimization of production regulation for smart reconfigurable manufacturing systems with human-robot collaboration' 的科研主题。它们共同构成独一无二的指纹。

引用此