Skip to main navigation Skip to search Skip to main content

Real-time Scheduling Simulation Optimization Method for Smart Production Lines Based on Digital Twin and Reinforcement Learning

  • Zehao Yang
  • , Wei Dong
  • , Sihan Huang*
  • , Yanchao Yin
  • , Liyang Dong
  • , Zujie Zheng
  • *Corresponding author for this work
  • Beijing Institute of Technology
  • Kunming University of Science and Technology
  • Ltd.
  • Shanghai Spaceflight Precision Machinery Institute

Research output: Contribution to journalArticlepeer-review

Abstract

Production scheduling remains a perpetual research hotspot in industry, serving as a critical metronome for the efficient operation of production lines. With the continuous evolution of intelligent manufacturing, smart production scheduling has emerged as a cutting-edge frontier. Multi-source stochastic disturbances, such as production task variations, the coupling of manufacturing resources, and others, pose a significant challenge in balancing scheduling efficiency and accuracy during dynamic production. To address this challenge, a real-time scheduling simulation optimization method based on digital twin (DT) and reinforcement learning (RL) is proposed. DT technology is used to construct high-fidelity models of production lines, establishing a hierarchical and high-fidelity virtual production simulation environment. An improved Q-Learning algorithm is developed to establish a scheduling optimization agent, incorporating triple state space reconstruction, a multi-dimensional reward function, and a dual exploration strategy to mitigate the curse of dimensionality and the robustness limitations inherent in traditional algorithms. Furthermore, a hierarchical execution control architecture is established based on perception-decision-execution loop throughout the production simulation process to achieve deep fusion between the DT and the intelligent simulation agent. A case study focusing on aerospace product final assembly line is provided to demonstrate the effectiveness of the proposed method. The result shows that the execution distances yielded by five other classical scheduling rules are 6.38% to 16.50% higher than those of the proposed method, signifying a substantial improvement in manufacturing resource collaborative efficiency.

Translated title of the contribution基于数字孪生和强化学习的智能生产线实时调度仿真优化方法
Original languageEnglish
JournalJixie Gongcheng Xuebao/Chinese Journal of Mechanical Engineering
Volume62
Issue number5
DOIs
Publication statusPublished - 5 Mar 2026

Keywords

  • digital twin
  • production scheduling
  • reinforcement learning
  • simulation optimization
  • smart production lines

Fingerprint

Dive into the research topics of 'Real-time Scheduling Simulation Optimization Method for Smart Production Lines Based on Digital Twin and Reinforcement Learning'. Together they form a unique fingerprint.

Cite this