TY - GEN
T1 - Optimization Method for the Propagation Path of Complex Product Design Change Based on Reinforcement Learning
AU - Wei, Zhuqin
AU - Li, Haokun
AU - Li, Guannan
AU - Wang, Ru
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.
PY - 2024
Y1 - 2024
N2 - Design changes are a common situation in the update and replacement of complex products, but the parts of complex products have characteristics such as large quantity, high coupling degree, and complex assembly relationships, which leads to significant decision-making difficulties for designers when a part has a design change. To address the above problems, this paper proposes a reinforcement learning-based method for optimizing the propagation path of complex product design changes. Taking parts as state nodes and design change propagation impact as actions, the reward function is determined by considering three objectives: product change time, change cost, and assembly difficulty. A Markov decision process (MDP) model for designing change propagation path optimization is constructed, and Monte Carlo reinforcement learning is used to solve the model. This method provides optimal design change propagation paths for designers. Finally, a robot suspension system design change is used as an example to verify the feasibility and effectiveness of the method.
AB - Design changes are a common situation in the update and replacement of complex products, but the parts of complex products have characteristics such as large quantity, high coupling degree, and complex assembly relationships, which leads to significant decision-making difficulties for designers when a part has a design change. To address the above problems, this paper proposes a reinforcement learning-based method for optimizing the propagation path of complex product design changes. Taking parts as state nodes and design change propagation impact as actions, the reward function is determined by considering three objectives: product change time, change cost, and assembly difficulty. A Markov decision process (MDP) model for designing change propagation path optimization is constructed, and Monte Carlo reinforcement learning is used to solve the model. This method provides optimal design change propagation paths for designers. Finally, a robot suspension system design change is used as an example to verify the feasibility and effectiveness of the method.
KW - Complex products
KW - Design changes
KW - Propagation path
KW - Reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85199262986&partnerID=8YFLogxK
U2 - 10.1007/978-981-97-0922-9_13
DO - 10.1007/978-981-97-0922-9_13
M3 - Conference contribution
AN - SCOPUS:85199262986
SN - 9789819709212
T3 - Mechanisms and Machine Science
SP - 207
EP - 218
BT - Advances in Mechanical Design - The Proceedings of the 2023 International Conference on Mechanical Design, ICMD 2023
A2 - Tan, Jianrong
A2 - Liu, Yu
A2 - Huang, Hong-Zhong
A2 - Yu, Jingjun
A2 - Wang, Zequn
PB - Springer Science and Business Media B.V.
T2 - International Conference on Mechanical Design, ICMD 2023
Y2 - 20 October 2023 through 22 October 2023
ER -