TY - GEN
T1 - Dynamic Obstacle Avoidance for Magnetic Helical Microrobots Based on Deep Reinforcement Learning
AU - Qiu, Yukang
AU - Hou, Yaozhen
AU - Yang, Haotian
AU - Gao, Yigao
AU - Huang, Hen Wei
AU - Shi, Qing
AU - Huang, Qiang
AU - Wang, Huaping
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Magnetic helical microrobots hold immense promise in biomedical domains owing to their compact size and efficient propulsion capabilities. However, navigating these microrobots through dynamic and unstructured environments, particularly when encountering numerous dynamic obstacles, remains a formidable challenge. In the study, a control framework based on deep reinforcement learning (DRL) with the objective of guiding a microrobot through dynamic obstacles towards specified target goals is introduced. Initially, we design and fabricate a microdrill capable of propulsion via external magnetic rotating fields produced by our magnetic actuation system. Subsequently, we construct a custom training environment, adhering to the OpenAI gym interface, to serve as the simulator for training purposes. Utilizing the proximal policy optimization algorithm, we conduct training of the navigation policy within this simulator. Simulations and experimental validations conducted in dynamic environments affirms the efficacy of the proposed method.
AB - Magnetic helical microrobots hold immense promise in biomedical domains owing to their compact size and efficient propulsion capabilities. However, navigating these microrobots through dynamic and unstructured environments, particularly when encountering numerous dynamic obstacles, remains a formidable challenge. In the study, a control framework based on deep reinforcement learning (DRL) with the objective of guiding a microrobot through dynamic obstacles towards specified target goals is introduced. Initially, we design and fabricate a microdrill capable of propulsion via external magnetic rotating fields produced by our magnetic actuation system. Subsequently, we construct a custom training environment, adhering to the OpenAI gym interface, to serve as the simulator for training purposes. Utilizing the proximal policy optimization algorithm, we conduct training of the navigation policy within this simulator. Simulations and experimental validations conducted in dynamic environments affirms the efficacy of the proposed method.
UR - http://www.scopus.com/inward/record.url?scp=85205303533&partnerID=8YFLogxK
U2 - 10.1109/RCAR61438.2024.10670936
DO - 10.1109/RCAR61438.2024.10670936
M3 - Conference contribution
AN - SCOPUS:85205303533
T3 - 2024 IEEE International Conference on Real-Time Computing and Robotics, RCAR 2024
SP - 298
EP - 303
BT - 2024 IEEE International Conference on Real-Time Computing and Robotics, RCAR 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2024 IEEE International Conference on Real-Time Computing and Robotics, RCAR 2024
Y2 - 24 June 2024 through 28 June 2024
ER -