TY - GEN
T1 - Mobile-SPEEDNet
T2 - 25th IEEE International Conference on Industrial Technology, ICIT 2024
AU - Yao, Lu
AU - She, Haoping
AU - Si, Weiyong
AU - Zhou, Hang
AU - Yang, Borui
AU - Xu, Zhongnan
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Considering that the spacecraft pose estimation model method deployed on the onboard computer must have low storage and high performance, an end-to-end regression network, Mobile-SPEEDNet, is proposed. Because of the complex spatial background and sensitivity to image resolution in spacecraft pose estimation, Mobile-SPEEDNet takes MobileN et-v2 as the backbone network, embeds the Coordinate Attention module in partially the inverted residual modules, adds multi-scale feature layer fusion to enhance features, and uses the Spatial Pyramid Pooling layer to extract features, decoupling the position and attitude quaternion information for output. This paper also analyzes the impact of target distance on pose estimation, the effectiveness of attention mechanisms, and the relationship between fine-grained attitude soft assignment encoding and model performance. Finally, experimental results tested on the validation set of the SPEED synthetic dataset are presented to demonstrate the performance, and some prediction results are also presented. The Mobile-SPEEDNet 12-bins model, which has 7.1 million parameters with an average position error of 0.254 meters and an average attitude error of 5.21 degrees, achieves the optimal balance between network parameters and performance.
AB - Considering that the spacecraft pose estimation model method deployed on the onboard computer must have low storage and high performance, an end-to-end regression network, Mobile-SPEEDNet, is proposed. Because of the complex spatial background and sensitivity to image resolution in spacecraft pose estimation, Mobile-SPEEDNet takes MobileN et-v2 as the backbone network, embeds the Coordinate Attention module in partially the inverted residual modules, adds multi-scale feature layer fusion to enhance features, and uses the Spatial Pyramid Pooling layer to extract features, decoupling the position and attitude quaternion information for output. This paper also analyzes the impact of target distance on pose estimation, the effectiveness of attention mechanisms, and the relationship between fine-grained attitude soft assignment encoding and model performance. Finally, experimental results tested on the validation set of the SPEED synthetic dataset are presented to demonstrate the performance, and some prediction results are also presented. The Mobile-SPEEDNet 12-bins model, which has 7.1 million parameters with an average position error of 0.254 meters and an average attitude error of 5.21 degrees, achieves the optimal balance between network parameters and performance.
KW - deep learning
KW - Non-cooperative spacecraft
KW - pose estimation
UR - http://www.scopus.com/inward/record.url?scp=85195780643&partnerID=8YFLogxK
U2 - 10.1109/ICIT58233.2024.10540873
DO - 10.1109/ICIT58233.2024.10540873
M3 - Conference contribution
AN - SCOPUS:85195780643
T3 - Proceedings of the IEEE International Conference on Industrial Technology
BT - ICIT 2024 - 2024 25th International Conference on Industrial Technology
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 25 March 2024 through 27 March 2024
ER -