TY - JOUR
T1 - SDPENetv2
T2 - Spacecraft Pose Estimation Network With Learnable Token Head Based on Discrete Pose Weights
AU - Zhou, Hang
AU - Yao, Lu
AU - She, Haoping
AU - Si, Weiyong
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2025
Y1 - 2025
N2 - Reliable pose estimation of non-cooperative spacecraft is a key technology for on-orbit servicing and active space debris removal missions. Currently, deep learning has become the mainstream method for spacecraft pose estimation. However, existing methods suffer from problems such as excessive parameters, high computational complexity, and relatively low efficiency in feature utilization. This letter proposes SDPENetv2 to address these issues. We represent the pose of the target spacecraft using discrete pose weights. By introducing an additional constraint term into the loss function, the network can learn a more appropriate weight distribution in the later stages of training, improving the accuracy of pose estimation. In addition, we propose the learnable token head, which possesses global attention and can make more comprehensive use of the features extracted by the convolutional neural network. Experiments on the SPEED dataset demonstrate that the position and attitude estimation error of SDPENetv2 are reduced to 0.105 m and 1.145◦, respectively. Compared with other works, these errors are reduced by 20.45% –86.59% and 32.65% –91.80%, respectively. Additionally, SDPENetv2 has only 5.6 M parameters and a computational complexity of merely 1.542 GMACs.
AB - Reliable pose estimation of non-cooperative spacecraft is a key technology for on-orbit servicing and active space debris removal missions. Currently, deep learning has become the mainstream method for spacecraft pose estimation. However, existing methods suffer from problems such as excessive parameters, high computational complexity, and relatively low efficiency in feature utilization. This letter proposes SDPENetv2 to address these issues. We represent the pose of the target spacecraft using discrete pose weights. By introducing an additional constraint term into the loss function, the network can learn a more appropriate weight distribution in the later stages of training, improving the accuracy of pose estimation. In addition, we propose the learnable token head, which possesses global attention and can make more comprehensive use of the features extracted by the convolutional neural network. Experiments on the SPEED dataset demonstrate that the position and attitude estimation error of SDPENetv2 are reduced to 0.105 m and 1.145◦, respectively. Compared with other works, these errors are reduced by 20.45% –86.59% and 32.65% –91.80%, respectively. Additionally, SDPENetv2 has only 5.6 M parameters and a computational complexity of merely 1.542 GMACs.
KW - aerial systems: Applications
KW - AI-based methods
KW - computer vision for automation
UR - https://www.scopus.com/pages/publications/105020741995
U2 - 10.1109/LRA.2025.3627081
DO - 10.1109/LRA.2025.3627081
M3 - Article
AN - SCOPUS:105020741995
SN - 2377-3766
VL - 10
SP - 13034
EP - 13041
JO - IEEE Robotics and Automation Letters
JF - IEEE Robotics and Automation Letters
IS - 12
ER -