TY - JOUR
T1 - Latent Representation Self-Supervised Pose Network for Accurate Monocular Pipe Pose Estimation
AU - Hu, Jia
AU - Liu, Shaoli
AU - Liu, Jianhua
AU - Wang, Zhenjie
AU - Zhang, Wenxiong
N1 - Publisher Copyright:
© 2005-2012 IEEE.
PY - 2023/5/1
Y1 - 2023/5/1
N2 - Accurate pipe pose estimation plays a pivotal role in the automatic assembly of pipelines. Recently, data-driven deep neural networks have been proven capable of estimating pose. Nonetheless, a large number of labeled datasets are required during the training process. One effective solution is to estimate pose using self-supervised learning. However, existing algorithms are difficult to deal with textureless objects (like pipes), and they avoid the occlusion problem. To this end, in this article, we propose a latent representation self-supervised pose network (LSPN) for accurate monocular pipe pose estimation. We train our network with synthetic RGB (Red, Green, Blue) data, where only a few labeled samples are used to establish the latent pose space, whereas a large number of structured unlabeled samples are used to learn latent pose representation in self-supervised learning. Experiments demonstrate that LSPN achieves excellent performance on real data and is robust to different environments, such as illumination changes and self-occlusion.
AB - Accurate pipe pose estimation plays a pivotal role in the automatic assembly of pipelines. Recently, data-driven deep neural networks have been proven capable of estimating pose. Nonetheless, a large number of labeled datasets are required during the training process. One effective solution is to estimate pose using self-supervised learning. However, existing algorithms are difficult to deal with textureless objects (like pipes), and they avoid the occlusion problem. To this end, in this article, we propose a latent representation self-supervised pose network (LSPN) for accurate monocular pipe pose estimation. We train our network with synthetic RGB (Red, Green, Blue) data, where only a few labeled samples are used to establish the latent pose space, whereas a large number of structured unlabeled samples are used to learn latent pose representation in self-supervised learning. Experiments demonstrate that LSPN achieves excellent performance on real data and is robust to different environments, such as illumination changes and self-occlusion.
KW - Latent representation
KW - monocular
KW - pipe pose estimation
KW - self-supervised learning
UR - http://www.scopus.com/inward/record.url?scp=85144048213&partnerID=8YFLogxK
U2 - 10.1109/TII.2022.3225028
DO - 10.1109/TII.2022.3225028
M3 - Article
AN - SCOPUS:85144048213
SN - 1551-3203
VL - 19
SP - 7180
EP - 7189
JO - IEEE Transactions on Industrial Informatics
JF - IEEE Transactions on Industrial Informatics
IS - 5
ER -