TY - JOUR
T1 - GraphAVO
T2 - Self-Supervised Visual Odometry Based on Graph-Assisted Geometric Consistency
AU - Song, Rujun
AU - Liu, Jiaqi
AU - Xiao, Zhuoling
AU - Yan, Bo
N1 - Publisher Copyright:
© 2000-2011 IEEE.
PY - 2024
Y1 - 2024
N2 - Learning-based monocular visual odometry (VO) has recently attracted considerable attention for its robustness to camera parameters and environmental variations. Despite traditional pose graph optimization enhancing pose accuracy, its integration with deep learning may lead to error accumulation due to insufficient motion information exchange. Our method, GraphAVO, concentrates simultaneously on the adjacent and interval co-visibility correspondence to establish a feature and pose graph optimization for pose consistency. We design a graph-assisted Windowed Feature Graph Refinement (WFGR) component to operationalize pose graph optimization for deep feature refinement. The geometric consistency is further constrained by a Cycle Consistency Loss. Additionally, the Cascade Dilated Convolution Fusion (CDCF) component is incorporated to handle different degrees of pixel movement, facilitating the joint detection of slight and distinct motion cues for subsequent feature enhancement. Extensive experiments on the KITTI, Malaga, RobotCar, and self-collected outdoor datasets have demonstrated the promising performance and generalization ability of GraphAVO. It achieves competitive results against classical algorithms and outperforms related state-of-the-art methods by up to 24.4% and 40.1% on average translational and rotational evaluation, respectively.
AB - Learning-based monocular visual odometry (VO) has recently attracted considerable attention for its robustness to camera parameters and environmental variations. Despite traditional pose graph optimization enhancing pose accuracy, its integration with deep learning may lead to error accumulation due to insufficient motion information exchange. Our method, GraphAVO, concentrates simultaneously on the adjacent and interval co-visibility correspondence to establish a feature and pose graph optimization for pose consistency. We design a graph-assisted Windowed Feature Graph Refinement (WFGR) component to operationalize pose graph optimization for deep feature refinement. The geometric consistency is further constrained by a Cycle Consistency Loss. Additionally, the Cascade Dilated Convolution Fusion (CDCF) component is incorporated to handle different degrees of pixel movement, facilitating the joint detection of slight and distinct motion cues for subsequent feature enhancement. Extensive experiments on the KITTI, Malaga, RobotCar, and self-collected outdoor datasets have demonstrated the promising performance and generalization ability of GraphAVO. It achieves competitive results against classical algorithms and outperforms related state-of-the-art methods by up to 24.4% and 40.1% on average translational and rotational evaluation, respectively.
KW - Feature refinement
KW - graph neural network
KW - pose consistency
KW - self-supervised learning
KW - visual odometry
UR - http://www.scopus.com/inward/record.url?scp=85207790504&partnerID=8YFLogxK
U2 - 10.1109/TITS.2024.3462596
DO - 10.1109/TITS.2024.3462596
M3 - Article
AN - SCOPUS:85207790504
SN - 1524-9050
VL - 25
SP - 20673
EP - 20682
JO - IEEE Transactions on Intelligent Transportation Systems
JF - IEEE Transactions on Intelligent Transportation Systems
IS - 12
ER -