D2VT: Better Detection and Description of Local Features with Vision Transformers

Yifei Yang, Zihao Wang, Zhen Li, Fang Deng, Yidian Huang

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Constrained by the local nature of CNNs, existing local feature description methods often overlook global and contextual spatial information. Vision Transformers (ViT) address this by leveraging self-attention to capture long-range dependencies and preserve spatial details more effectively than CNNs. Our work introduces a hybrid architecture that merges CNNs for local feature extraction with ViT for global feature capture, enhancing performance across diverse vision tasks. We propose a novel hierarchical Transformer encoder adaptable to various image resolutions, yielding multi-scale features without positional encoding. Additionally, we introduce a consistent attention-weighted triple loss to get the attention map and to optimize and match local descriptors. Utilizing a feature pyramid, our method predicts keypoints at multiple scales, leading to improved localization accuracy. Experiments have shown that our approach is competitive with the leading contrastive learning methods in image matching benchmarks and demonstrates robust generalization in tasks like visual odometry.
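This record does not define the attention-weighted loss named in the abstract. As a rough illustration only, the sketch below reads it as a standard triplet margin loss whose per-triplet terms are averaged with attention scores as weights. The function names, the Euclidean distance, the margin value, and the normalization by the weight sum are all assumptions made for this example, not details taken from the paper.

```python
import math


def l2_distance(a, b):
    """Euclidean distance between two descriptors given as equal-length lists."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def attention_weighted_triplet_loss(anchors, positives, negatives, attention, margin=1.0):
    """Hypothetical attention-weighted triplet loss (an assumed form, not the authors').

    Each triplet contributes a hinge term max(0, margin + d(a, p) - d(a, n)),
    and the terms are averaged using the attention scores as weights.
    """
    total_weight = sum(attention)
    if total_weight <= 0:
        raise ValueError("attention weights must sum to a positive value")
    weighted_sum = 0.0
    for a, p, n, w in zip(anchors, positives, negatives, attention):
        hinge = max(0.0, margin + l2_distance(a, p) - l2_distance(a, n))
        weighted_sum += w * hinge
    return weighted_sum / total_weight


# Two toy triplets: the first is already well separated, the second is violated.
anchors = [[0.0, 0.0], [0.0, 0.0]]
positives = [[0.0, 0.0], [3.0, 4.0]]
negatives = [[3.0, 4.0], [0.0, 0.0]]

uniform = attention_weighted_triplet_loss(anchors, positives, negatives, [1.0, 1.0])
skewed = attention_weighted_triplet_loss(anchors, positives, negatives, [3.0, 1.0])
```

Under this assumed form, shifting attention toward the well-separated triplet lowers the loss, which is the sense in which the weights decide which keypoints dominate descriptor optimization.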

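Original language: English
Title of host publication: Proceedings - 2024 China Automation Congress, CAC 2024
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 7110-7115
Number of pages: 6
ISBN (Electronic): 9798350368604
DOI: https://doi.org/10.1109/CAC63892.2024.10864608
Publication status: Published - 2024
Event: 2024 China Automation Congress, CAC 2024 - Qingdao, China
Duration: 1 Nov 2024 to 3 Nov 2024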

Publication series

Name: Proceedings - 2024 China Automation Congress, CAC 2024

Conference

Conference: 2024 China Automation Congress, CAC 2024
Country/Territory: China
City: Qingdao
Period: 1/11/24 to 3/11/24

Cite this

Yang, Y., Wang, Z., Li, Z., Deng, F., & Huang, Y. (2024). D2VT: Better Detection and Description of Local Features with Vision Transformers. In Proceedings - 2024 China Automation Congress, CAC 2024 (pp. 7110-7115). (Proceedings - 2024 China Automation Congress, CAC 2024). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/CAC63892.2024.10864608