TY - GEN
T1 - Robust Two-Stage Image Stitching via Multi-Scale Homography Regression
AU - Lv, Yixing
AU - Chen, Linwei
AU - Fu, Ying
AU - Hu, Hao
AU - Jiang, Jian
AU - Yan, Chenggang
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Image stitching is a widely used and practical computer vision technique, which aims to generate images with a wide field of view. Traditional feature-based methods are heavily relying on the quality of geometric features, often showing poor performance in low-texture scenarios. Recently, deep learning-based approaches have demonstrated significant advantages in feature detection capability and robustness. Compared to traditional methods, deep learning-based stitching methods adaptively learn deep semantic features through powerful feature extraction networks, but they still face challenges in handling details of local areas. In this paper, we propose a two-stage supervised framework for image stitching. In the first stage, we design a supervised homography network that employs hybrid attention convolutional layers to extract multi-scale features and predicts homography transformations through multi-stage regression. In the second stage, we designed an attention-feature reconstruction module to better learn seam information and eliminate artifacts present in coarse-aligned images. Experimental results demonstrate that our approach outperforms existing homography prediction and image stitching methods both qualitatively and quantitatively, while also show better performance in local details processing.
AB - Image stitching is a widely used and practical computer vision technique, which aims to generate images with a wide field of view. Traditional feature-based methods are heavily relying on the quality of geometric features, often showing poor performance in low-texture scenarios. Recently, deep learning-based approaches have demonstrated significant advantages in feature detection capability and robustness. Compared to traditional methods, deep learning-based stitching methods adaptively learn deep semantic features through powerful feature extraction networks, but they still face challenges in handling details of local areas. In this paper, we propose a two-stage supervised framework for image stitching. In the first stage, we design a supervised homography network that employs hybrid attention convolutional layers to extract multi-scale features and predicts homography transformations through multi-stage regression. In the second stage, we designed an attention-feature reconstruction module to better learn seam information and eliminate artifacts present in coarse-aligned images. Experimental results demonstrate that our approach outperforms existing homography prediction and image stitching methods both qualitatively and quantitatively, while also show better performance in local details processing.
KW - attention
KW - image stitching
KW - multi-scale
KW - regression
UR - https://www.scopus.com/pages/publications/105036382470
U2 - 10.1109/ICVISP68610.2025.11451726
DO - 10.1109/ICVISP68610.2025.11451726
M3 - Conference contribution
AN - SCOPUS:105036382470
T3 - ICVISP 2025 Proceedings - 2025 9th International Conference on Vision, Image and Signal Processing
BT - ICVISP 2025 Proceedings - 2025 9th International Conference on Vision, Image and Signal Processing
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 9th International Conference on Vision, Image and Signal Processing, ICVISP 2025
Y2 - 28 November 2025 through 30 November 2025
ER -