TY - JOUR
T1 - A VMamba-based Spatial-Spectral Fusion Network for Remote Sensing Image Classification
AU - Luo, Lan
AU - Zhang, Yanmei
AU - Xu, Yanbing
AU - Yue, Tingxuan
AU - Wang, Yuxi
N1 - Publisher Copyright:
© 2008-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - In hyperspectral (HS) and light detection and ranging (LiDAR) collaborative classification, HS provides rich spectral information, while LiDAR offers unique elevation data. However, existing methods often focus on feature extraction within individual modalities before fusion, which may bring about insufficient fusion due to a lack of inter-modal complementarity and interaction. To address this, we propose a framework for HS and LiDAR fusion classification based on the VMamba model, called SSFN, which includes a dual supplement network (DSN) and a VMamba-based integration network (VMIN), modeling long-range dependencies and fully leveraging the correlation and complementarity of heterogeneous information. The DSN, comprising a spatial supplement network (Spa-SN) and a spectral supplement network (Spe-SN), is devised to supplement missing features for each modality. The Spa-SN complements the spatial features of HS by capturing spatial correlations between LiDAR and HS, and the Spe-SN employs spectral information from HS to compensate for the spectral features missing in LiDAR. Thus, both HS and LiDAR have a comprehensive spatial-spectral description. The VMIN is then utilized for augmentation and interaction of supplemented features, and discriminative features are adaptively selected for classification. Extensive experiments on three benchmark datasets demonstrate that our method outperforms multiple state-of-the-art methods and needs the fewest parameters.
AB - In hyperspectral (HS) and light detection and ranging (LiDAR) collaborative classification, HS provides rich spectral information, while LiDAR offers unique elevation data. However, existing methods often focus on feature extraction within individual modalities before fusion, which may bring about insufficient fusion due to a lack of inter-modal complementarity and interaction. To address this, we propose a framework for HS and LiDAR fusion classification based on the VMamba model, called SSFN, which includes a dual supplement network (DSN) and a VMamba-based integration network (VMIN), modeling long-range dependencies and fully leveraging the correlation and complementarity of heterogeneous information. The DSN, comprising a spatial supplement network (Spa-SN) and a spectral supplement network (Spe-SN), is devised to supplement missing features for each modality. The Spa-SN complements the spatial features of HS by capturing spatial correlations between LiDAR and HS, and the Spe-SN employs spectral information from HS to compensate for the spectral features missing in LiDAR. Thus, both HS and LiDAR have a comprehensive spatial-spectral description. The VMIN is then utilized for augmentation and interaction of supplemented features, and discriminative features are adaptively selected for classification. Extensive experiments on three benchmark datasets demonstrate that our method outperforms multiple state-of-the-art methods and needs the fewest parameters.
KW - Fusion classification
KW - hyperspectral (HS)
KW - light detection and ranging (LiDAR)
KW - spatial-spectral supplement
KW - VMamba
UR - http://www.scopus.com/inward/record.url?scp=105005995913&partnerID=8YFLogxK
U2 - 10.1109/JSTARS.2025.3573289
DO - 10.1109/JSTARS.2025.3573289
M3 - Article
AN - SCOPUS:105005995913
SN - 1939-1404
JO - IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
JF - IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
ER -