TY - JOUR
T1 - DMSSN
T2 - Distilled Mixed Spectral-Spatial Network for Hyperspectral Salient Object Detection
AU - Qin, Haolin
AU - Xu, Tingfa
AU - Liu, Peifu
AU - Xu, Jingxuan
AU - Li, Jianan
N1 - Publisher Copyright:
© 1980-2012 IEEE.
PY - 2024
Y1 - 2024
N2 - Hyperspectral salient object detection (HSOD) has exhibited remarkable promise across various applications, particularly in intricate scenarios where conventional RGB-based approaches fall short. Despite the considerable progress in HSOD method advancements, two critical challenges require immediate attention. First, existing hyperspectral data dimension reduction techniques incur a loss of spectral information, which adversely affects detection accuracy. Second, previous methods insufficiently harness the inherent distinctive attributes of hyperspectral images (HSIs) during the feature extraction process. To address these challenges, we propose a novel approach termed the distilled mixed spectral-spatial network (DMSSN), comprising a distilled spectral encoding process and a mixed spectral-spatial transformer (MSST) feature extraction network. The encoding process utilizes knowledge distillation to construct a lightweight autoencoder for dimension reduction, striking a balance between robust encoding capabilities and low computational costs. The MSST extracts spectral-spatial features through multiple attention head groups, collaboratively enhancing its resistance to intricate scenarios. Moreover, we have created a large-scale HSOD dataset, HSOD-BIT, to tackle the issue of data scarcity in this field and meet the fundamental data requirements of deep network training. Extensive experiments demonstrate that our proposed DMSSN achieves the state-of-the-art performance on multiple datasets. We will soon make the code and dataset publicly available on https://github.com/anonymous0519/HSOD-BIT.
AB - Hyperspectral salient object detection (HSOD) has exhibited remarkable promise across various applications, particularly in intricate scenarios where conventional RGB-based approaches fall short. Despite the considerable progress in HSOD method advancements, two critical challenges require immediate attention. First, existing hyperspectral data dimension reduction techniques incur a loss of spectral information, which adversely affects detection accuracy. Second, previous methods insufficiently harness the inherent distinctive attributes of hyperspectral images (HSIs) during the feature extraction process. To address these challenges, we propose a novel approach termed the distilled mixed spectral-spatial network (DMSSN), comprising a distilled spectral encoding process and a mixed spectral-spatial transformer (MSST) feature extraction network. The encoding process utilizes knowledge distillation to construct a lightweight autoencoder for dimension reduction, striking a balance between robust encoding capabilities and low computational costs. The MSST extracts spectral-spatial features through multiple attention head groups, collaboratively enhancing its resistance to intricate scenarios. Moreover, we have created a large-scale HSOD dataset, HSOD-BIT, to tackle the issue of data scarcity in this field and meet the fundamental data requirements of deep network training. Extensive experiments demonstrate that our proposed DMSSN achieves the state-of-the-art performance on multiple datasets. We will soon make the code and dataset publicly available on https://github.com/anonymous0519/HSOD-BIT.
KW - Attention mechanism
KW - hyperspectral images (HSIs)
KW - knowledge distillation
KW - salient object detection (SOD)
UR - http://www.scopus.com/inward/record.url?scp=85188525180&partnerID=8YFLogxK
U2 - 10.1109/TGRS.2024.3379380
DO - 10.1109/TGRS.2024.3379380
M3 - Article
AN - SCOPUS:85188525180
SN - 0196-2892
VL - 62
SP - 1
EP - 18
JO - IEEE Transactions on Geoscience and Remote Sensing
JF - IEEE Transactions on Geoscience and Remote Sensing
M1 - 5512618
ER -