DMSSN: Distilled Mixed Spectral-Spatial Network for Hyperspectral Salient Object Detection

Haolin Qin; Tingfa Xu; Peifu Liu; Jingxuan Xu; Jianan Li

doi:10.1109/TGRS.2024.3379380

DMSSN: Distilled Mixed Spectral-Spatial Network for Hyperspectral Salient Object Detection

Haolin Qin, Tingfa Xu, Peifu Liu, Jingxuan Xu^*, Jianan Li^*

^*Corresponding author for this work

School of Optics and Photonics

Research output: Contribution to journal › Article › peer-review

4 Citations (Scopus)

Abstract

Hyperspectral salient object detection (HSOD) has exhibited remarkable promise across various applications, particularly in intricate scenarios where conventional RGB-based approaches fall short. Despite the considerable progress in HSOD method advancements, two critical challenges require immediate attention. First, existing hyperspectral data dimension reduction techniques incur a loss of spectral information, which adversely affects detection accuracy. Second, previous methods insufficiently harness the inherent distinctive attributes of hyperspectral images (HSIs) during the feature extraction process. To address these challenges, we propose a novel approach termed the distilled mixed spectral-spatial network (DMSSN), comprising a distilled spectral encoding process and a mixed spectral-spatial transformer (MSST) feature extraction network. The encoding process utilizes knowledge distillation to construct a lightweight autoencoder for dimension reduction, striking a balance between robust encoding capabilities and low computational costs. The MSST extracts spectral-spatial features through multiple attention head groups, collaboratively enhancing its resistance to intricate scenarios. Moreover, we have created a large-scale HSOD dataset, HSOD-BIT, to tackle the issue of data scarcity in this field and meet the fundamental data requirements of deep network training. Extensive experiments demonstrate that our proposed DMSSN achieves the state-of-the-art performance on multiple datasets. We will soon make the code and dataset publicly available on https://github.com/anonymous0519/HSOD-BIT.

Original language	English
Article number	5512618
Pages (from-to)	1-18
Number of pages	18
Journal	IEEE Transactions on Geoscience and Remote Sensing
Volume	62
DOIs	https://doi.org/10.1109/TGRS.2024.3379380
Publication status	Published - 2024

Keywords

Attention mechanism
hyperspectral images (HSIs)
knowledge distillation
salient object detection (SOD)

Access to Document

10.1109/TGRS.2024.3379380

Cite this

@article{a7f2f6ab49e04425ad9949500db087e0,

title = "DMSSN: Distilled Mixed Spectral-Spatial Network for Hyperspectral Salient Object Detection",

abstract = "Hyperspectral salient object detection (HSOD) has exhibited remarkable promise across various applications, particularly in intricate scenarios where conventional RGB-based approaches fall short. Despite the considerable progress in HSOD method advancements, two critical challenges require immediate attention. First, existing hyperspectral data dimension reduction techniques incur a loss of spectral information, which adversely affects detection accuracy. Second, previous methods insufficiently harness the inherent distinctive attributes of hyperspectral images (HSIs) during the feature extraction process. To address these challenges, we propose a novel approach termed the distilled mixed spectral-spatial network (DMSSN), comprising a distilled spectral encoding process and a mixed spectral-spatial transformer (MSST) feature extraction network. The encoding process utilizes knowledge distillation to construct a lightweight autoencoder for dimension reduction, striking a balance between robust encoding capabilities and low computational costs. The MSST extracts spectral-spatial features through multiple attention head groups, collaboratively enhancing its resistance to intricate scenarios. Moreover, we have created a large-scale HSOD dataset, HSOD-BIT, to tackle the issue of data scarcity in this field and meet the fundamental data requirements of deep network training. Extensive experiments demonstrate that our proposed DMSSN achieves the state-of-the-art performance on multiple datasets. We will soon make the code and dataset publicly available on https://github.com/anonymous0519/HSOD-BIT.",

keywords = "Attention mechanism, hyperspectral images (HSIs), knowledge distillation, salient object detection (SOD)",

author = "Haolin Qin and Tingfa Xu and Peifu Liu and Jingxuan Xu and Jianan Li",

note = "Publisher Copyright: {\textcopyright} 1980-2012 IEEE.",

year = "2024",

doi = "10.1109/TGRS.2024.3379380",

language = "English",

volume = "62",

pages = "1--18",

journal = "IEEE Transactions on Geoscience and Remote Sensing",

issn = "0196-2892",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - DMSSN

T2 - Distilled Mixed Spectral-Spatial Network for Hyperspectral Salient Object Detection

AU - Qin, Haolin

AU - Xu, Tingfa

AU - Liu, Peifu

AU - Xu, Jingxuan

AU - Li, Jianan

PY - 2024

Y1 - 2024

N2 - Hyperspectral salient object detection (HSOD) has exhibited remarkable promise across various applications, particularly in intricate scenarios where conventional RGB-based approaches fall short. Despite the considerable progress in HSOD method advancements, two critical challenges require immediate attention. First, existing hyperspectral data dimension reduction techniques incur a loss of spectral information, which adversely affects detection accuracy. Second, previous methods insufficiently harness the inherent distinctive attributes of hyperspectral images (HSIs) during the feature extraction process. To address these challenges, we propose a novel approach termed the distilled mixed spectral-spatial network (DMSSN), comprising a distilled spectral encoding process and a mixed spectral-spatial transformer (MSST) feature extraction network. The encoding process utilizes knowledge distillation to construct a lightweight autoencoder for dimension reduction, striking a balance between robust encoding capabilities and low computational costs. The MSST extracts spectral-spatial features through multiple attention head groups, collaboratively enhancing its resistance to intricate scenarios. Moreover, we have created a large-scale HSOD dataset, HSOD-BIT, to tackle the issue of data scarcity in this field and meet the fundamental data requirements of deep network training. Extensive experiments demonstrate that our proposed DMSSN achieves the state-of-the-art performance on multiple datasets. We will soon make the code and dataset publicly available on https://github.com/anonymous0519/HSOD-BIT.

AB - Hyperspectral salient object detection (HSOD) has exhibited remarkable promise across various applications, particularly in intricate scenarios where conventional RGB-based approaches fall short. Despite the considerable progress in HSOD method advancements, two critical challenges require immediate attention. First, existing hyperspectral data dimension reduction techniques incur a loss of spectral information, which adversely affects detection accuracy. Second, previous methods insufficiently harness the inherent distinctive attributes of hyperspectral images (HSIs) during the feature extraction process. To address these challenges, we propose a novel approach termed the distilled mixed spectral-spatial network (DMSSN), comprising a distilled spectral encoding process and a mixed spectral-spatial transformer (MSST) feature extraction network. The encoding process utilizes knowledge distillation to construct a lightweight autoencoder for dimension reduction, striking a balance between robust encoding capabilities and low computational costs. The MSST extracts spectral-spatial features through multiple attention head groups, collaboratively enhancing its resistance to intricate scenarios. Moreover, we have created a large-scale HSOD dataset, HSOD-BIT, to tackle the issue of data scarcity in this field and meet the fundamental data requirements of deep network training. Extensive experiments demonstrate that our proposed DMSSN achieves the state-of-the-art performance on multiple datasets. We will soon make the code and dataset publicly available on https://github.com/anonymous0519/HSOD-BIT.

KW - Attention mechanism

KW - hyperspectral images (HSIs)

KW - knowledge distillation

KW - salient object detection (SOD)

UR - http://www.scopus.com/inward/record.url?scp=85188525180&partnerID=8YFLogxK

U2 - 10.1109/TGRS.2024.3379380

DO - 10.1109/TGRS.2024.3379380

M3 - Article

AN - SCOPUS:85188525180

SN - 0196-2892

VL - 62

SP - 1

EP - 18

JO - IEEE Transactions on Geoscience and Remote Sensing

JF - IEEE Transactions on Geoscience and Remote Sensing

M1 - 5512618

ER -

DMSSN: Distilled Mixed Spectral-Spatial Network for Hyperspectral Salient Object Detection

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this