Spectral-Spatial Adaptive Transformer Model for Hyperspectral Image Classification

Dong Wang; Sitian Liu; Chunli Zhu

doi:10.1117/12.2687107

Spectral-Spatial Adaptive Transformer Model for Hyperspectral Image Classification

Dong Wang, Sitian Liu, Chunli Zhu^*

^*Corresponding author for this work

School of Mechatronical Engineering

Beijing Institute of Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Citation (Scopus)

Abstract

Hyperspectral imagery (HSI) classification, with the goal of assigning an appropriate land cover label to each hyperspectral pixel, is a challenging part of hyperspectral remote sensing. Recently, convolutional neural network-based HSI classification methods have shown superior performance due to their excellent locally contextual modeling ability. However, the ability of these methods to obtain deep semantic features is limited, and the computational cost increases markedly as the number of layers increases. In this work, we propose a novel spectral-spatial adaptive transformer (SSAT) model to adapt a pre-trained model for effective HSI classification. The main architecture of SSAT is based on vision transformer, which could aggregate features at different levels. Furthermore, we have designed an adaptive encoder block including spectral adaption, spatial adaption, and joint adaptation to extract HSI features in the spectral-spatial domains. Finally, the classification map is obtained from the fully connected layer. Extensive experiments have been conducted to validate the effectiveness of the proposed SSAT compared with seven typical HSI classification methods. Results demonstrate that the key classification evaluation index overall accuracy (OA) outperforms other comparative methods by at least 2.03%. Classification maps reveal the superior visualization effect, demonstrating that SSAT is an efficient tool for HSI classification.

Original language	English
Title of host publication	Optoelectronic Imaging and Multimedia Technology X
Editors	Qionghai Dai, Tsutomu Shimura, Zhenrong Zheng
Publisher	SPIE
ISBN (Electronic)	9781510667839
DOIs	https://doi.org/10.1117/12.2687107
Publication status	Published - 2023
Event	Optoelectronic Imaging and Multimedia Technology X 2023 - Beijing, China Duration: 15 Oct 2023 → 16 Oct 2023

Publication series

Name	Proceedings of SPIE - The International Society for Optical Engineering
Volume	12767
ISSN (Print)	0277-786X
ISSN (Electronic)	1996-756X

Conference

Conference	Optoelectronic Imaging and Multimedia Technology X 2023
Country/Territory	China
City	Beijing
Period	15/10/23 → 16/10/23

Keywords

Adaptive transformer
Hyperspectral images classification
Spatial-spectral joint adaptation

Access to Document

10.1117/12.2687107

Cite this

Wang, D., Liu, S., & Zhu, C. (2023). Spectral-Spatial Adaptive Transformer Model for Hyperspectral Image Classification. In Q. Dai, T. Shimura, & Z. Zheng (Eds.), Optoelectronic Imaging and Multimedia Technology X Article 1276709 (Proceedings of SPIE - The International Society for Optical Engineering; Vol. 12767). SPIE. https://doi.org/10.1117/12.2687107

@inproceedings{d664d791bd9f406796c2ec705ef774d9,

title = "Spectral-Spatial Adaptive Transformer Model for Hyperspectral Image Classification",

abstract = "Hyperspectral imagery (HSI) classification, with the goal of assigning an appropriate land cover label to each hyperspectral pixel, is a challenging part of hyperspectral remote sensing. Recently, convolutional neural network-based HSI classification methods have shown superior performance due to their excellent locally contextual modeling ability. However, the ability of these methods to obtain deep semantic features is limited, and the computational cost increases markedly as the number of layers increases. In this work, we propose a novel spectral-spatial adaptive transformer (SSAT) model to adapt a pre-trained model for effective HSI classification. The main architecture of SSAT is based on vision transformer, which could aggregate features at different levels. Furthermore, we have designed an adaptive encoder block including spectral adaption, spatial adaption, and joint adaptation to extract HSI features in the spectral-spatial domains. Finally, the classification map is obtained from the fully connected layer. Extensive experiments have been conducted to validate the effectiveness of the proposed SSAT compared with seven typical HSI classification methods. Results demonstrate that the key classification evaluation index overall accuracy (OA) outperforms other comparative methods by at least 2.03%. Classification maps reveal the superior visualization effect, demonstrating that SSAT is an efficient tool for HSI classification.",

keywords = "Adaptive transformer, Hyperspectral images classification, Spatial-spectral joint adaptation",

author = "Dong Wang and Sitian Liu and Chunli Zhu",

year = "2023",

doi = "10.1117/12.2687107",

language = "English",

series = "Proceedings of SPIE - The International Society for Optical Engineering",

publisher = "SPIE",

editor = "Qionghai Dai and Tsutomu Shimura and Zhenrong Zheng",

booktitle = "Optoelectronic Imaging and Multimedia Technology X",

address = "United States",

}

Wang, D, Liu, S & Zhu, C 2023, Spectral-Spatial Adaptive Transformer Model for Hyperspectral Image Classification. in Q Dai, T Shimura & Z Zheng (eds), Optoelectronic Imaging and Multimedia Technology X., 1276709, Proceedings of SPIE - The International Society for Optical Engineering, vol. 12767, SPIE, Optoelectronic Imaging and Multimedia Technology X 2023, Beijing, China, 15/10/23. https://doi.org/10.1117/12.2687107

Spectral-Spatial Adaptive Transformer Model for Hyperspectral Image Classification. / Wang, Dong; Liu, Sitian; Zhu, Chunli.
Optoelectronic Imaging and Multimedia Technology X. ed. / Qionghai Dai; Tsutomu Shimura; Zhenrong Zheng. SPIE, 2023. 1276709 (Proceedings of SPIE - The International Society for Optical Engineering; Vol. 12767).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Spectral-Spatial Adaptive Transformer Model for Hyperspectral Image Classification

AU - Wang, Dong

AU - Liu, Sitian

AU - Zhu, Chunli

PY - 2023

Y1 - 2023

N2 - Hyperspectral imagery (HSI) classification, with the goal of assigning an appropriate land cover label to each hyperspectral pixel, is a challenging part of hyperspectral remote sensing. Recently, convolutional neural network-based HSI classification methods have shown superior performance due to their excellent locally contextual modeling ability. However, the ability of these methods to obtain deep semantic features is limited, and the computational cost increases markedly as the number of layers increases. In this work, we propose a novel spectral-spatial adaptive transformer (SSAT) model to adapt a pre-trained model for effective HSI classification. The main architecture of SSAT is based on vision transformer, which could aggregate features at different levels. Furthermore, we have designed an adaptive encoder block including spectral adaption, spatial adaption, and joint adaptation to extract HSI features in the spectral-spatial domains. Finally, the classification map is obtained from the fully connected layer. Extensive experiments have been conducted to validate the effectiveness of the proposed SSAT compared with seven typical HSI classification methods. Results demonstrate that the key classification evaluation index overall accuracy (OA) outperforms other comparative methods by at least 2.03%. Classification maps reveal the superior visualization effect, demonstrating that SSAT is an efficient tool for HSI classification.

AB - Hyperspectral imagery (HSI) classification, with the goal of assigning an appropriate land cover label to each hyperspectral pixel, is a challenging part of hyperspectral remote sensing. Recently, convolutional neural network-based HSI classification methods have shown superior performance due to their excellent locally contextual modeling ability. However, the ability of these methods to obtain deep semantic features is limited, and the computational cost increases markedly as the number of layers increases. In this work, we propose a novel spectral-spatial adaptive transformer (SSAT) model to adapt a pre-trained model for effective HSI classification. The main architecture of SSAT is based on vision transformer, which could aggregate features at different levels. Furthermore, we have designed an adaptive encoder block including spectral adaption, spatial adaption, and joint adaptation to extract HSI features in the spectral-spatial domains. Finally, the classification map is obtained from the fully connected layer. Extensive experiments have been conducted to validate the effectiveness of the proposed SSAT compared with seven typical HSI classification methods. Results demonstrate that the key classification evaluation index overall accuracy (OA) outperforms other comparative methods by at least 2.03%. Classification maps reveal the superior visualization effect, demonstrating that SSAT is an efficient tool for HSI classification.

KW - Adaptive transformer

KW - Hyperspectral images classification

KW - Spatial-spectral joint adaptation

UR - http://www.scopus.com/inward/record.url?scp=85180128903&partnerID=8YFLogxK

U2 - 10.1117/12.2687107

DO - 10.1117/12.2687107

M3 - Conference contribution

AN - SCOPUS:85180128903

T3 - Proceedings of SPIE - The International Society for Optical Engineering

BT - Optoelectronic Imaging and Multimedia Technology X

A2 - Dai, Qionghai

A2 - Shimura, Tsutomu

A2 - Zheng, Zhenrong

PB - SPIE

T2 - Optoelectronic Imaging and Multimedia Technology X 2023

Y2 - 15 October 2023 through 16 October 2023

ER -

Spectral-Spatial Adaptive Transformer Model for Hyperspectral Image Classification

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this