TY - JOUR
T1 - Microscopic Hyperspectral Image Classification Based on Fusion Transformer with Parallel CNN
AU - Zeng, Weijia
AU - Li, Wei
AU - Zhang, Mengmeng
AU - Wang, Hao
AU - Lv, Meng
AU - Yang, Yue
AU - Tao, Ran
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2023/6/1
Y1 - 2023/6/1
N2 - Microscopic hyperspectral image (MHSI) has received considerable attention in the medical field. The wealthy spectral information provides potentially powerful identification ability when combining with advanced convolutional neural network (CNN). However, for high-dimensional MHSI, the local connection of CNN makes it difficult to extract the long-range dependencies of spectral bands. Transformer overcomes this problem well because of its self-attention mechanism. Nevertheless, transformer is inferior to CNN in extracting spatial detailed features. Therefore, a classification framework integrating transformer and CNN in parallel, named as Fusion Transformer (FUST), is proposed for MHSI classification tasks. Specifically, the transformer branch is employed to extract the overall semantics and capture the long-range dependencies of spectral bands to highlight the key spectral information. The parallel CNN branch is designed to extract significant multiscale spatial features. Furthermore, the feature fusion module is developed to effectively fuse and process the features extracted by the two branches. Experimental results on three MHSI datasets demonstrate that the proposed FUST achieves superior performance when compared with state-of-the-art methods.
AB - Microscopic hyperspectral image (MHSI) has received considerable attention in the medical field. The wealthy spectral information provides potentially powerful identification ability when combining with advanced convolutional neural network (CNN). However, for high-dimensional MHSI, the local connection of CNN makes it difficult to extract the long-range dependencies of spectral bands. Transformer overcomes this problem well because of its self-attention mechanism. Nevertheless, transformer is inferior to CNN in extracting spatial detailed features. Therefore, a classification framework integrating transformer and CNN in parallel, named as Fusion Transformer (FUST), is proposed for MHSI classification tasks. Specifically, the transformer branch is employed to extract the overall semantics and capture the long-range dependencies of spectral bands to highlight the key spectral information. The parallel CNN branch is designed to extract significant multiscale spatial features. Furthermore, the feature fusion module is developed to effectively fuse and process the features extracted by the two branches. Experimental results on three MHSI datasets demonstrate that the proposed FUST achieves superior performance when compared with state-of-the-art methods.
KW - Convolutional neural network (CNN)
KW - feature fusion
KW - microscopic hyperspectral image (MHSI)
KW - transformer
UR - http://www.scopus.com/inward/record.url?scp=85149827224&partnerID=8YFLogxK
U2 - 10.1109/JBHI.2023.3253722
DO - 10.1109/JBHI.2023.3253722
M3 - Article
C2 - 37028325
AN - SCOPUS:85149827224
SN - 2168-2194
VL - 27
SP - 2910
EP - 2921
JO - IEEE Journal of Biomedical and Health Informatics
JF - IEEE Journal of Biomedical and Health Informatics
IS - 6
ER -