TY - JOUR
T1 - Clusterformer for Pine Tree Disease Identification Based on UAV Remote Sensing Image Segmentation
AU - Liu, Huan
AU - Li, Wei
AU - Jia, Wen
AU - Sun, Hong
AU - Zhang, Mengmeng
AU - Song, Lujie
AU - Gui, Yuanyuan
N1 - Publisher Copyright:
© 1980-2012 IEEE.
PY - 2024
Y1 - 2024
N2 - Pine wilt disease (PWD) is one of the most prevalent pine tree diseases, resulting in both ecological and economic havoc. Unmanned aerial vehicle (UAV) remote sensing segmentation plays a crucial role in early identifying and preventing PWD. However, deep learning segmentation models customized for PWD identification in scenarios with complex backgrounds have not received extensive exploration. In this article, we propose a novel UAV remote sensing segmentation model called Clusterformer with a conventional encoder-decoder structure. The encoder is comprised of the specially designed cluster transformer, which includes a cluster token mixer and a spatial-channel feed-forward network (SC-FFN). The cluster token mixer utilizes constructed clusters from the feature maps to represent pixels, thereby reducing redundant and interfering information. The SC-FFN extracts multiscale spatial information through depth-wise convolutions and channel information through a multilayer perceptron (MLP) in sequence. The decoder primarily consists of the specially designed D-cluster transformer. The token mixer of the D-cluster transformer employs constructed clusters from high-level decoded tokens to represent low-level encoded tokens without relying on traditional upsampling methods such as interpolation, transpose convolution, or patch expansion. Consequently, more robust and less redundant features from high-level decoded feature maps are transferred to low-level encoded feature maps. Experimental results on two PWD datasets demonstrate that Clusterformer outperforms existing state-of-the-art segmentation models. This confirms the effectiveness and efficiency of Clusterformer in PWD identification. The code is available at https://github.com/huanliu233/Clusterformer.
AB - Pine wilt disease (PWD) is one of the most prevalent pine tree diseases, resulting in both ecological and economic havoc. Unmanned aerial vehicle (UAV) remote sensing segmentation plays a crucial role in early identifying and preventing PWD. However, deep learning segmentation models customized for PWD identification in scenarios with complex backgrounds have not received extensive exploration. In this article, we propose a novel UAV remote sensing segmentation model called Clusterformer with a conventional encoder-decoder structure. The encoder is comprised of the specially designed cluster transformer, which includes a cluster token mixer and a spatial-channel feed-forward network (SC-FFN). The cluster token mixer utilizes constructed clusters from the feature maps to represent pixels, thereby reducing redundant and interfering information. The SC-FFN extracts multiscale spatial information through depth-wise convolutions and channel information through a multilayer perceptron (MLP) in sequence. The decoder primarily consists of the specially designed D-cluster transformer. The token mixer of the D-cluster transformer employs constructed clusters from high-level decoded tokens to represent low-level encoded tokens without relying on traditional upsampling methods such as interpolation, transpose convolution, or patch expansion. Consequently, more robust and less redundant features from high-level decoded feature maps are transferred to low-level encoded feature maps. Experimental results on two PWD datasets demonstrate that Clusterformer outperforms existing state-of-the-art segmentation models. This confirms the effectiveness and efficiency of Clusterformer in PWD identification. The code is available at https://github.com/huanliu233/Clusterformer.
KW - Cluster transformer
KW - pine wilt identification
KW - semantic segmentation
KW - unmanned aerial vehicle (UAV) remote sensing
UR - http://www.scopus.com/inward/record.url?scp=85184819866&partnerID=8YFLogxK
U2 - 10.1109/TGRS.2024.3362877
DO - 10.1109/TGRS.2024.3362877
M3 - Article
AN - SCOPUS:85184819866
SN - 0196-2892
VL - 62
SP - 1
EP - 15
JO - IEEE Transactions on Geoscience and Remote Sensing
JF - IEEE Transactions on Geoscience and Remote Sensing
M1 - 5609215
ER -