TY - GEN
T1 - FD-DUNet
T2 - 20th International Conference on Intelligent Computing, ICIC 2024
AU - Qi, Hang
AU - Wang, Weijiang
AU - Shan, Chuxuan
AU - Wang, Xiaohua
AU - Jia, Minli
AU - Dang, Hua
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.
PY - 2024
Y1 - 2024
N2 - Medical image segmentation is crucial for computer-aided diagnosis, facilitating lesion identification. U-shaped structures have been prevalent in this domain, yet traditional methods employing Convolutional Neural Networks (CNNs) face challenges in modeling long-range dependencies. Consequently, there is a shift towards integrating global modeling mechanisms with CNNs. However, prevailing methods for global feature extraction in the spatial domain often encounter quadratic complexity, resulting in high parameters and rendering training on limited datasets less than optimal. To address this issue, we propose FD-DUNet, an efficient model with a global-local interaction architecture. The global encoder incorporates the Frequency Domain Global Modeling (FDGM) blocks, leveraging fast Fourier transform for capturing long-range dependencies with log-linear complexity. Meanwhile, the local encoder features Receptive Field Expansion (RFE) blocks, gradually widening the receptive field to extract fine-grained features. Additionally, our novel lightweight decoder combines a multi-path feature aggregation approach with the zero-parameter Spatial-shift operation to integrate global-local information effectively. Experimental evaluations on ISIC-2018 and BUSI datasets demonstrate the superior performance of FD-DUNet over existing methods, showing its efficiency with only 3.70 GFLOPs and 3.28 million parameters.
AB - Medical image segmentation is crucial for computer-aided diagnosis, facilitating lesion identification. U-shaped structures have been prevalent in this domain, yet traditional methods employing Convolutional Neural Networks (CNNs) face challenges in modeling long-range dependencies. Consequently, there is a shift towards integrating global modeling mechanisms with CNNs. However, prevailing methods for global feature extraction in the spatial domain often encounter quadratic complexity, resulting in high parameters and rendering training on limited datasets less than optimal. To address this issue, we propose FD-DUNet, an efficient model with a global-local interaction architecture. The global encoder incorporates the Frequency Domain Global Modeling (FDGM) blocks, leveraging fast Fourier transform for capturing long-range dependencies with log-linear complexity. Meanwhile, the local encoder features Receptive Field Expansion (RFE) blocks, gradually widening the receptive field to extract fine-grained features. Additionally, our novel lightweight decoder combines a multi-path feature aggregation approach with the zero-parameter Spatial-shift operation to integrate global-local information effectively. Experimental evaluations on ISIC-2018 and BUSI datasets demonstrate the superior performance of FD-DUNet over existing methods, showing its efficiency with only 3.70 GFLOPs and 3.28 million parameters.
KW - Convolutional Neural Networks
KW - Fourier Transform
KW - Global Filter
KW - Medical Image Segmentation
KW - Spatial-shift
UR - http://www.scopus.com/inward/record.url?scp=85201113819&partnerID=8YFLogxK
U2 - 10.1007/978-981-97-5581-3_6
DO - 10.1007/978-981-97-5581-3_6
M3 - Conference contribution
AN - SCOPUS:85201113819
SN - 9789819755806
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 68
EP - 80
BT - Advanced Intelligent Computing Technology and Applications - 20th International Conference, ICIC 2024, Proceedings
A2 - Huang, De-Shuang
A2 - Pan, Yijie
A2 - Zhang, Xiankun
PB - Springer Science and Business Media Deutschland GmbH
Y2 - 5 August 2024 through 8 August 2024
ER -