FD-DUNet: Frequency Domain Global Modeling Enhances Receptive Field Expansion UNet for Efficient Medical Image Segmentation

Hang Qi, Weijiang Wang, Chuxuan Shan, Xiaohua Wang*, Minli Jia, Hua Dang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Medical image segmentation is crucial for computer-aided diagnosis, facilitating lesion identification. U-shaped structures have been prevalent in this domain, yet traditional methods employing Convolutional Neural Networks (CNNs) face challenges in modeling long-range dependencies. Consequently, there is a shift towards integrating global modeling mechanisms with CNNs. However, prevailing methods for global feature extraction in the spatial domain often encounter quadratic complexity, resulting in high parameters and rendering training on limited datasets less than optimal. To address this issue, we propose FD-DUNet, an efficient model with a global-local interaction architecture. The global encoder incorporates the Frequency Domain Global Modeling (FDGM) blocks, leveraging fast Fourier transform for capturing long-range dependencies with log-linear complexity. Meanwhile, the local encoder features Receptive Field Expansion (RFE) blocks, gradually widening the receptive field to extract fine-grained features. Additionally, our novel lightweight decoder combines a multi-path feature aggregation approach with the zero-parameter Spatial-shift operation to integrate global-local information effectively. Experimental evaluations on ISIC-2018 and BUSI datasets demonstrate the superior performance of FD-DUNet over existing methods, showing its efficiency with only 3.70 GFLOPs and 3.28 million parameters.

Original languageEnglish
Title of host publicationAdvanced Intelligent Computing Technology and Applications - 20th International Conference, ICIC 2024, Proceedings
EditorsDe-Shuang Huang, Yijie Pan, Xiankun Zhang
PublisherSpringer Science and Business Media Deutschland GmbH
Pages68-80
Number of pages13
ISBN (Print)9789819755806
DOIs
Publication statusPublished - 2024
Event20th International Conference on Intelligent Computing, ICIC 2024 - Tianjin, China
Duration: 5 Aug 20248 Aug 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14863 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference20th International Conference on Intelligent Computing, ICIC 2024
Country/TerritoryChina
CityTianjin
Period5/08/248/08/24

Keywords

  • Convolutional Neural Networks
  • Fourier Transform
  • Global Filter
  • Medical Image Segmentation
  • Spatial-shift

Fingerprint

Dive into the research topics of 'FD-DUNet: Frequency Domain Global Modeling Enhances Receptive Field Expansion UNet for Efficient Medical Image Segmentation'. Together they form a unique fingerprint.

Cite this