Dual Encoders Neural Network for Medical Image Segmentation

Xin Wang*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In the field of medical image segmentation, UNet has emerged as a widely utilized backbone network architecture. The emergence of deep learning techniques such as convolutional neural networks (CNNs), attention mechanisms, and Transformer has provided a foundation for building newer and more powerful versions of UNet. The pure CNNs-based UNet has demonstrated excellent performance in medical image segmentation, and recently, the pure Transformer-based UNet has achieved even better segmentation results. Owing to their local inductive bias, CNNs excel at capturing local features and generate fine but potentially incomplete results, whereas Transformers excel at capturing global context and generate complete but less detailed results. Recently, some studies have explored the integration of CNNs and Transformers, achieving promising performance. In this paper, we introduce a novel dual encoders architecture that combines Swin Transformer and CNNs. Unlike prior methods, our architecture comprises two distinct sets of encoders: one leveraging Swin Transformer and the other utilizing CNNs. Furthermore, a spatial-channel attention-based fusion(SCAF) module is designed to effectively fuse the outputs. These innovative designs empower our network to effectively grasp both global context and local textural details, thereby enhancing the performance of medical image segmentation. Experimental results demonstrate that the proposed method outperforms previous state-of-the-art methods on both the Synapse multi-organ CT dataset and the ACDC dataset.

Original languageEnglish
Title of host publication2024 5th International Conference on Computer Engineering and Application, ICCEA 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages905-909
Number of pages5
ISBN (Electronic)9798350386776
DOIs
Publication statusPublished - 2024
Externally publishedYes
Event5th International Conference on Computer Engineering and Application, ICCEA 2024 - Hybrid, Hangzhou, China
Duration: 12 Apr 202414 Apr 2024

Publication series

Name2024 5th International Conference on Computer Engineering and Application, ICCEA 2024

Conference

Conference5th International Conference on Computer Engineering and Application, ICCEA 2024
Country/TerritoryChina
CityHybrid, Hangzhou
Period12/04/2414/04/24

Keywords

  • convolutional neural network
  • deep learning
  • medical image segmentaion
  • transformer
  • unet

Fingerprint

Dive into the research topics of 'Dual Encoders Neural Network for Medical Image Segmentation'. Together they form a unique fingerprint.

Cite this

Wang, X. (2024). Dual Encoders Neural Network for Medical Image Segmentation. In 2024 5th International Conference on Computer Engineering and Application, ICCEA 2024 (pp. 905-909). (2024 5th International Conference on Computer Engineering and Application, ICCEA 2024). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICCEA62105.2024.10603676