AD-DUNet: A dual-branch encoder approach by combining axial Transformer with cascaded dilated convolutions for liver and hepatic tumor segmentation

Hang Qi, Weijiang Wang, Yueting Shi, Xiaohua Wang*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

4 Citations (Scopus)
Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 4
  • Captures
    • Readers: 9
  • Mentions
    • News Mentions: 1
see details

Abstract

Liver cancer remains a significant health concern, and accurate segmentation in CT scans is crucial for diagnosis and treatment. Deep learning-based auxiliary diagnosis techniques, especially utilizing U-shaped structures, are widely employed in medical image segmentation. However, traditional methods that utilize Convolutional Neural Networks (CNNs) generally have limitations in modeling long-range dependencies. Inspired by the success of Transformers in various vision tasks, approaches that combine Transformers with CNNs have been spurred. However, many existing hybrid CNN-Transformer models are prone to yielding poor performance on relative small-scale medical image datasets when trained from scratch. Moreover, some of these methods involve additional fusion modules customized, which introduce extra workload and parameters to the model. To address these limitations, we propose AD-DUNet, a hybrid CNN-Transformer model for liver and hepatic tumor segmentation, which comprises a dual-branch encoder and a residual decoder. The Transformer-based encoder, utilizing Axial Transformer (AT) blocks, efficiently captures long-range dependencies across the entire image, while the CNN-based encoder, constructed with cascaded dilated convolutions (CDC) blocks, extracts fine-grained local features. The two encoders synergize in the shared residual decoder, eliminating the need for additional fusion modules. The extensive experiments conducted on the LiTS2017 and 3DIRCAD datasets demonstrate the superiority of AD-DUNet over existing models. Remarkably, our approach achieves state-of-the-art results without relying on pre-trained weights, showcasing its efficiency with low complexity and 4.24M parameters.

Original languageEnglish
Article number106397
JournalBiomedical Signal Processing and Control
Volume95
DOIs
Publication statusPublished - Sept 2024

Keywords

  • Convolutional neural network
  • Deep learning
  • Dual-branch encoder
  • Medical image segmentation
  • Transformer

Fingerprint

Dive into the research topics of 'AD-DUNet: A dual-branch encoder approach by combining axial Transformer with cascaded dilated convolutions for liver and hepatic tumor segmentation'. Together they form a unique fingerprint.

Cite this

Qi, H., Wang, W., Shi, Y., & Wang, X. (2024). AD-DUNet: A dual-branch encoder approach by combining axial Transformer with cascaded dilated convolutions for liver and hepatic tumor segmentation. Biomedical Signal Processing and Control, 95, Article 106397. https://doi.org/10.1016/j.bspc.2024.106397