AD-DUNet: A dual-branch encoder approach by combining axial Transformer with cascaded dilated convolutions for liver and hepatic tumor segmentation

Hang Qi, Weijiang Wang, Yueting Shi, Xiaohua Wang*

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

摘要

Liver cancer remains a significant health concern, and accurate segmentation in CT scans is crucial for diagnosis and treatment. Deep learning-based auxiliary diagnosis techniques, especially utilizing U-shaped structures, are widely employed in medical image segmentation. However, traditional methods that utilize Convolutional Neural Networks (CNNs) generally have limitations in modeling long-range dependencies. Inspired by the success of Transformers in various vision tasks, approaches that combine Transformers with CNNs have been spurred. However, many existing hybrid CNN-Transformer models are prone to yielding poor performance on relative small-scale medical image datasets when trained from scratch. Moreover, some of these methods involve additional fusion modules customized, which introduce extra workload and parameters to the model. To address these limitations, we propose AD-DUNet, a hybrid CNN-Transformer model for liver and hepatic tumor segmentation, which comprises a dual-branch encoder and a residual decoder. The Transformer-based encoder, utilizing Axial Transformer (AT) blocks, efficiently captures long-range dependencies across the entire image, while the CNN-based encoder, constructed with cascaded dilated convolutions (CDC) blocks, extracts fine-grained local features. The two encoders synergize in the shared residual decoder, eliminating the need for additional fusion modules. The extensive experiments conducted on the LiTS2017 and 3DIRCAD datasets demonstrate the superiority of AD-DUNet over existing models. Remarkably, our approach achieves state-of-the-art results without relying on pre-trained weights, showcasing its efficiency with low complexity and 4.24M parameters.

源语言英语
文章编号106397
期刊Biomedical Signal Processing and Control
95
DOI
出版状态已出版 - 9月 2024

指纹

探究 'AD-DUNet: A dual-branch encoder approach by combining axial Transformer with cascaded dilated convolutions for liver and hepatic tumor segmentation' 的科研主题。它们共同构成独一无二的指纹。

引用此