More than Encoder: Introducing Transformer Decoder to Upsample

Yijiang Li, Wentian Cai, Ying Gao*, Chengming Li, Xiping Hu

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

27 引用 (Scopus)

摘要

Medical image segmentation methods downsample images for feature extraction and then upsample them to restore resolution for pixel-level predictions. In such schema, upsample technique is vital in restoring information for better performance. However, existing upsample techniques leverage little information from downsampling paths. The local and detailed feature from the shallower layer such as boundary and tissue texture is crucial in segmentation, especially medical image segmentation. To this end, we propose a novel upsample approach for medical image segmentation, Window Attention Upsample (WAU), which upsamples features conditioned on local and detailed features from downsampling path in local windows by introducing attention decoders of Transformer. WAU could serve as a general upsample method and be incorporated into any segmentation model that possesses lateral connections. We first propose the Attention Upsample which consists of Attention Decoder (AD) and bilinear upsample. AD leverages pixel-level attention to model longrange dependency and global information for a better upsample. Bilinear upsample is introduced as the residual connection to complement the upsampled features. Moreover, considering the extensive memory and computation cost of pixel-level attention, we further design a window attention scheme to restrict attention computation in local windows instead of the global range. We evaluate our method (WAU) on classic UNet structure with lateral connections and achieve state-of-the-art performance on Medical Segmentation Decathlon (MSD) Brain and Automatic Cardiac Diagnosis Challenge (ACDC) datasets. We also validate the effectiveness of our method on multiple classic architectures and achieve consistent improvement.

源语言英语
主期刊名Proceedings - 2022 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2022
编辑Donald Adjeroh, Qi Long, Xinghua Shi, Fei Guo, Xiaohua Hu, Srinivas Aluru, Giri Narasimhan, Jianxin Wang, Mingon Kang, Ananda M. Mondal, Jin Liu
出版商Institute of Electrical and Electronics Engineers Inc.
1597-1602
页数6
ISBN(电子版)9781665468190
DOI
出版状态已出版 - 2022
活动2022 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2022 - Las Vegas, 美国
期限: 6 12月 20228 12月 2022

出版系列

姓名Proceedings - 2022 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2022

会议

会议2022 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2022
国家/地区美国
Las Vegas
时期6/12/228/12/22

指纹

探究 'More than Encoder: Introducing Transformer Decoder to Upsample' 的科研主题。它们共同构成独一无二的指纹。

引用此