TY - GEN
T1 - N-Gram Swin Transformer for CT Image Super-Resolution
AU - Gao, Zhenghao
AU - Ai, Danni
AU - Li, Wentao
AU - Song, Hong
AU - Yang, Jian
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
PY - 2025
Y1 - 2025
N2 - The insufficient resolution of medical images, especially the low spatial resolution in the depth direction, may lead to the loss of critical information, thereby affecting the accuracy of medical diagnosis. Super-resolution (SR) technology plays a crucial role in medical imaging by enhancing image resolution to provide more detailed structural information. However, traditional single-image super-resolution (SISR) methods struggle to fully exploit 3D spatial information, resulting in insufficient spatial consistency between slices, which leads to artifacts and discontinuous textures, limiting their applicability in 3D medical image reconstruction. To address these challenges, this paper proposes the N-gram Swin Transformer Network (NGSWN) for super-resolution of CT images, specifically aiming to address the issue of insufficient resolution in the depth direction. The proposed model adopts an asymmetric encoder-decoder structure and integrates an N-gram-based mechanism to enhance feature extraction and reconstruction capabilities. By leveraging spatial relationships between slices, the NGSWN generates high-resolution CT images with better continuity and fewer artifacts. Experimental results demonstrate that the NGSWN outperforms both traditional and state-of-the-art methods in terms of PSNR and SSIM metrics, highlighting its significant potential for enhancing medical imaging quality and improving diagnostic accuracy.
AB - The insufficient resolution of medical images, especially the low spatial resolution in the depth direction, may lead to the loss of critical information, thereby affecting the accuracy of medical diagnosis. Super-resolution (SR) technology plays a crucial role in medical imaging by enhancing image resolution to provide more detailed structural information. However, traditional single-image super-resolution (SISR) methods struggle to fully exploit 3D spatial information, resulting in insufficient spatial consistency between slices, which leads to artifacts and discontinuous textures, limiting their applicability in 3D medical image reconstruction. To address these challenges, this paper proposes the N-gram Swin Transformer Network (NGSWN) for super-resolution of CT images, specifically aiming to address the issue of insufficient resolution in the depth direction. The proposed model adopts an asymmetric encoder-decoder structure and integrates an N-gram-based mechanism to enhance feature extraction and reconstruction capabilities. By leveraging spatial relationships between slices, the NGSWN generates high-resolution CT images with better continuity and fewer artifacts. Experimental results demonstrate that the NGSWN outperforms both traditional and state-of-the-art methods in terms of PSNR and SSIM metrics, highlighting its significant potential for enhancing medical imaging quality and improving diagnostic accuracy.
KW - N-gram
KW - Super-Resolution
KW - Swin Transformer
UR - http://www.scopus.com/inward/record.url?scp=105002588898&partnerID=8YFLogxK
U2 - 10.1007/978-981-96-3679-2_9
DO - 10.1007/978-981-96-3679-2_9
M3 - Conference contribution
AN - SCOPUS:105002588898
SN - 9789819636785
T3 - Lecture Notes in Computer Science
SP - 136
EP - 148
BT - Extended Reality - 1st International Conference, ICXR 2024, Proceedings
A2 - Song, Weitao
A2 - Guan, Frank
A2 - Li, Shuai
A2 - Zhang, Guofeng
PB - Springer Science and Business Media Deutschland GmbH
T2 - 1st International Conference on Extended Reality, ICXR 2024
Y2 - 14 November 2024 through 17 November 2024
ER -