Cross-modal attention network for retinal disease classification based on multi-modal images

Zirong Liu, Yan Hu, Zhongxi Qiu, Yanyan Niu, Dan Zhou, Xiaoling Li, Junyong Shen, Hongyang Jiang, Heng Li, Jiang Liu*

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

摘要

Multi-modal eye disease screening improves diagnostic accuracy by providing lesion information from different sources. However, existing multi-modal automatic diagnosis methods tend to focus on the specificity of modalities and ignore the spatial correlation of images. This paper proposes a novel cross-modal retinal disease diagnosis network (CRD-Net) that digs out the relevant features from modal images aided for multiple retinal disease diagnosis. Specifically, our model introduces a cross-modal attention (CMA) module to query and adaptively pay attention to the relevant features of the lesion in the different modal images. In addition, we also propose multiple loss functions to fuse features with modality correlation and train a multi-modal retinal image classification network to achieve a more accurate diagnosis. Experimental evaluation on three publicly available datasets shows that our CRD-Net outperforms existing single-modal and multi-modal methods, demonstrating its superior performance.

源语言英语
页(从-至)3699-3714
页数16
期刊Biomedical Optics Express
15
6
DOI
出版状态已出版 - 1 6月 2024
已对外发布

指纹

探究 'Cross-modal attention network for retinal disease classification based on multi-modal images' 的科研主题。它们共同构成独一无二的指纹。

引用此