跳到主要导航 跳到搜索 跳到主要内容

MedMM: A Multimodal Fusion Framework for 3D Medical Image Classification with Multigranular Text Guidance

  • Shanbo Zhao
  • , Meihui Zhang*
  • , Xiaoqin Zhu
  • , Junjie Li
  • , Yunyun Duan
  • , Zhizheng Zhuo
  • , Yaou Liu
  • , Chuyang Ye
  • *此作品的通讯作者
  • Beijing Institute of Technology
  • Capital Medical University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Deep learning approaches are widely used in medical image analysis and have shown impressive results on many analytical tasks. However, textual information related to medical images are often underutilized in existing methods, despite the great semantic value and potential multigranular guidance in medical image analysis. Meanwhile, many medical images, like magnetic resonance (MR) images are usually in 3D format consisting of multiple slices which contain more complex and redundant information, making them especially hard to be represented. In this paper, we propose a multimodal funsion framework for 3D medical image classification, which utilizes the medical text paired with the 3D medical image to guide the generation and aggregation of image features. Results show that our method significantly outperforms uni-modal and multimodal baseline methods. Ablation studies validate the effectiveness of each component, and visualization results also reveal the strong ability of our model on capturing fine-grained and coarse-grained information.

源语言英语
主期刊名Proceedings - 2024 10th International Conference on Big Data Computing and Communications, BIGCOM 2024
出版商Institute of Electrical and Electronics Engineers Inc.
42-49
页数8
版本2024
ISBN(电子版)9798331509538
DOI
出版状态已出版 - 2024
已对外发布
活动10th International Conference on Big Data Computing and Communications, BIGCOM 2024 - Dalian, 中国
期限: 9 8月 202411 8月 2024

会议

会议10th International Conference on Big Data Computing and Communications, BIGCOM 2024
国家/地区中国
Dalian
时期9/08/2411/08/24

指纹

探究 'MedMM: A Multimodal Fusion Framework for 3D Medical Image Classification with Multigranular Text Guidance' 的科研主题。它们共同构成独一无二的指纹。

引用此