跳到主要导航 跳到搜索 跳到主要内容

MI2A: A Multimodal Information Interaction Architecture for Automated Diagnosis of Lung Nodules Using PET/CT Imaging

  • Kai Li
  • , Tongtong Li
  • , Lei Zhang
  • , Junfeng Mao
  • , Xuerong Shi
  • , Zhijun Yao*
  • , Lei Fang*
  • , Bin Hu*
  • *此作品的通讯作者
  • Lanzhou University
  • Hexi University
  • The 940th Hospital of Joint Logistics Support Force of Chinese PLA
  • Gansu University of Chinese Medicine
  • Taikang Tongji (Wuhan) Hospital
  • Beijing Institute of Technology
  • Chinese Academy of Sciences

科研成果: 期刊稿件文章同行评审

摘要

Lung cancer is one of the most common malignancies globally, with malignant nodules being an early indicator of the disease. Thus, accurate early diagnosis of lung nodules is imperative. Positron emission tomography–computed tomography (PET/CT) is a noninvasive imaging technique that provides both anatomical and metabolic information, playing a crucial role in the diagnosis of cancer. Existing deep learning-based multimodal fusion strategies often rely on the simple concatenation of features from two modalities, overlooking the intricate interactions between them. In this study, we proposed a multimodal information interaction framework named multimodal information interaction architecture (MI2A) for the automated diagnosis of lung nodules using PET/CT imaging. Specifically, the lung parenchymal regions were cropped as regions of interest (ROIs) using a pretrained U-Net model. Second, higher-order multimodal features from PET/CT scans were extracted and integrated using a custom-designed PET–CT imaging encoder (PCIE) module and a cross-attention multimodal encoder (CAME) module, respectively. Predictions were generated using multipath pooling layers and a multilayer perceptron (MLP) layer. Furthermore, an alignment loss function was designed to minimize the discrepancy between modality features during training. Finally, the proposed model was evaluated on an actual clinical dataset, achieving accuracy (Acc), precision (Prec), recall (Rec), specificity (Spec), and the F1-score (F-1) of 0.9179, 0.8972, 0.8937, 0.9335, and 0.8954, respectively. In addition, the findings revealed that certain benign lesions, particularly those related to inflammatory or infectious conditions, displayed high metabolic activity, which is the main reason for limiting the model’s performance. This insight provides a promising direction for future research[Figure Presented].

源语言英语
页(从-至)28547-28559
页数13
期刊IEEE Sensors Journal
25
15
DOI
出版状态已出版 - 2025
已对外发布

联合国可持续发展目标

此成果有助于实现下列可持续发展目标:

  1. 可持续发展目标 3 - 良好健康与福祉
    可持续发展目标 3 良好健康与福祉

指纹

探究 'MI2A: A Multimodal Information Interaction Architecture for Automated Diagnosis of Lung Nodules Using PET/CT Imaging' 的科研主题。它们共同构成独一无二的指纹。

引用此