Skip to main navigation Skip to search Skip to main content

Fully automated classification of pulmonary nodules in positron emission tomography-computed tomography imaging using a two-stage multimodal learning approach

  • Tongtong Li
  • , Junfeng Mao
  • , Jiandong Yu
  • , Ziyang Zhao
  • , Miao Chen
  • , Zhijun Yao*
  • , Lei Fang*
  • , Bin Hu*
  • *Corresponding author for this work
  • Lanzhou University
  • The 940th Hospital of Joint Logistics Support Force of Chinese People’s Liberation Army
  • Gansu University of Chinese Medicine
  • Taikang Tongji (Wuhan) Hospital
  • Chinese Academy of Sciences

Research output: Contribution to journalArticlepeer-review

Abstract

Background: Lung cancer is a malignant tumor, for which pulmonary nodules are considered to be significant indicators. Early recognition and timely treatment of pulmonary nodules can contribute to improving the survival rate of patients with cancer. Positron emission tomography-computed tomography (PET/CT) is a noninvasive, fusion imaging technique that can obtain both functional and structural information of lung regions. However, studies of pulmonary nodules based on computer-aided diagnosis have primarily focused on the nodule level due to a reliance on the annotation of nodules, which is superficial and unable to contribute to the actual clinical diagnosis. The aim of this study was thus to develop a fully automated classification framework for a more comprehensive assessment of pulmonary nodules in PET/CT imaging data. Methods: We developed a two-stage multimodal learning framework for the diagnosis of pulmonary nodules in PET/CT imaging. In this framework, Stage I focuses on pulmonary parenchyma segmentation using a pretrained U-Net and PET/CT registration. Stage II aims to extract, integrate, and recognize image-level and feature-level features by employing the three-dimensional (3D) Inception-residual net (ResNet) convolutional block attention module architecture and a dense-voting fusion mechanism. Results: In the experiments, the proposed model’s performance was comprehensively validated using a set of real clinical data, achieving mean scores of 89.98%, 89.21%, 84.75%, 93.38%, 86.83%, and 0.9227 for accuracy, precision, recall, specificity, F1 score, and area under curve values, respectively. Conclusions: This paper presents a two-stage multimodal learning approach for the automatic diagnosis of pulmonary nodules. The findings reveal that the main reason for limiting model performance is the nonsolitary property of nodules in pulmonary nodule diagnosis, providing direction for future research.

Original languageEnglish
Pages (from-to)5526-5540
Number of pages15
JournalQuantitative Imaging in Medicine and Surgery
Volume14
Issue number8
DOIs
Publication statusPublished - Aug 2024

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • Pulmonary nodule classification
  • deep learning
  • multimodal
  • positron emission tomography-computed tomography (PET/CT)
  • two-stage

Fingerprint

Dive into the research topics of 'Fully automated classification of pulmonary nodules in positron emission tomography-computed tomography imaging using a two-stage multimodal learning approach'. Together they form a unique fingerprint.

Cite this