TY - JOUR
T1 - Dual-Branch Semi-Supervised Deep Learning for Improved Lung Cancer Diagnosis With Metabolomics
AU - Li, Qian
AU - Liu, Xinbo
AU - Ye, Chao
AU - Zhang, Jinze
AU - Meng, Xuanyu
AU - Guo, Jin
AU - Min, Xianjun
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2026
Y1 - 2026
N2 - Metabolomics, a comprehensive omics approach for profiling small-molecule metabolites within biological systems, has become an indispensable tool for disease diagnosis, prognostic assessment, and prediction of therapeutic response. In lung cancer research, metabolomics shows particular promise in early detection, subtype classification, treatment monitoring, and outcome prediction. Lung cancer remains one of the leading causes of cancer-related mortality worldwide, characterized by high treatment costs and persistently low five-year survival rates, underscoring the urgent need for improved diagnostic strategies. The Lung Cancer Metabolome Database (LCMD) is a pioneering, freely accessible online resource that aggregates lung cancer–associated metabolites identified from mass spectrometry–based studies, providing a valuable foundation for computational modeling. However, metabolomics data are often challenged by limited labeled samples and high levels of missingness. To address these challenges, we propose a novel Dual-Branch Semi-Supervised Deep Learning (DBSL) framework for lung cancer metabolomics analysis. The model incorporates a Multi-Features Consistency Attention (MFCA) mechanism to jointly capture global temporal patterns and local feature interactions. In addition, a semi-supervised optimization strategy integrating pseudo-labeling, stochastic data augmentation, and consistency regularization is designed to effectively exploit unlabeled data. Under extreme low-label regimes (5%, 10%, and 15% labeled data), DBSL achieves state-of-the-art performance on the LCMD dataset. These results demonstrate the strong potential of DBSL as a scalable and cost-effective approach for lung cancer diagnosis, contributing to the urgent clinical demand for early detection and intervention.
AB - Metabolomics, a comprehensive omics approach for profiling small-molecule metabolites within biological systems, has become an indispensable tool for disease diagnosis, prognostic assessment, and prediction of therapeutic response. In lung cancer research, metabolomics shows particular promise in early detection, subtype classification, treatment monitoring, and outcome prediction. Lung cancer remains one of the leading causes of cancer-related mortality worldwide, characterized by high treatment costs and persistently low five-year survival rates, underscoring the urgent need for improved diagnostic strategies. The Lung Cancer Metabolome Database (LCMD) is a pioneering, freely accessible online resource that aggregates lung cancer–associated metabolites identified from mass spectrometry–based studies, providing a valuable foundation for computational modeling. However, metabolomics data are often challenged by limited labeled samples and high levels of missingness. To address these challenges, we propose a novel Dual-Branch Semi-Supervised Deep Learning (DBSL) framework for lung cancer metabolomics analysis. The model incorporates a Multi-Features Consistency Attention (MFCA) mechanism to jointly capture global temporal patterns and local feature interactions. In addition, a semi-supervised optimization strategy integrating pseudo-labeling, stochastic data augmentation, and consistency regularization is designed to effectively exploit unlabeled data. Under extreme low-label regimes (5%, 10%, and 15% labeled data), DBSL achieves state-of-the-art performance on the LCMD dataset. These results demonstrate the strong potential of DBSL as a scalable and cost-effective approach for lung cancer diagnosis, contributing to the urgent clinical demand for early detection and intervention.
KW - Lung cancer metabolomic diagnosis
KW - dual-branch feature fusion
KW - multi-features consistency attention
KW - semi-supervised deep learning
UR - https://www.scopus.com/pages/publications/105026388389
U2 - 10.1109/ACCESS.2025.3648945
DO - 10.1109/ACCESS.2025.3648945
M3 - Article
AN - SCOPUS:105026388389
SN - 2169-3536
VL - 14
SP - 4792
EP - 4803
JO - IEEE Access
JF - IEEE Access
ER -