TY - GEN
T1 - Research on the Role of Feature Factors in Pneumoconiosis Staging Based on a Pneumoconiosis Database
AU - Cao, Qian
AU - Wei, Yangyang
AU - Shi, Yiwei
AU - Wen, Junhai
N1 - Publisher Copyright:
© 2025 Copyright held by the owner/author(s).
PY - 2026/4/13
Y1 - 2026/4/13
N2 - China is the world’s largest coal producer, and pneumoconiosis remains the most widespread and severe occupational disease in the country. In recent years, artificial intelligence (AI) and radiomics technologies have attracted increasing attention in the diagnosis of coal workers’ pneumoconiosis (CWP). In this study, data extracted from the pneumoconiosis database were used to investigate the influence and contribution of multiple feature factors to disease staging through AI-based modeling. Using variables including age, dust exposure years, total working years, and smoking years, we extracted data from patients at different stages of pneumoconiosis. Two binary classification models were constructed using the XGBoost classifier: Model 1 (Stage I vs. Stages II–III, label: 0 vs. 1) and Model 2 (Stage II vs. Stage III, label: 0 vs. 1). The analysis revealed that in the early progression phase (Stage I → Stages II + III), Dust Exposure Years served as the principal driving factor, whereas in the later phase (Stage II → Stage III), Age became the most critical predictor. Total Working Years, although relatively weak in the early stage, demonstrated a moderate influence in later-stage progression, particularly contributing to the likelihood of Stage III development. Across both stages, Smoking Years functioned as a weakly synergistic factor, exerting only minimal influence on overall disease progression.
AB - China is the world’s largest coal producer, and pneumoconiosis remains the most widespread and severe occupational disease in the country. In recent years, artificial intelligence (AI) and radiomics technologies have attracted increasing attention in the diagnosis of coal workers’ pneumoconiosis (CWP). In this study, data extracted from the pneumoconiosis database were used to investigate the influence and contribution of multiple feature factors to disease staging through AI-based modeling. Using variables including age, dust exposure years, total working years, and smoking years, we extracted data from patients at different stages of pneumoconiosis. Two binary classification models were constructed using the XGBoost classifier: Model 1 (Stage I vs. Stages II–III, label: 0 vs. 1) and Model 2 (Stage II vs. Stage III, label: 0 vs. 1). The analysis revealed that in the early progression phase (Stage I → Stages II + III), Dust Exposure Years served as the principal driving factor, whereas in the later phase (Stage II → Stage III), Age became the most critical predictor. Total Working Years, although relatively weak in the early stage, demonstrated a moderate influence in later-stage progression, particularly contributing to the likelihood of Stage III development. Across both stages, Smoking Years functioned as a weakly synergistic factor, exerting only minimal influence on overall disease progression.
KW - artificial intelligence
KW - pneumoconiosis
KW - SHAP values
KW - XGBoost classifier
UR - https://www.scopus.com/pages/publications/105037322366
U2 - 10.1145/3796731.3796746
DO - 10.1145/3796731.3796746
M3 - Conference contribution
AN - SCOPUS:105037322366
T3 - Proceedings of 2025 5th International Conference on Computational Modeling, Simulation and Data Analysis, CMSDA 2025
SP - 90
EP - 95
BT - Proceedings of 2025 5th International Conference on Computational Modeling, Simulation and Data Analysis, CMSDA 2025
PB - Association for Computing Machinery, Inc
T2 - 2025 5th International Conference on Computational Modeling, Simulation and Data Analysis, CMSDA 2025
Y2 - 14 December 2025 through 16 December 2025
ER -