TY - JOUR
T1 - A dual-attention based coupling network for diabetes classification with heterogeneous data
AU - Wang, Lei
AU - Pan, Zhenglin
AU - Liu, Wei
AU - Wang, Junzheng
AU - Ji, Linong
AU - Shi, Dawei
N1 - Publisher Copyright:
© 2023 Elsevier Inc.
PY - 2023/3
Y1 - 2023/3
N2 - Diabetes Mellitus (DM) is a group of metabolic disorders characterized by hyperglycaemia in the absence of treatment. Classification of DM is essential as it corresponds to the respective diagnosis and treatment. In this paper, we propose a new coupling network with hierarchical dual-attention that utilizes heterogeneous data, including Flash Glucose Monitoring (FGM) data and biomarkers in electronic medical records. The long short-term memory-based FGM sub-network extracts the time-dependent features of dynamic FGM sequences, while the biomarkers sub-network learns the features of static biomarkers. The convolutional block attention module (CBAM) for dispersing the feature weights of the spatial and channel dimensions is implemented into the FGM sub-network to endure the variability of FGM and allows us to extract high-level discriminative features more accurately. To better adjust the importance weights of the characteristics of the two sub-networks, self-attention is introduced to integrate the characteristics of heterogeneous data. Based on the dataset provided by Peking University People's Hospital, the proposed method is evaluated through factorial experiments of multi-source heterogeneous data, ablation studies of various attention strategies, time consumption evaluation and quantitative evaluation. The benchmark tests reveal the proposed network achieves a type 1 and 2 diabetes classification accuracy of 95.835% and the comprehensive performance metrics, including Matthews correlation coefficient, F1-score and G-mean, are 91.333%, 94.939% and 94.937% respectively. In the factorial experiments, the proposed method reaches the maximum area under the receiver operating characteristic curve of 0.9428, which indicates the effectiveness of the coupling between the nominated sub-networks. The coupling network with a dual-attention strategy performs better than the one without or only with a single-attention strategy in the ablation study as well. In addition, the model is also tested on another data set, and the accuracy of the test reaches 94.286%, reflecting that the model is robust when it is transferred to untrained diabetes data. The experimental results show that the proposed method is feasible in the classification of diabetes types. The code is available at https://github.com/bitDalei/Diabetes-Classification-with-Heterogeneous-Data.
AB - Diabetes Mellitus (DM) is a group of metabolic disorders characterized by hyperglycaemia in the absence of treatment. Classification of DM is essential as it corresponds to the respective diagnosis and treatment. In this paper, we propose a new coupling network with hierarchical dual-attention that utilizes heterogeneous data, including Flash Glucose Monitoring (FGM) data and biomarkers in electronic medical records. The long short-term memory-based FGM sub-network extracts the time-dependent features of dynamic FGM sequences, while the biomarkers sub-network learns the features of static biomarkers. The convolutional block attention module (CBAM) for dispersing the feature weights of the spatial and channel dimensions is implemented into the FGM sub-network to endure the variability of FGM and allows us to extract high-level discriminative features more accurately. To better adjust the importance weights of the characteristics of the two sub-networks, self-attention is introduced to integrate the characteristics of heterogeneous data. Based on the dataset provided by Peking University People's Hospital, the proposed method is evaluated through factorial experiments of multi-source heterogeneous data, ablation studies of various attention strategies, time consumption evaluation and quantitative evaluation. The benchmark tests reveal the proposed network achieves a type 1 and 2 diabetes classification accuracy of 95.835% and the comprehensive performance metrics, including Matthews correlation coefficient, F1-score and G-mean, are 91.333%, 94.939% and 94.937% respectively. In the factorial experiments, the proposed method reaches the maximum area under the receiver operating characteristic curve of 0.9428, which indicates the effectiveness of the coupling between the nominated sub-networks. The coupling network with a dual-attention strategy performs better than the one without or only with a single-attention strategy in the ablation study as well. In addition, the model is also tested on another data set, and the accuracy of the test reaches 94.286%, reflecting that the model is robust when it is transferred to untrained diabetes data. The experimental results show that the proposed method is feasible in the classification of diabetes types. The code is available at https://github.com/bitDalei/Diabetes-Classification-with-Heterogeneous-Data.
KW - Coupling network
KW - Diabetes types classification
KW - Dual-attention
KW - Heterogeneous data
UR - http://www.scopus.com/inward/record.url?scp=85147797803&partnerID=8YFLogxK
U2 - 10.1016/j.jbi.2023.104300
DO - 10.1016/j.jbi.2023.104300
M3 - Article
C2 - 36736446
AN - SCOPUS:85147797803
SN - 1532-0464
VL - 139
JO - Journal of Biomedical Informatics
JF - Journal of Biomedical Informatics
M1 - 104300
ER -