TY - GEN
T1 - KEIM
T2 - 29th International Conference on Database Systems for Advanced Applications, DASFAA 2024
AU - Luo, Zhaojing
AU - Zhang, Chi
AU - Wang, Hao
AU - Shi, Jiyun
AU - Zhang, Meihui
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.
PY - 2024
Y1 - 2024
N2 - Electronic Health Records (EHR) include various sources of healthcare data collected from patients in hospitals. These data are typically stored in structured formats and are widely used in various big data healthcare analysis applications, particularly diagnosis prediction. Deep learning methods have achieved record-breaking results in various real-world prediction tasks. However, deep learning methods usually require a large amount of data for training, and the medical features that rarely appear in the data also pose great challenges for deep learning models. Besides, while deep learning models often achieve high accuracy, the lack of interpretation remains a problem for healthcare applications, which are naturally high-stakes. Existing works utilize medical ontology knowledge to enhance prediction performance and provide interpretable prediction results. Nevertheless, the ontology knowledge is coarse-grained, where many medical concepts and relationships are not included. In this paper, we propose to incorporate large-scale medical knowledge graphs (KGs) into our designed model, called KEIM (Knowledge graph Empowered Interpretable Model), for diagnosis prediction. Specifically, the KGs are first integrated into the time-series module of the model via a laplacian regularization to take advantage of the complex relationships among medical features. Subsequently, we construct a personalized KG for each visit and design a relation-aware attentive graph neural network based on the personalized KG to augment the time-series module for interpretable predictions. Extensive experiments on two benchmark healthcare datasets, namely, MIMIC-III and MIMIC-IV, show that our proposed KEIM not only achieves significant improvement in terms of AUC but also provides interpretability for diagnosis prediction with KGs.
AB - Electronic Health Records (EHR) include various sources of healthcare data collected from patients in hospitals. These data are typically stored in structured formats and are widely used in various big data healthcare analysis applications, particularly diagnosis prediction. Deep learning methods have achieved record-breaking results in various real-world prediction tasks. However, deep learning methods usually require a large amount of data for training, and the medical features that rarely appear in the data also pose great challenges for deep learning models. Besides, while deep learning models often achieve high accuracy, the lack of interpretation remains a problem for healthcare applications, which are naturally high-stakes. Existing works utilize medical ontology knowledge to enhance prediction performance and provide interpretable prediction results. Nevertheless, the ontology knowledge is coarse-grained, where many medical concepts and relationships are not included. In this paper, we propose to incorporate large-scale medical knowledge graphs (KGs) into our designed model, called KEIM (Knowledge graph Empowered Interpretable Model), for diagnosis prediction. Specifically, the KGs are first integrated into the time-series module of the model via a laplacian regularization to take advantage of the complex relationships among medical features. Subsequently, we construct a personalized KG for each visit and design a relation-aware attentive graph neural network based on the personalized KG to augment the time-series module for interpretable predictions. Extensive experiments on two benchmark healthcare datasets, namely, MIMIC-III and MIMIC-IV, show that our proposed KEIM not only achieves significant improvement in terms of AUC but also provides interpretability for diagnosis prediction with KGs.
KW - Diagnosis Prediction
KW - Healthcare Analytics
KW - Interpretability
KW - Medical Knowledge Graph
UR - http://www.scopus.com/inward/record.url?scp=85209578679&partnerID=8YFLogxK
U2 - 10.1007/978-981-97-5562-2_20
DO - 10.1007/978-981-97-5562-2_20
M3 - Conference contribution
AN - SCOPUS:85209578679
SN - 9789819755615
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 307
EP - 323
BT - Database Systems for Advanced Applications - 29th International Conference, DASFAA 2024, Proceedings
A2 - Onizuka, Makoto
A2 - Lee, Jae-Gil
A2 - Tong, Yongxin
A2 - Xiao, Chuan
A2 - Ishikawa, Yoshiharu
A2 - Lu, Kejing
A2 - Amer-Yahia, Sihem
A2 - Jagadish, H.V.
PB - Springer Science and Business Media Deutschland GmbH
Y2 - 2 July 2024 through 5 July 2024
ER -