TY - JOUR
T1 - Patient Health Representation Learning via Correlational Sparse Prior of Medical Features
AU - Ma, Xinyu
AU - Wang, Yasha
AU - Chu, Xu
AU - Ma, Liantao
AU - Tang, Wen
AU - Zhao, Junfeng
AU - Yuan, Ye
AU - Wang, Guoren
N1 - Publisher Copyright:
© 1989-2012 IEEE.
PY - 2023/11/1
Y1 - 2023/11/1
N2 - Exploiting the correlations between medical features is essential to the success of healthcare data analysis. However, most existing methods are either suffering large estimation variance for data insufficiency or inflexible in terms of demanding task-specific medical knowledge. In this paper, we propose a novel patient health representation learning framework dubbed SAFARI. SAFARI learns a compact representation by imposing a clinical-fact-inspired task-agnostic correlational sparsity prior to the correlations of medical feature pairs. Specifically, we learn the compact representation by solving the bi-level optimization problem, which involves solving the high-level inter-group correlations and the nested lower-level intra-group correlations. We leverage the Laplacian kernel as a robust metric for feature grouping and graph neural networks for solving the bi-level optimization problem following the optimal value reformulation paradigm. Experiments on five datasets of various inputs and tasks demonstrate the efficacy of SAFARI. The discovered findings are also consistent with our insights and medical literature, which can provide valuable clinical explanations.
AB - Exploiting the correlations between medical features is essential to the success of healthcare data analysis. However, most existing methods are either suffering large estimation variance for data insufficiency or inflexible in terms of demanding task-specific medical knowledge. In this paper, we propose a novel patient health representation learning framework dubbed SAFARI. SAFARI learns a compact representation by imposing a clinical-fact-inspired task-agnostic correlational sparsity prior to the correlations of medical feature pairs. Specifically, we learn the compact representation by solving the bi-level optimization problem, which involves solving the high-level inter-group correlations and the nested lower-level intra-group correlations. We leverage the Laplacian kernel as a robust metric for feature grouping and graph neural networks for solving the bi-level optimization problem following the optimal value reformulation paradigm. Experiments on five datasets of various inputs and tasks demonstrate the efficacy of SAFARI. The discovered findings are also consistent with our insights and medical literature, which can provide valuable clinical explanations.
KW - Electronic health records
KW - clinical risk prediction
KW - representation learning
UR - http://www.scopus.com/inward/record.url?scp=85146223741&partnerID=8YFLogxK
U2 - 10.1109/TKDE.2022.3230454
DO - 10.1109/TKDE.2022.3230454
M3 - Article
AN - SCOPUS:85146223741
SN - 1041-4347
VL - 35
SP - 11769
EP - 11783
JO - IEEE Transactions on Knowledge and Data Engineering
JF - IEEE Transactions on Knowledge and Data Engineering
IS - 11
ER -