TY - JOUR
T1 - An attention-based multi-modal MRI fusion model for major depressive disorder diagnosis
AU - Zheng, Guowei
AU - Zheng, Weihao
AU - Zhang, Yu
AU - Wang, Junyu
AU - Chen, Miao
AU - Wang, Yin
AU - Cai, Tianhong
AU - Yao, Zhijun
AU - Hu, Bin
N1 - Publisher Copyright:
© 2023 IOP Publishing Ltd
PY - 2023/12/1
Y1 - 2023/12/1
N2 - Objective. Major depressive disorder (MDD) is one of the biggest threats to human mental health. MDD is characterized by aberrant changes in both structure and function of the brain. Although recent studies have developed some deep learning models based on multi-modal magnetic resonance imaging (MRI) for MDD diagnosis, the latent associations between deep features derived from different modalities were largely unexplored by previous studies, which we hypothesized may have potential benefits in improving the diagnostic accuracy of MDD. Approach. In this study, we proposed a novel deep learning model that fused both structural MRI (sMRI) and resting-state MRI (rs-fMRI) data to enhance the diagnosis of MDD by capturing the interactions between deep features extracted from different modalities. Specifically, we first employed a brain function encoder (BFE) and a brain structure encoder (BSE) to extract the deep features from fMRI and sMRI, respectively. Then, we designed a function and structure co-attention fusion (FSCF) module that captured inter-modal interactions and adaptively fused multi-modal deep features for MDD diagnosis. Main results. This model was evaluated on a large cohort and achieved a high classification accuracy of 75.2% for MDD diagnosis. Moreover, the attention distribution of the FSCF module assigned higher attention weights to structural features than functional features for diagnosing MDD. Significance. The high classification accuracy highlights the effectiveness and potential clinical of the proposed model.
AB - Objective. Major depressive disorder (MDD) is one of the biggest threats to human mental health. MDD is characterized by aberrant changes in both structure and function of the brain. Although recent studies have developed some deep learning models based on multi-modal magnetic resonance imaging (MRI) for MDD diagnosis, the latent associations between deep features derived from different modalities were largely unexplored by previous studies, which we hypothesized may have potential benefits in improving the diagnostic accuracy of MDD. Approach. In this study, we proposed a novel deep learning model that fused both structural MRI (sMRI) and resting-state MRI (rs-fMRI) data to enhance the diagnosis of MDD by capturing the interactions between deep features extracted from different modalities. Specifically, we first employed a brain function encoder (BFE) and a brain structure encoder (BSE) to extract the deep features from fMRI and sMRI, respectively. Then, we designed a function and structure co-attention fusion (FSCF) module that captured inter-modal interactions and adaptively fused multi-modal deep features for MDD diagnosis. Main results. This model was evaluated on a large cohort and achieved a high classification accuracy of 75.2% for MDD diagnosis. Moreover, the attention distribution of the FSCF module assigned higher attention weights to structural features than functional features for diagnosing MDD. Significance. The high classification accuracy highlights the effectiveness and potential clinical of the proposed model.
KW - deep learning
KW - magnetic resonance imaging
KW - major depressive disorder
KW - multi-modal fusion
UR - http://www.scopus.com/inward/record.url?scp=85183505433&partnerID=8YFLogxK
U2 - 10.1088/1741-2552/ad038c
DO - 10.1088/1741-2552/ad038c
M3 - Article
C2 - 37844568
AN - SCOPUS:85183505433
SN - 1741-2560
VL - 20
JO - Journal of Neural Engineering
JF - Journal of Neural Engineering
IS - 6
M1 - 066005
ER -