TY - JOUR
T1 - PRA-Net
T2 - Part-and-Relation Attention Network for depression recognition from facial expression
AU - Liu, Zhenyu
AU - Yuan, Xiaoyan
AU - Li, Yutong
AU - Shangguan, Zixuan
AU - Zhou, Li
AU - Hu, Bin
N1 - Publisher Copyright:
© 2023
PY - 2023/5
Y1 - 2023/5
N2 - Artificial intelligence methods are widely applied to depression recognition and provide an objective solution. Many effective automated methods for detecting depression use facial expressions, which are strong indicators to reflect psychiatric disorders. However, these methods suffer from insufficient representations of depression. To this end, we propose a novel Part-and-Relation Attention Network (PRA-Net), which can enhance depression representations by accurately focusing on features that are highly correlated with depression. Specifically, we first perform partition on the feature map instead of the original image, in order to obtain part features rich in semantic information. Afterwards, self-attention is used to calculate the weight of each part feature. Following, the relationship between the part feature and the global content representation is explored by relation attention to refine the weight. Finally, all features are aggregated into a more compact and depression-informative representation via both weights for depression score prediction. Extensive experiments demonstrate the superiority of our method. Compared to other end-to-end methods, our method achieves state-of-the-art performance on AVEC2013 and AVEC2014.
AB - Artificial intelligence methods are widely applied to depression recognition and provide an objective solution. Many effective automated methods for detecting depression use facial expressions, which are strong indicators to reflect psychiatric disorders. However, these methods suffer from insufficient representations of depression. To this end, we propose a novel Part-and-Relation Attention Network (PRA-Net), which can enhance depression representations by accurately focusing on features that are highly correlated with depression. Specifically, we first perform partition on the feature map instead of the original image, in order to obtain part features rich in semantic information. Afterwards, self-attention is used to calculate the weight of each part feature. Following, the relationship between the part feature and the global content representation is explored by relation attention to refine the weight. Finally, all features are aggregated into a more compact and depression-informative representation via both weights for depression score prediction. Extensive experiments demonstrate the superiority of our method. Compared to other end-to-end methods, our method achieves state-of-the-art performance on AVEC2013 and AVEC2014.
KW - Automatic depression detection
KW - End-to-end network
KW - Facial expression
KW - Part features
KW - Relation attention
UR - http://www.scopus.com/inward/record.url?scp=85150230715&partnerID=8YFLogxK
U2 - 10.1016/j.compbiomed.2023.106589
DO - 10.1016/j.compbiomed.2023.106589
M3 - Article
C2 - 36934531
AN - SCOPUS:85150230715
SN - 0010-4825
VL - 157
JO - Computers in Biology and Medicine
JF - Computers in Biology and Medicine
M1 - 106589
ER -