TY - JOUR
T1 - Depression recognition base on acoustic speech model of Multi-task emotional stimulus
AU - Xing, Yujuan
AU - Liu, Zhenyu
AU - Chen, Qiongqiong
AU - Li, Gang
AU - Ding, Zhijie
AU - Feng, Lei
AU - Hu, Bin
N1 - Publisher Copyright:
© 2023 Elsevier Ltd
PY - 2023/8
Y1 - 2023/8
N2 - Depression places great burden on families and society owning to its high prevalence recurrence and disability mortality. Using efficient and objective methods to recognized depression has attracted more and more attention from researchers. Subtle changes in the speaker's physical and mental state will be subconsciously reflected in vocal apparatus. Individuals have different responses to different emotional stimuli. Speech signals are easily affected by emotional stimuli, and thus will have a great impact on depression recognition. This study has two aims, first was to collect speech data in different emotional stimulus (positive, neutral and negative), and explore effective feature set with strong interpretability. The second aim was to design efficient multi-task recognition model. A depression recognition method based on max-relevance and min-redundancy (mRMR) with multi-class labels (MCL-mRMR) and multi-task stimulus weighted Bagging (MTSW-Bagging) classifier was proposed. Firstly, MCL-mRMR selected features which had high correlation with emotional valence and depression, meanwhile features' dimensions decreased. Next, MTSW-Bagging classifier was designed to recognize depression, whose base classifier was composed of weighted multi-task emotional stimulus classifiers. Experimental results showed that the features selected by MCL-mRMR had higher performance with the accuracy and F1 score were increased by 5.59% and 4.2% respectively compared with the original full features. Meanwhile, our proposed method was superior to baseline method with an improvement of 13.2% and 12.8% on accuracy and F1 score respectively. Compared with state-of-the-art related methods, our method also had its superiority of strong interpretability of features and being independent of training data scale.
AB - Depression places great burden on families and society owning to its high prevalence recurrence and disability mortality. Using efficient and objective methods to recognized depression has attracted more and more attention from researchers. Subtle changes in the speaker's physical and mental state will be subconsciously reflected in vocal apparatus. Individuals have different responses to different emotional stimuli. Speech signals are easily affected by emotional stimuli, and thus will have a great impact on depression recognition. This study has two aims, first was to collect speech data in different emotional stimulus (positive, neutral and negative), and explore effective feature set with strong interpretability. The second aim was to design efficient multi-task recognition model. A depression recognition method based on max-relevance and min-redundancy (mRMR) with multi-class labels (MCL-mRMR) and multi-task stimulus weighted Bagging (MTSW-Bagging) classifier was proposed. Firstly, MCL-mRMR selected features which had high correlation with emotional valence and depression, meanwhile features' dimensions decreased. Next, MTSW-Bagging classifier was designed to recognize depression, whose base classifier was composed of weighted multi-task emotional stimulus classifiers. Experimental results showed that the features selected by MCL-mRMR had higher performance with the accuracy and F1 score were increased by 5.59% and 4.2% respectively compared with the original full features. Meanwhile, our proposed method was superior to baseline method with an improvement of 13.2% and 12.8% on accuracy and F1 score respectively. Compared with state-of-the-art related methods, our method also had its superiority of strong interpretability of features and being independent of training data scale.
KW - Bagging
KW - Depression recognition
KW - Emotional stimulus
KW - Feature selection
UR - http://www.scopus.com/inward/record.url?scp=85159711824&partnerID=8YFLogxK
U2 - 10.1016/j.bspc.2023.104970
DO - 10.1016/j.bspc.2023.104970
M3 - Article
AN - SCOPUS:85159711824
SN - 1746-8094
VL - 85
JO - Biomedical Signal Processing and Control
JF - Biomedical Signal Processing and Control
M1 - 104970
ER -