TY - GEN
T1 - Speech-based Depression Detection Using Unsupervised Autoencoder
AU - Sun, Guangyao
AU - Zhao, Shenghui
AU - Zou, Bochao
AU - An, Yubo
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - With the rapid development of society, over three hundred million people worldwide suffer from depression, which has become one of the most serious health problems in the world. As we know, depression detection is of great importance for its timely treatment. In this paper, a speech-based depression detection method using unsupervised autoencoder is proposed. Most previous methods encode the frame-level speech features into sentence-level features with statistical functions which lead to the loss of the temporal information between frames. To solve this, we propose an unsupervised network based on transformer. The unsupervised network is adopted to obtain the audio embedding vector of an audio segment from depressed or non-depressed people. Then the embedding audio vector is used for depression detection. The experimental results show that the proposed method achieves superior performance on both the English database DAIC and our self-built Chinese database CMDC.
AB - With the rapid development of society, over three hundred million people worldwide suffer from depression, which has become one of the most serious health problems in the world. As we know, depression detection is of great importance for its timely treatment. In this paper, a speech-based depression detection method using unsupervised autoencoder is proposed. Most previous methods encode the frame-level speech features into sentence-level features with statistical functions which lead to the loss of the temporal information between frames. To solve this, we propose an unsupervised network based on transformer. The unsupervised network is adopted to obtain the audio embedding vector of an audio segment from depressed or non-depressed people. Then the embedding audio vector is used for depression detection. The experimental results show that the proposed method achieves superior performance on both the English database DAIC and our self-built Chinese database CMDC.
KW - Chinese depression database
KW - depression detection
KW - transformer
KW - unsupervised learning
UR - http://www.scopus.com/inward/record.url?scp=85139448410&partnerID=8YFLogxK
U2 - 10.1109/ICSIP55141.2022.9886372
DO - 10.1109/ICSIP55141.2022.9886372
M3 - Conference contribution
AN - SCOPUS:85139448410
T3 - 2022 7th International Conference on Signal and Image Processing, ICSIP 2022
SP - 35
EP - 38
BT - 2022 7th International Conference on Signal and Image Processing, ICSIP 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 7th International Conference on Signal and Image Processing, ICSIP 2022
Y2 - 20 July 2022 through 22 July 2022
ER -