Speech-based Depression Detection Using Unsupervised Autoencoder

Guangyao Sun, Shenghui Zhao, Bochao Zou*, Yubo An

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

6 引用 (Scopus)

摘要

With the rapid development of society, over three hundred million people worldwide suffer from depression, which has become one of the most serious health problems in the world. As we know, depression detection is of great importance for its timely treatment. In this paper, a speech-based depression detection method using unsupervised autoencoder is proposed. Most previous methods encode the frame-level speech features into sentence-level features with statistical functions which lead to the loss of the temporal information between frames. To solve this, we propose an unsupervised network based on transformer. The unsupervised network is adopted to obtain the audio embedding vector of an audio segment from depressed or non-depressed people. Then the embedding audio vector is used for depression detection. The experimental results show that the proposed method achieves superior performance on both the English database DAIC and our self-built Chinese database CMDC.

源语言英语
主期刊名2022 7th International Conference on Signal and Image Processing, ICSIP 2022
出版商Institute of Electrical and Electronics Engineers Inc.
35-38
页数4
ISBN(电子版)9781665495639
DOI
出版状态已出版 - 2022
活动7th International Conference on Signal and Image Processing, ICSIP 2022 - Suzhou, 中国
期限: 20 7月 202222 7月 2022

出版系列

姓名2022 7th International Conference on Signal and Image Processing, ICSIP 2022

会议

会议7th International Conference on Signal and Image Processing, ICSIP 2022
国家/地区中国
Suzhou
时期20/07/2222/07/22

指纹

探究 'Speech-based Depression Detection Using Unsupervised Autoencoder' 的科研主题。它们共同构成独一无二的指纹。

引用此