Multimodal depression detection using a deep feature fusion network

Guangyao Sun, Shenghui Zhao, Bochao Zou*, Yubo An

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

2 引用 (Scopus)

摘要

Currently, more and more people are suffering from depression with the increase of social pressure, which has become one of the most severe health issues worldwide. Therefore, timely diagonosis of depression is very important. In this paper, a deep feature fusion network is proposed for multimodal depression detection. Firstly, an unsupervised autoencoder based on transformer is applied to derive the sentence-level embedding for the frame-level audiovisual features; then a deep feature fusion network based on a cross-modal transformer is proposed to fuse the text, audio and video features. The experimental results show that the proposed method achieves superior performance compared to state-of-the-art methods on the English database DAIC-WOZ.

源语言英语
主期刊名Third International Conference on Computer Science and Communication Technology, ICCSCT 2022
编辑Yingfa Lu, Changbo Cheng
出版商SPIE
ISBN(电子版)9781510661240
DOI
出版状态已出版 - 2022
活动3rd International Conference on Computer Science and Communication Technology, ICCSCT 2022 - Beijing, 中国
期限: 30 7月 202231 7月 2022

出版系列

姓名Proceedings of SPIE - The International Society for Optical Engineering
12506
ISSN(印刷版)0277-786X
ISSN(电子版)1996-756X

会议

会议3rd International Conference on Computer Science and Communication Technology, ICCSCT 2022
国家/地区中国
Beijing
时期30/07/2231/07/22

指纹

探究 'Multimodal depression detection using a deep feature fusion network' 的科研主题。它们共同构成独一无二的指纹。

引用此