Combining Informative Regions and Clips for Detecting Depression from Facial Expressions

Xiaoyan Yuan; Zhenyu Liu; Qiongqiong Chen; Gang Li; Zhijie Ding; Zixuan Shangguan; Bin Hu

doi:10.1007/s12559-023-10157-0

Combining Informative Regions and Clips for Detecting Depression from Facial Expressions

Xiaoyan Yuan, Zhenyu Liu^*, Qiongqiong Chen, Gang Li, Zhijie Ding, Zixuan Shangguan, Bin Hu^*

^*此作品的通讯作者

科研成果: 期刊稿件 › 文章 › 同行评审

2 引用（Scopus）

摘要

Artificial intelligence methods are widely applied to depression recognition and provide an objective solution. Many effective automated methods for detecting depression use facial expressions, which are strong indicators of psychiatric disorders. However, existing approaches ignore the uneven distribution of depression information in time and space. Therefore, these approaches have limitations in their ability to form discriminative depression representations. In this paper, we propose a framework based on information regions and clips for depression detection. Specifically, we first divide the regions of interest (ROIs), which are regarded as spatially informative regions, according to pathological knowledge of depression. Following this, the local-MHHLBP-BiLSTM (LMB) module is proposed as a feature extractor to exploit short-term and long-term temporal information. Finally, an improved attention mechanism with a balancing factor is introduced into LMB to increase attention to information segments. The proposed model performs tenfold cross-validation on our 150-subject video dataset and outperforms most state-of-the-art approaches with accuracy = 0.757, precision = 0.767, recall = 0.786, and F1 score = 0.761. The obtained results demonstrate that focusing on information regions, and clips can effectively reduce the error in depression diagnosis. More importantly, we observe that the area near the eye is fairly informative and that depressed individuals blink more frequently.

源语言	英语
页（从-至）	1961-1972
页数	12
期刊	Cognitive Computation
卷	15
期	6
DOI	https://doi.org/10.1007/s12559-023-10157-0
出版状态	已出版 - 11月 2023
已对外发布	是

访问文件

10.1007/s12559-023-10157-0

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{58548af198134e21b7a4e32431aeeaff,

title = "Combining Informative Regions and Clips for Detecting Depression from Facial Expressions",

abstract = "Artificial intelligence methods are widely applied to depression recognition and provide an objective solution. Many effective automated methods for detecting depression use facial expressions, which are strong indicators of psychiatric disorders. However, existing approaches ignore the uneven distribution of depression information in time and space. Therefore, these approaches have limitations in their ability to form discriminative depression representations. In this paper, we propose a framework based on information regions and clips for depression detection. Specifically, we first divide the regions of interest (ROIs), which are regarded as spatially informative regions, according to pathological knowledge of depression. Following this, the local-MHHLBP-BiLSTM (LMB) module is proposed as a feature extractor to exploit short-term and long-term temporal information. Finally, an improved attention mechanism with a balancing factor is introduced into LMB to increase attention to information segments. The proposed model performs tenfold cross-validation on our 150-subject video dataset and outperforms most state-of-the-art approaches with accuracy = 0.757, precision = 0.767, recall = 0.786, and F1 score = 0.761. The obtained results demonstrate that focusing on information regions, and clips can effectively reduce the error in depression diagnosis. More importantly, we observe that the area near the eye is fairly informative and that depressed individuals blink more frequently.",

keywords = "Artificial intelligence, Attention mechanism, Automatic depression recognition, Facial expression, Information regions and clips",

author = "Xiaoyan Yuan and Zhenyu Liu and Qiongqiong Chen and Gang Li and Zhijie Ding and Zixuan Shangguan and Bin Hu",

note = "Publisher Copyright: {\textcopyright} 2023, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.",

year = "2023",

month = nov,

doi = "10.1007/s12559-023-10157-0",

language = "English",

volume = "15",

pages = "1961--1972",

journal = "Cognitive Computation",

issn = "1866-9956",

publisher = "Springer New York",

number = "6",

}

TY - JOUR

T1 - Combining Informative Regions and Clips for Detecting Depression from Facial Expressions

AU - Yuan, Xiaoyan

AU - Liu, Zhenyu

AU - Chen, Qiongqiong

AU - Li, Gang

AU - Ding, Zhijie

AU - Shangguan, Zixuan

AU - Hu, Bin

PY - 2023/11

Y1 - 2023/11

N2 - Artificial intelligence methods are widely applied to depression recognition and provide an objective solution. Many effective automated methods for detecting depression use facial expressions, which are strong indicators of psychiatric disorders. However, existing approaches ignore the uneven distribution of depression information in time and space. Therefore, these approaches have limitations in their ability to form discriminative depression representations. In this paper, we propose a framework based on information regions and clips for depression detection. Specifically, we first divide the regions of interest (ROIs), which are regarded as spatially informative regions, according to pathological knowledge of depression. Following this, the local-MHHLBP-BiLSTM (LMB) module is proposed as a feature extractor to exploit short-term and long-term temporal information. Finally, an improved attention mechanism with a balancing factor is introduced into LMB to increase attention to information segments. The proposed model performs tenfold cross-validation on our 150-subject video dataset and outperforms most state-of-the-art approaches with accuracy = 0.757, precision = 0.767, recall = 0.786, and F1 score = 0.761. The obtained results demonstrate that focusing on information regions, and clips can effectively reduce the error in depression diagnosis. More importantly, we observe that the area near the eye is fairly informative and that depressed individuals blink more frequently.

AB - Artificial intelligence methods are widely applied to depression recognition and provide an objective solution. Many effective automated methods for detecting depression use facial expressions, which are strong indicators of psychiatric disorders. However, existing approaches ignore the uneven distribution of depression information in time and space. Therefore, these approaches have limitations in their ability to form discriminative depression representations. In this paper, we propose a framework based on information regions and clips for depression detection. Specifically, we first divide the regions of interest (ROIs), which are regarded as spatially informative regions, according to pathological knowledge of depression. Following this, the local-MHHLBP-BiLSTM (LMB) module is proposed as a feature extractor to exploit short-term and long-term temporal information. Finally, an improved attention mechanism with a balancing factor is introduced into LMB to increase attention to information segments. The proposed model performs tenfold cross-validation on our 150-subject video dataset and outperforms most state-of-the-art approaches with accuracy = 0.757, precision = 0.767, recall = 0.786, and F1 score = 0.761. The obtained results demonstrate that focusing on information regions, and clips can effectively reduce the error in depression diagnosis. More importantly, we observe that the area near the eye is fairly informative and that depressed individuals blink more frequently.

KW - Artificial intelligence

KW - Attention mechanism

KW - Automatic depression recognition

KW - Facial expression

KW - Information regions and clips

UR - http://www.scopus.com/inward/record.url?scp=85163093987&partnerID=8YFLogxK

U2 - 10.1007/s12559-023-10157-0

DO - 10.1007/s12559-023-10157-0

M3 - Article

AN - SCOPUS:85163093987

SN - 1866-9956

VL - 15

SP - 1961

EP - 1972

JO - Cognitive Computation

JF - Cognitive Computation

IS - 6

ER -

Combining Informative Regions and Clips for Detecting Depression from Facial Expressions

摘要

访问文件

其它文件与链接

指纹

引用此