Abstract
Depression recognition based on facial movements has garnered widespread attention in recent years. One key challenge faced by existing studies is that individual heterogeneity in facial movements severely affects model performance. This paper, for the first time, proposes an innovative solution targeting two critical aspects: data acquisition and model design. In data acquisition, we replace the traditional interview task with a video-watching paradigm to eliminate mouth movements induced by verbal expression to obtain pure facial movement flows. In model design, we treat facial action units (AUs) as covariates to align discriminative depression cues to mitigate the impact of individual heterogeneity. Specifically, our proposed FDSNet statistically selects AUs showing significant differences between depressed and healthy groups and combines the AUs-based scoring module and graph attention network to guide the model in dynamically focusing on discriminative key video segments. Experimental results show that FDSNet achieves superior accuracy and generalization performance compared to SOTA methods. Our work suggests that controlling data acquisition to ensure pure facial movement data, along with employing a personalized modeling strategy, are both critical to mitigating individual heterogeneity and enhancing model performance.
| Original language | English |
|---|---|
| Journal | IEEE Transactions on Circuits and Systems for Video Technology |
| DOIs | |
| Publication status | Accepted/In press - 2026 |
| Externally published | Yes |
Keywords
- Affective Computing
- Depression Recognition
- Facial Movements
- Graph Attention Network
- Individual Heterogeneity
Fingerprint
Dive into the research topics of 'A Covariate-Guided Graph Attention Network for Depression Recognition via Pure Facial Movements'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver