LLM-based semantic integration of stimulus–response pairs for depression detection in interview scenarios

  • Zhenyu Liu
  • , Jiahang Chen
  • , Bo Chen
  • , Bohua Zhao
  • , Haibo Zhang
  • , Gang Li
  • , Zhijie Ding
  • , Bin Hu*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Depression is becoming increasingly prevalent with the accelerating pace of life and rising psychological pressures. While text-based depression detection has gained attention, most existing approaches focus solely on participants’ responses, overlooking the semantic role of the prompting questions–an essential component in clinical assessments. Although some studies have attempted to incorporate stimulus information, they often lack targeted design and contextual depth, limiting their effectiveness. To address these limitations, we propose a prompt-based explicit fusion strategy that leverages large language models to accurately integrate the semantics of both stimuli and responses, thereby reducing context comprehension bias. Furthermore, we introduce the FMAN framework, which combines multi-scale semantic features from diverse pre-trained models and a dynamic focused attention module for implicit fusion. We evaluate our method on three datasets: MIDD (536 Chinese participants), DAIC* (189 English participants), and CMDC (78 Chinese participants). Our approach achieves accuracies of 87.31 %, 80.43 %, and 98.89 %, respectively, surpassing mainstream models. These results demonstrate the effectiveness of our framework and offer new insights into NLP-based depression detection in interview scenarios.

Original languageEnglish
JournalExpert Systems with Applications
Volume300
DOIs
Publication statusPublished - Mar 2026
Externally publishedYes

Keywords

  • Depression detection
  • Large language models
  • Natural language processing
  • Semantic integration
  • Stimulus-response

Fingerprint

Dive into the research topics of 'LLM-based semantic integration of stimulus–response pairs for depression detection in interview scenarios'. Together they form a unique fingerprint.

Cite this