Anxiety recognition based on multimodal social media data and cross-attention mechanism

Jianghong Zhu, Zhenwen Zhang, Zepeng Li*, Bin Hu*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Anxiety disorder is a common mental illness that involves persistent and recurrent episodes of intense anxiety and sudden feelings of fear or terror, which seriously affect the patient's study, work and life. Social media contains a large amount of multimodal data reflecting people's inner activities and emotional states, providing a new way for recognizing anxiety disorders. However, existing methods tend to excessively focus on the text data, and neglect the role of other types of data such as images. To address this issue, this paper proposes a Multimodal Anxiety Recognition model (MAR-IDCA) based on Image Description and Cross-Attention mechanism. Firstly, for each image of the user's post, caption generation model is applied to obtain the abstract semantic information of the image, and the possible character information in the image is obtained through optical character recognition technology, so as to generate the description of the image. Secondly, through an image encoder and two text encoders, the visual feature of the image, the textual features of the image description and the post text are extracted respectively. Then, a multi-head cross-attention mechanism is employed to fuse the visual feature, image description feature and post text feature. Finally, combined with other auxiliary information of user posts, the final multimodal data representation is obtained and classified. Experiments on two multimodal anxiety recognition datasets show that, compared with the existing models, MAR-IDCA has better anxiety recognition performance.

Original languageEnglish
Article number130473
JournalNeurocomputing
Volume646
DOIs
Publication statusPublished - 14 Sept 2025
Externally publishedYes

Keywords

  • Anxiety recognition
  • Cross-attention mechanism
  • Image description
  • Multimodal data
  • Social media

Fingerprint

Dive into the research topics of 'Anxiety recognition based on multimodal social media data and cross-attention mechanism'. Together they form a unique fingerprint.

Cite this