跳到主要导航 跳到搜索 跳到主要内容

Multi-Modal Emotion Recognition With Graph Reinforcement Representation Network for Human-Robot Interaction

  • Dan Chen
  • , Zhen Tao Liu*
  • , Jinhua She
  • , Kaoru Hirota
  • , Seiichi Kawata
  • *此作品的通讯作者
  • China University of Geosciences, Wuhan
  • Ministry of Education in China
  • Tokyo University of Technology
  • Institute of Science Tokyo

科研成果: 期刊稿件文章同行评审

摘要

Emotion recognition in conversation is essential for achieving intelligent human-robot interaction (HRI). Accurate emotional understanding allows robots to engage in more natural and context-aware interactions with users. A novel graph reinforcement representation architecture (RGATN) is proposed for emotion recognition in HRI. The RGATN aims to analyze multi-modal conversational information to infer users' emotional states during interaction. Specifically, the architecture integrates a residual graph network (Res-GN) and a cross-modal graph channel attention network (CM-GCA). The Res-GN is proposed to efficiently represents information by reducing redundancy in fully connected graphs and incorporating all potential connectivity relationships. To address the inconsistencies in the quality of different modalities in HRI scenarios, the CM-GCA mechanism is presented. This mechanism preserves information from each modality while reconstructing and enhancing the overall graph representation by leveraging the adaptive cross-modal channel attention. The proposed method improves emotion recognition accuracy and robustness in HRI. We evaluated the method on three benchmark datasets, including IEMOCAP, MELD and M ^{3} ED, achieving weighted F1-score of 72.07%, 67.99% and 53.16%, respectively. Additionally, preliminary application experiments conducted on a self-built database demonstrated a recognition accuracy of 78.75%. The results highlight the ability to effectively adapt to inconsistent modal quality, further confirming its effectiveness in real-world HRI scenarios.

源语言英语
期刊IEEE Transactions on Multimedia
DOI
出版状态已接受/待刊 - 2026
已对外发布

指纹

探究 'Multi-Modal Emotion Recognition With Graph Reinforcement Representation Network for Human-Robot Interaction' 的科研主题。它们共同构成独一无二的指纹。

引用此