Generating Emotional Coherence and Diverse Responses in a Multimodal Dialogue System

Yunfei Huang, Kan Li, Zhuo Chen, Lipeng Wang

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

The perception of emotion and the diversity of generated response are two key factors considered by researchers in multimodal dialogue generation. However, in the field of multimodal dialogue generation, these two key factors have not been considered at the same time. In our model, we first extract the features of each modal from the multimodal context dialogue, and use the heterogeneous graph neural network to represent the large graph network composed of dialogue history, voice, video, and speaker's emotional state. Then, we use conditional variational autoencoders to generate coherent and diverse responses. A large number of experiments have shown that our model can not only automatically generate reaction emotions in two multimodal datasets, but also has coherence and controllability, which is significantly better than previous more advanced models.

源语言英语
主期刊名Proceedings - 2021 2nd International Conference on Electronics, Communications and Information Technology, CECIT 2021
出版商Institute of Electrical and Electronics Engineers Inc.
625-630
页数6
ISBN(电子版)9781665437578
DOI
出版状态已出版 - 2021
活动2nd International Conference on Electronics, Communications and Information Technology, CECIT 2021 - Virtual, Sanya, 中国
期限: 27 12月 202129 12月 2021

出版系列

姓名Proceedings - 2021 2nd International Conference on Electronics, Communications and Information Technology, CECIT 2021

会议

会议2nd International Conference on Electronics, Communications and Information Technology, CECIT 2021
国家/地区中国
Virtual, Sanya
时期27/12/2129/12/21

指纹

探究 'Generating Emotional Coherence and Diverse Responses in a Multimodal Dialogue System' 的科研主题。它们共同构成独一无二的指纹。

引用此