TY - GEN
T1 - Generating Emotional Coherence and Diverse Responses in a Multimodal Dialogue System
AU - Huang, Yunfei
AU - Li, Kan
AU - Chen, Zhuo
AU - Wang, Lipeng
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - The perception of emotion and the diversity of generated response are two key factors considered by researchers in multimodal dialogue generation. However, in the field of multimodal dialogue generation, these two key factors have not been considered at the same time. In our model, we first extract the features of each modal from the multimodal context dialogue, and use the heterogeneous graph neural network to represent the large graph network composed of dialogue history, voice, video, and speaker's emotional state. Then, we use conditional variational autoencoders to generate coherent and diverse responses. A large number of experiments have shown that our model can not only automatically generate reaction emotions in two multimodal datasets, but also has coherence and controllability, which is significantly better than previous more advanced models.
AB - The perception of emotion and the diversity of generated response are two key factors considered by researchers in multimodal dialogue generation. However, in the field of multimodal dialogue generation, these two key factors have not been considered at the same time. In our model, we first extract the features of each modal from the multimodal context dialogue, and use the heterogeneous graph neural network to represent the large graph network composed of dialogue history, voice, video, and speaker's emotional state. Then, we use conditional variational autoencoders to generate coherent and diverse responses. A large number of experiments have shown that our model can not only automatically generate reaction emotions in two multimodal datasets, but also has coherence and controllability, which is significantly better than previous more advanced models.
KW - CVAE
KW - Dialogue system
KW - Graph heterogeneous neural network
KW - emotional dialogue generation
UR - http://www.scopus.com/inward/record.url?scp=85128646312&partnerID=8YFLogxK
U2 - 10.1109/CECIT53797.2021.00115
DO - 10.1109/CECIT53797.2021.00115
M3 - Conference contribution
AN - SCOPUS:85128646312
T3 - Proceedings - 2021 2nd International Conference on Electronics, Communications and Information Technology, CECIT 2021
SP - 625
EP - 630
BT - Proceedings - 2021 2nd International Conference on Electronics, Communications and Information Technology, CECIT 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2nd International Conference on Electronics, Communications and Information Technology, CECIT 2021
Y2 - 27 December 2021 through 29 December 2021
ER -