Abstract
Sentiment analysis in conversations is an emerging yet challenging artificial intelligence (AI) task. It aims to discover the affective states and emotional changes of speakers involved in a conversation on the basis of their opinions, which are carried by different modalities of information (e.g., a video associated with a transcript). There exists a wealth of intra- and inter-utterance interaction information that affects the emotions of speakers in a complex and dynamic way. How to accurately and comprehensively model these complicated interactions is the key problem in this field. To address this problem, in this paper, we propose a novel and comprehensive framework for multimodal sentiment analysis in conversations, called a quantum-like multimodal network (QMN), which leverages the mathematical formalism of quantum theory (QT) and a long short-term memory (LSTM) network. Specifically, the QMN framework consists of a multimodal decision fusion approach inspired by quantum interference theory to capture the interactions within each utterance (i.e., the correlations between different modalities) and a strong-weak influence model inspired by quantum measurement theory to model the interactions between adjacent utterances (i.e., how one speaker influences another). Extensive experiments are conducted on two widely used conversational sentiment datasets: the MELD and IEMOCAP datasets. The experimental results show that our approach significantly outperforms a wide range of baselines and state-of-the-art models.
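The abstract does not give the fusion formulas, but quantum-interference-inspired decision fusion of the kind it describes typically combines per-modality class probabilities with an interference cross term, $p_1 + p_2 + 2\sqrt{p_1 p_2}\cos\theta$, rather than a plain mixture. Below is a minimal sketch of that idea under stated assumptions: two modalities, per-class phase parameters, and renormalization; the function and variable names (`quantum_interference_fusion`, `p_text`, `p_visual`, `theta`) are hypothetical and not taken from the paper, where the phases would presumably be learned rather than fixed by hand.

```python
import numpy as np

def quantum_interference_fusion(p_text, p_visual, theta):
    """Fuse two per-class probability distributions with a
    quantum-interference-style cross term (illustrative sketch,
    not the paper's exact QMN formulation).

    p_text, p_visual: class-probability vectors from each modality.
    theta: per-class phase angles controlling the sign and strength
           of the interference term (hypothetical free parameters).
    """
    p_text = np.asarray(p_text, dtype=float)
    p_visual = np.asarray(p_visual, dtype=float)
    # Classical sum of the two modality posteriors plus the
    # interference term 2 * sqrt(p1 * p2) * cos(theta).
    fused = p_text + p_visual + 2.0 * np.sqrt(p_text * p_visual) * np.cos(theta)
    fused = np.clip(fused, 0.0, None)  # guard against negative mass
    return fused / fused.sum()         # renormalize to a distribution

# Usage: 3 sentiment classes (negative, neutral, positive).
p_t = [0.2, 0.3, 0.5]                                  # text posterior
p_v = [0.4, 0.4, 0.2]                                  # visual posterior
theta = np.array([np.pi / 3, np.pi / 2, 2 * np.pi / 3])
print(quantum_interference_fusion(p_t, p_v, theta))
```

When $\cos\theta = 0$ for every class, the cross term vanishes and the fusion reduces to a renormalized average of the two modalities; nonzero phases let the model amplify or suppress classes on which the modalities agree, which is what distinguishes this family of fusion rules from simple late fusion.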
| Original language | English |
| --- | --- |
| Pages (from-to) | 14-31 |
| Number of pages | 18 |
| Journal | Information Fusion |
| Volume | 62 |
| DOI | |
| Publication status | Published - October 2020 |