A quantum-inspired multimodal sentiment analysis framework

Yazhou Zhang, Dawei Song*, Peng Zhang, Panpan Wang, Jingfei Li, Xiang Li, Benyou Wang

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

84 引用 (Scopus)
Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 84
  • Captures
    • Readers: 77
see details

摘要

Multimodal sentiment analysis aims to capture diversified sentiment information implied in data that are of different modalities (e.g., an image that is associated with a textual description or a set of textual labels). The key challenge is rooted on the “semantic gap” between different low-level content features and high-level semantic information. Existing approaches generally utilize a combination of multimodal features in a somehow heuristic way. However, how to employ and combine multiple information from different sources effectively is still an important yet largely unsolved problem. To address the problem, in this paper, we propose a Quantum-inspired Multimodal Sentiment Analysis (QMSA) framework. The framework consists of a Quantum-inspired Multimodal Representation (QMR) model (which aims to fill the “semantic gap” and model the correlations between different modalities via density matrix), and a Multimodal decision Fusion strategy inspired by Quantum Interference (QIMF) in the double-slit experiment (in which the sentiment label is analogous to a photon, and the data modalities are analogous to slits). Extensive experiments are conducted on two large scale datasets, which are collected from the Getty Images and Flickr photo sharing platform. The experimental results show that our approach significantly outperforms a wide range of baselines and state-of-the-art methods.

源语言英语
页(从-至)21-40
页数20
期刊Theoretical Computer Science
752
DOI
出版状态已出版 - 15 12月 2018

指纹

探究 'A quantum-inspired multimodal sentiment analysis framework' 的科研主题。它们共同构成独一无二的指纹。

引用此

Zhang, Y., Song, D., Zhang, P., Wang, P., Li, J., Li, X., & Wang, B. (2018). A quantum-inspired multimodal sentiment analysis framework. Theoretical Computer Science, 752, 21-40. https://doi.org/10.1016/j.tcs.2018.04.029