Unsupervised sentiment analysis of twitter posts using density matrix representation

  • Yazhou Zhang
  • , Dawei Song*
  • , Xiang Li
  • , Peng Zhang
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Nowadays, a series of pioneering studies provide the evidence that quantum probability theory can be applied in information retrieval as a mathematical framework, such as Quantum Language Model (QLM) and its variants. In these studies, the density matrix, which is defined on the quantum probabilistic space, is used to represent query and document. However, these studies are only designed for information retrieval tasks, which are unable to model sentiment information. In this paper, we investigate the feasibility of quantum probability theory for twitter sentiment analysis, and propose a density matrix based unsupervised sentiment analysis approach. The main idea is to artificially create two sentiment dictionaries, generate density matrices of documents and dictionaries using an extended QLM, then employ the quantum relative entropy to judge the similarity between density matrices of documents and dictionaries. Extensive experiments are conducted on two widely used twitter datasets, which are the Obama-McCain Debate (OMD) dataset and Sentiment Strength Twitter Dataset (SS-Tweet). The experimental results show that our approach significantly outperforms a number of baselines, demonstrating the effectiveness of the proposed density matrix based sentiment analysis approach.

Original languageEnglish
Title of host publicationAdvances in Information Retrieval - 40th European Conference on IR Research, ECIR 2018, Proceedings
EditorsLeif Azzopardi, Gabriella Pasi, Allan Hanbury, Benjamin Piwowarski
PublisherSpringer Verlag
Pages316-329
Number of pages14
ISBN (Print)9783319769400
DOIs
Publication statusPublished - 2018
Externally publishedYes
Event40th European Conference on Information Retrieval, ECIR 2018 - Grenoble, France
Duration: 26 Mar 201829 Mar 2018

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10772 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference40th European Conference on Information Retrieval, ECIR 2018
Country/TerritoryFrance
CityGrenoble
Period26/03/1829/03/18

Keywords

  • Density matrix
  • Quantum Language Model
  • Sentiment analysis

Fingerprint

Dive into the research topics of 'Unsupervised sentiment analysis of twitter posts using density matrix representation'. Together they form a unique fingerprint.

Cite this