Rethinking the Information Inside Documents for Sentiment Classification

Xinyu Jiang, Chongyang Shi*, Shufeng Hao, Dequan Yang, Chaoqun Feng

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Document-level sentiment classification aims to predict the sentiment rating of a particular document. Most of the methods take word embeddings as the inputs of a neural network. However, most existing methods fail to account for the fact that a specific word usually contains a certain amount of redundant information, and generally different words contain different amounts of redundancy. Such ambiguous word representation will cause models to misunderstand a text, and thus to incorrectly predict the sentiment, meanwhile simply treating words with different amounts of redundancy equally is not appropriate. Moreover, these methods take the user ratings as the training target, which leads to the fact that the information selected by the model is usually limited to the rating itself and there is no obvious sentiment tendency. Accordingly, we propose a Rethinking mechanism (R-TM) to rethink the information inside documents. More specifically, R-TM filters out redundancy contained in different words from different levels, and selects information from positive and negative two different perspectives. Our experimental results demonstrate that the proposed mechanisms can achieve consistent improvements compared to state-of-the-art methods.

Original languageEnglish
Title of host publicationKnowledge Science, Engineering and Management - 14th International Conference, KSEM 2021, Proceedings
EditorsHan Qiu, Cheng Zhang, Zongming Fei, Meikang Qiu, Sun-Yuan Kung
PublisherSpringer Science and Business Media Deutschland GmbH
Pages421-432
Number of pages12
ISBN (Print)9783030821357
DOIs
Publication statusPublished - 2021
Event14th International Conference on Knowledge Science, Engineering and Management, KSEM 2021 - Tokyo, Japan
Duration: 14 Aug 202116 Aug 2021

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12815 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference14th International Conference on Knowledge Science, Engineering and Management, KSEM 2021
Country/TerritoryJapan
CityTokyo
Period14/08/2116/08/21

Keywords

  • Double-perspective selection
  • Hierarchical architecture
  • Multi-level filtering
  • Rethinking mechanism
  • Sentiment classification

Fingerprint

Dive into the research topics of 'Rethinking the Information Inside Documents for Sentiment Classification'. Together they form a unique fingerprint.

Cite this