TY - GEN
T1 - Rethinking the Information Inside Documents for Sentiment Classification
AU - Jiang, Xinyu
AU - Shi, Chongyang
AU - Hao, Shufeng
AU - Yang, Dequan
AU - Feng, Chaoqun
N1 - Publisher Copyright:
© 2021, Springer Nature Switzerland AG.
PY - 2021
Y1 - 2021
N2 - Document-level sentiment classification aims to predict the sentiment rating of a particular document. Most of the methods take word embeddings as the inputs of a neural network. However, most existing methods fail to account for the fact that a specific word usually contains a certain amount of redundant information, and generally different words contain different amounts of redundancy. Such ambiguous word representation will cause models to misunderstand a text, and thus to incorrectly predict the sentiment, meanwhile simply treating words with different amounts of redundancy equally is not appropriate. Moreover, these methods take the user ratings as the training target, which leads to the fact that the information selected by the model is usually limited to the rating itself and there is no obvious sentiment tendency. Accordingly, we propose a Rethinking mechanism (R-TM) to rethink the information inside documents. More specifically, R-TM filters out redundancy contained in different words from different levels, and selects information from positive and negative two different perspectives. Our experimental results demonstrate that the proposed mechanisms can achieve consistent improvements compared to state-of-the-art methods.
AB - Document-level sentiment classification aims to predict the sentiment rating of a particular document. Most of the methods take word embeddings as the inputs of a neural network. However, most existing methods fail to account for the fact that a specific word usually contains a certain amount of redundant information, and generally different words contain different amounts of redundancy. Such ambiguous word representation will cause models to misunderstand a text, and thus to incorrectly predict the sentiment, meanwhile simply treating words with different amounts of redundancy equally is not appropriate. Moreover, these methods take the user ratings as the training target, which leads to the fact that the information selected by the model is usually limited to the rating itself and there is no obvious sentiment tendency. Accordingly, we propose a Rethinking mechanism (R-TM) to rethink the information inside documents. More specifically, R-TM filters out redundancy contained in different words from different levels, and selects information from positive and negative two different perspectives. Our experimental results demonstrate that the proposed mechanisms can achieve consistent improvements compared to state-of-the-art methods.
KW - Double-perspective selection
KW - Hierarchical architecture
KW - Multi-level filtering
KW - Rethinking mechanism
KW - Sentiment classification
UR - http://www.scopus.com/inward/record.url?scp=85113779039&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-82136-4_34
DO - 10.1007/978-3-030-82136-4_34
M3 - Conference contribution
AN - SCOPUS:85113779039
SN - 9783030821357
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 421
EP - 432
BT - Knowledge Science, Engineering and Management - 14th International Conference, KSEM 2021, Proceedings
A2 - Qiu, Han
A2 - Zhang, Cheng
A2 - Fei, Zongming
A2 - Qiu, Meikang
A2 - Kung, Sun-Yuan
PB - Springer Science and Business Media Deutschland GmbH
T2 - 14th International Conference on Knowledge Science, Engineering and Management, KSEM 2021
Y2 - 14 August 2021 through 16 August 2021
ER -