Chinese comparative sentence identification based on the combination of rules and statistics

Quanchao Liu, Heyan Huang, Chen Zhang, Zhenzhao Chen, Jiajun Chen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

11 Citations (Scopus)

Abstract

Opinions always carry important information of texts, but comparative sentence is a common way to express opinions. We describe how to recognize comparative sentences from Chinese text documents by combining rule-based methods and statistical methods as well as analyze the performance of these methods. The method firstly normalizes the corpus and Chinese word segmentation, and then gets the broad extraction results by using comparative words, sentence structure templates and dependency relation analysis. Finally we take CSR, comparative words and statistical feature words as classification features of SVM to accurately identify comparative sentences in the broad extraction results. The experiments with COAE 2013's test data show that our approach provides better performance than the baselines and most systems reported at CCIR 2013.

Original languageEnglish
Title of host publicationAdvanced Data Mining and Applications - 9th International Conference, ADMA 2013, Proceedings
Pages300-310
Number of pages11
EditionPART 2
DOIs
Publication statusPublished - 2013
Event9th International Conference on Advanced Data Mining and Applications, ADMA 2013 - Hangzhou, China
Duration: 14 Dec 201316 Dec 2013

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 2
Volume8347 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference9th International Conference on Advanced Data Mining and Applications, ADMA 2013
Country/TerritoryChina
CityHangzhou
Period14/12/1316/12/13

Keywords

  • CRF
  • CSR
  • Chinese Comparative Sentence
  • Comparative Sentence
  • SVM

Fingerprint

Dive into the research topics of 'Chinese comparative sentence identification based on the combination of rules and statistics'. Together they form a unique fingerprint.

Cite this