Method of text vector construction based on concept cluster

Yang Feng*, Sen Lin Luo, Li Min Pan, Li Li Liu, Kai Jiang Chen

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

To enhance the performance of the text vector, terms were clustered, which contained similar syntax or semantic feature, to construct concept cluster. The text vector would be transformed from term-space to concept-cluster-space to represent the original text. The experiment compared effects of text classification based on TF-IDF, IG, TF-IDF-IG, LSA, and their combinations with concept cluster. And the results show that, the text vector based on concept cluster improves the accuracy of text concept approaching, and advances the discriminating degree between different types of texts.

Original languageEnglish
Pages (from-to)44-47
Number of pages4
JournalTongxin Xuebao/Journal on Communications
Volume31
Issue number8 A
Publication statusPublished - Aug 2010

Keywords

  • Chinese information processing
  • Concept cluster
  • Text classification
  • Text vector

Cite this