On-line text categorization algorithm based on information fusion: Semantic SVM

Liu Ling Dai*, Xue Mei Li, He Yan Huang, Zhao Xiong Chen

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

The aim of this paper is to make SVMs (Support Vector Machines) more applicable to on-line text categorization applications. As SVMs are of good generation ability even with small training sets and text feature vectors are clustery in the feature space, an algorithm for text categorization, namely, semantic Support Vector Machine (Semantic SVM), is proposed by substituting the original training text set with the semantic center set. This semantic center set is used as the training text and support vector candidates. The steps to generate the semantic center set and the framework of the on-line learning algorithm of semantic SVM are then presented, as well as the implementation of the on-line learning algorithm based on Sequential Minimal Optimization. Experimental results show that, compared with the standard SVMs, the proposed semantic SVM and its algorithm can improve the on-line learning speed and the classifying speed by orders with a high classifying veracity.

Original languageEnglish
Pages (from-to)67-72
Number of pages6
JournalHuanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science)
Volume32
Issue numberSUPPL.
Publication statusPublished - Nov 2004
Externally publishedYes

Keywords

  • On-line learning
  • Semantic Support Vector Machine
  • Support Vector Machine
  • Text categorization

Fingerprint

Dive into the research topics of 'On-line text categorization algorithm based on information fusion: Semantic SVM'. Together they form a unique fingerprint.

Cite this