Abstract
The aim of this paper is to make SVMs (Support Vector Machines) more applicable to on-line text categorization applications. As SVMs are of good generation ability even with small training sets and text feature vectors are clustery in the feature space, an algorithm for text categorization, namely, semantic Support Vector Machine (Semantic SVM), is proposed by substituting the original training text set with the semantic center set. This semantic center set is used as the training text and support vector candidates. The steps to generate the semantic center set and the framework of the on-line learning algorithm of semantic SVM are then presented, as well as the implementation of the on-line learning algorithm based on Sequential Minimal Optimization. Experimental results show that, compared with the standard SVMs, the proposed semantic SVM and its algorithm can improve the on-line learning speed and the classifying speed by orders with a high classifying veracity.
Original language | English |
---|---|
Pages (from-to) | 67-72 |
Number of pages | 6 |
Journal | Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science) |
Volume | 32 |
Issue number | SUPPL. |
Publication status | Published - Nov 2004 |
Externally published | Yes |
Keywords
- On-line learning
- Semantic Support Vector Machine
- Support Vector Machine
- Text categorization