Design of text categorization system based on SVM

Zhenyan Liu*, Weiping Wang, Yong Wang

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

2 引用 (Scopus)

摘要

This paper introduces the design of a text categorization system based on Support Vector Machine (SVM). It analyzes the high dimensional characteristic of text data, the reason why SVM is suitable for text categorization. According to system data flow this system is constructed. This system consists of three subsystems which are text representation, classifier training and text classification. The core of this system is the classifier training, but text representation directly influences the currency of classifier and the performance of the system. Text feature vector space can be built by different kinds of feature selection and feature extraction methods. No research can indicate which one is the best method, so many feature selection and feature extraction methods are all developed in this system. For a specific classification task every feature selection method and every feature extraction method will be tested, and then a set of the best methods will be adopted.

源语言英语
主期刊名Materials Science and Information Technology II
1191-1195
页数5
DOI
出版状态已出版 - 2012
活动2012 2nd International Conference on Materials Science and Information Technology, MSIT 2012 - Xi'an, Shaan, 中国
期限: 24 8月 201226 8月 2012

出版系列

姓名Advanced Materials Research
532-533
ISSN(印刷版)1022-6680

会议

会议2012 2nd International Conference on Materials Science and Information Technology, MSIT 2012
国家/地区中国
Xi'an, Shaan
时期24/08/1226/08/12

指纹

探究 'Design of text categorization system based on SVM' 的科研主题。它们共同构成独一无二的指纹。

引用此