KNN text categorization algorithm based on semantic centre

Xiao Fei Zhang*, He Yan Huang, Ke Liang Zhang

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

6 引用 (Scopus)

摘要

As a classical statistical pattern recognition algorithm characterized with high accuracy and stability, KNN has been used widely in text categorization. But since KNN's time complexity is directly proportional to the sample size, its classification speed is very slow. In this paper, we propose a new KNN text categorization algorithm based on semantic centre, which we call SKNN, to speed up the classification. The basic thread is to replace the large number of original sample documents with a small amount of sample semantic centers. Experiments have proved that the SKNN's clarification is over 10 times as fast as that of the traditional KNN and its F1 value is approximately equal to SVM and traditional KNN algorithm.

源语言英语
主期刊名Proceedings - 2009 International Conference on Information Technology and Computer Science, ITCS 2009
249-252
页数4
DOI
出版状态已出版 - 2009
已对外发布
活动2009 International Conference on Information Technology and Computer Science, ITCS 2009 - Kiev, 乌克兰
期限: 25 7月 200926 7月 2009

出版系列

姓名Proceedings - 2009 International Conference on Information Technology and Computer Science, ITCS 2009
1

会议

会议2009 International Conference on Information Technology and Computer Science, ITCS 2009
国家/地区乌克兰
Kiev
时期25/07/0926/07/09

指纹

探究 'KNN text categorization algorithm based on semantic centre' 的科研主题。它们共同构成独一无二的指纹。

引用此