A Chinese web page classification algorithm based on combination of bayes classifier and clustering

Zhiqiang Li, Yuan Tan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

Allowing for shortcomings of existing Chinese web page classification algorithms, a web page classification algorithm based on the combination of Bayesian classification and clustering is present in this paper. When maximum similarity difference between page and raining positive examples and negative examples of cluster centre is greater than a given threshold, it belongs to the current class. Otherwise, call the Bayesian classifier for classification. The experiment shows that this method can reduce the train scale of classifiers and improve the training efficiency. Its precision and recall are very good, and the test speed is also very high.

Original languageEnglish
Title of host publicationFuture Communication Technology
PublisherWITPress
Pages315-323
Number of pages9
ISBN (Print)9781845648633
DOIs
Publication statusPublished - 2014
Event2013 International Conference on Communication Technology, ICCT 2013 - , Singapore
Duration: 15 Nov 201316 Nov 2013

Publication series

NameWIT Transactions on Information and Communication Technologies
Volume51
ISSN (Print)1743-3517

Conference

Conference2013 International Conference on Communication Technology, ICCT 2013
Country/TerritorySingapore
Period15/11/1316/11/13

Keywords

  • Bayesian classification
  • Chinese web page
  • Clustering
  • Data mining

Fingerprint

Dive into the research topics of 'A Chinese web page classification algorithm based on combination of bayes classifier and clustering'. Together they form a unique fingerprint.

Cite this