Text Categorization Based on Topic Model

Shibin Zhou*, Kan Li, Yushu Liu

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 1
  • Captures
    • Readers: 33
see details

摘要

In the text literature, many topic models were proposed to represent documents and words as topics or latent topics in order to process text effectively and accurately. In this paper, we propose LDACLM or Latent Dirichlet Allocation Category Language Model for text categorization and estimate parameters of models by variational inference. As a variant of Latent Dirichlet Allocation Model, LDACLM regards documents of category as Language Model and uses variational parameters to estimate maximum a posteriori of terms. In general, experiments show LDACLM model is effective and outperform Naïve Bayes with Laplace smoothing and Rocchio algorithm but little inferior to SVM for text categorization.

源语言英语
页(从-至)398-409
页数12
期刊International Journal of Computational Intelligence Systems
2
4
DOI
出版状态已出版 - 12月 2009

指纹

探究 'Text Categorization Based on Topic Model' 的科研主题。它们共同构成独一无二的指纹。

引用此

Zhou, S., Li, K., & Liu, Y. (2009). Text Categorization Based on Topic Model. International Journal of Computational Intelligence Systems, 2(4), 398-409. https://doi.org/10.2991/ijcis.2009.2.4.8