Text classification based on a novel Bayesian hierarchical model

Shibin Zhou*, Kan Li, Yushu Liu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In the text literature, many Bayesian generative models were proposed to represent documents and words in order to process text effectively and accurately. As the most popular one of these models, Latent Dirichlet Allocation Model(LDA) did great job in dimensionality reduction for document classification. In this paper, inspiring by Latent Dirichlet Allocation Model, we propose LDCM or Latent Dirichlet Category Model for text classification rather than dimensionality reduction. LDCM estimate parameters of models by variational inference and use variational parameters to estimate maximum a posteriori of terms. As demonstrated by our experimental results, we report satisfactory categorization performances about our method on various real-world text documents.

Original languageEnglish
Title of host publicationProceedings - 5th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2008
Pages218-221
Number of pages4
DOIs
Publication statusPublished - 2008
Event5th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2008 - Jinan, Shandong, China
Duration: 18 Oct 200820 Oct 2008

Publication series

NameProceedings - 5th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2008
Volume2

Conference

Conference5th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2008
Country/TerritoryChina
CityJinan, Shandong
Period18/10/0820/10/08

Fingerprint

Dive into the research topics of 'Text classification based on a novel Bayesian hierarchical model'. Together they form a unique fingerprint.

Cite this