Text classification using diffusion kernel on statistical manifold

Kan Li*, Shi Bin Zhou, Yu Shu Liu

*Corresponding author for this work

Research output: Contribution to journal, Article, peer-reviewed

2 Citations (Scopus)

Abstract

This paper proposes the Dirichlet compound multinomial manifold (DCM manifold). The DCM manifold is homeomorphic and isometric to the positive sphere manifold, so the geodesic distance on the positive sphere manifold can be mapped to a geodesic distance on the DCM manifold through the pullback mapping. This yields a distance metric on the DCM manifold, on which the DCM diffusion kernel and the DCMIDF diffusion kernel are then built. The proposed algorithms are evaluated for text classification on the WebKB Top 4 and 20 Newsgroups corpora, and the experimental results show that the DCM manifold is better suited than Euclidean space to modeling the texts in these corpora. Compared with support vector machines based on the polynomial kernel and the NGD kernel, support vector machines based on the proposed DCM and DCMIDF diffusion kernels achieve higher text classification accuracy.
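
The abstract does not give the DCM mapping or the kernel formulas, so the following is only a minimal sketch of the general idea it describes: pull back the geodesic distance of the positive sphere and plug it into a diffusion (heat) kernel. It assumes the well-known multinomial-simplex construction, in which a distribution p is mapped to 2·sqrt(p) on a sphere of radius 2, and a first-order heat kernel approximation exp(-d²/(4t)) with the normalizing constant dropped. The function names, the smoothing step, and the parameter `t` are hypothetical and are not taken from the paper.

```python
import math

def normalize(counts, smoothing=1.0):
    """Turn raw term counts into a smoothed probability vector (illustrative only)."""
    total = sum(counts) + smoothing * len(counts)
    return [(c + smoothing) / total for c in counts]

def sphere_geodesic(p, q):
    """Geodesic distance between distributions p and q, pulled back from the
    positive sphere of radius 2 via the map p -> 2 * sqrt(p)."""
    # Inner product of the square-root vectors (Bhattacharyya coefficient),
    # clamped to [0, 1] to guard against floating-point rounding.
    bc = sum(math.sqrt(a * b) for a, b in zip(p, q))
    return 2.0 * math.acos(min(1.0, max(0.0, bc)))

def diffusion_kernel(p, q, t=0.5):
    """First-order approximation of the heat kernel, exp(-d^2 / (4t)).
    The normalizing constant is omitted; t is an assumed diffusion time."""
    d = sphere_geodesic(p, q)
    return math.exp(-d * d / (4.0 * t))

# Toy term-count vectors for three short documents (made-up numbers).
doc_a = normalize([5, 1, 0, 0])
doc_b = normalize([4, 2, 0, 1])
doc_c = normalize([0, 0, 6, 3])

k_ab = diffusion_kernel(doc_a, doc_b)
k_ac = diffusion_kernel(doc_a, doc_c)
```

A Gram matrix of such kernel values could be passed to any support vector machine implementation that accepts precomputed kernels; the DCM and DCMIDF kernels of the paper would replace the simple smoothed-multinomial embedding assumed here.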

Original language: English
Pages (from-to): 339-345
Number of pages: 7
Journal: Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence
Volume: 25
Issue number: 2
Publication status: Published - Apr 2012
