SSHLDA: A semi-supervised hierarchical topic model

Xian Ling Mao, Zhao Yan Ming, Tat Seng Chua, Si Li, Hongfei Yan*, Xiaoming Li

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

61 引用 (Scopus)

摘要

Supervised hierarchical topic modeling and unsupervised hierarchical topic modeling are usually used to obtain hierarchical topics, such as hLLDA and hLDA. Supervised hierarchical topic modeling makes heavy use of the information from observed hierarchical labels, but cannot explore new topics; while unsu-pervised hierarchical topic modeling is able to detect automatically new topics in the data space, but does not make use of any information from hierarchical labels. In this paper, we propose a semi-supervised hierarchical topic model which aims to explore new topics automatically in the data space while incorporating the information from observed hierarchical labels into the modeling process, called Semi-Supervised Hierarchical Latent Dirichlet Allocation (SSHLDA). We also prove that hLDA and hLLDA are special cases of SSHLDA. We conduct experiments on Yahoo! Answers and ODP datasets, and assess the performance in terms of perplexity and clustering. The experimental results show that predictive ability of SSHLDA is better than that of baselines, and SSHLDA can also achieve significant improvement over baselines for clustering on the FScore measure.

源语言英语
主期刊名EMNLP-CoNLL 2012 - 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the Conference
800-809
页数10
出版状态已出版 - 2012
已对外发布
活动2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, EMNLP-CoNLL 2012 - Jeju Island, 韩国
期限: 12 7月 201214 7月 2012

出版系列

姓名EMNLP-CoNLL 2012 - 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the Conference

会议

会议2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, EMNLP-CoNLL 2012
国家/地区韩国
Jeju Island
时期12/07/1214/07/12

指纹

探究 'SSHLDA: A semi-supervised hierarchical topic model' 的科研主题。它们共同构成独一无二的指纹。

引用此