Online topic evolution modeling based on hierarchical dirichlet process

Tao Ma, Dacheng Qu, Rui Ma, Wei Feng, Kan Li

科研成果: 书/报告/会议事项章节会议稿件同行评审

5 引用 (Scopus)

摘要

This paper presents a model based on Hierarchical Dirichlet Process (HDP), that automatically captures the evolutionary thematic patterns in texts. Our approach allows HDP to work in an online fashion, such that it can build an up-todate model for new documents given the old model, without accessing historic data. Since exact calculation is infeasible, we turn to Gibbs sampling to carry out approximate posterior inference. After the topics are found, we can analyze the evolution relationships between time-adjacent topics. Experiments on a real world dataset (Reuters-21578) validate the effectiveness of the model quantitatively, showing its advantage over both OLDA and plain HDP in modeling topic evolution.

源语言英语
主期刊名Proceedings - 2016 IEEE 1st International Conference on Data Science in Cyberspace, DSC 2016
出版商Institute of Electrical and Electronics Engineers Inc.
400-405
页数6
ISBN(电子版)9781509011926
DOI
出版状态已出版 - 27 2月 2017
活动1st IEEE International Conference on Data Science in Cyberspace, DSC 2016 - Changsha, Hunan, 中国
期限: 13 6月 201616 6月 2016

出版系列

姓名Proceedings - 2016 IEEE 1st International Conference on Data Science in Cyberspace, DSC 2016

会议

会议1st IEEE International Conference on Data Science in Cyberspace, DSC 2016
国家/地区中国
Changsha, Hunan
时期13/06/1616/06/16

指纹

探究 'Online topic evolution modeling based on hierarchical dirichlet process' 的科研主题。它们共同构成独一无二的指纹。

引用此