Online topic evolution modeling based on hierarchical dirichlet process

Tao Ma, Dacheng Qu, Rui Ma, Wei Feng, Kan Li

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Citations (Scopus)

Abstract

This paper presents a model based on Hierarchical Dirichlet Process (HDP), that automatically captures the evolutionary thematic patterns in texts. Our approach allows HDP to work in an online fashion, such that it can build an up-todate model for new documents given the old model, without accessing historic data. Since exact calculation is infeasible, we turn to Gibbs sampling to carry out approximate posterior inference. After the topics are found, we can analyze the evolution relationships between time-adjacent topics. Experiments on a real world dataset (Reuters-21578) validate the effectiveness of the model quantitatively, showing its advantage over both OLDA and plain HDP in modeling topic evolution.

Original languageEnglish
Title of host publicationProceedings - 2016 IEEE 1st International Conference on Data Science in Cyberspace, DSC 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages400-405
Number of pages6
ISBN (Electronic)9781509011926
DOIs
Publication statusPublished - 27 Feb 2017
Event1st IEEE International Conference on Data Science in Cyberspace, DSC 2016 - Changsha, Hunan, China
Duration: 13 Jun 201616 Jun 2016

Publication series

NameProceedings - 2016 IEEE 1st International Conference on Data Science in Cyberspace, DSC 2016

Conference

Conference1st IEEE International Conference on Data Science in Cyberspace, DSC 2016
Country/TerritoryChina
CityChangsha, Hunan
Period13/06/1616/06/16

Fingerprint

Dive into the research topics of 'Online topic evolution modeling based on hierarchical dirichlet process'. Together they form a unique fingerprint.

Cite this