Micro-blog post topic drift detection based on LDA model

Quanchao Liu, Heyan Huang, Chong Feng

科研成果: 书/报告/会议事项章节会议稿件同行评审

7 引用 (Scopus)

摘要

Micro-blog posts imply a large number of topics, which contain a lot of useful information as well as a lot of junk information making the micro-blog post topic a characteristic of high drift. The changes of micro-blog post topic over time and noises introduced with the increase of the number of micro-blog posts are two main aspects of micro-blog post topic drift. We propose a method of topic drift detection based on LDA model, using Gibbs sampling algorithm to obtain the probability distribution of micro-blog post words based on words correlation, identifying the topic boundary in dynamic constant method, extracting topic words by computing lexical information entropy in the topic field, and detecting the topic drift by topic words sequence alignment based on discrete-time model. According to the experiment on topic drift detection based on LDA model, we find our method very effective in micro-blog post topic drift detection.

源语言英语
主期刊名Behavior and Social Computing - Int. Workshop on Behavior and Social Informatics, BSI 2013 and Int. Workshop on Behavior and Social Informatics and Computing, BSIC 2013, Revised Selected Papers
出版商Springer Verlag
106-118
页数13
ISBN(印刷版)9783319040479
DOI
出版状态已出版 - 2013
活动2013 International Workshop on Behavior and Social Informatics and Computing, BSIC 2013 - Beijing, 中国
期限: 3 8月 20139 8月 2013

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
8178 LNAI
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议2013 International Workshop on Behavior and Social Informatics and Computing, BSIC 2013
国家/地区中国
Beijing
时期3/08/139/08/13

指纹

探究 'Micro-blog post topic drift detection based on LDA model' 的科研主题。它们共同构成独一无二的指纹。

引用此