A unified generative model for characterizing microblogs' topics

Kun Zhuang, Heyan Huang, Xin Xin, Xiaochi Wei, Xianxiang Yang, Chong Feng, Ying Fang

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Peer-reviewed

Abstract

In this paper, we focus on characterizing the topics of microblogs using topic models. Unlike traditional textual media (such as news documents), microblogs pose three challenges for modeling: 1) too much noise; 2) short text; and 3) content incompleteness. Previous work has investigated these limitations separately: some filters the noise through a prior classification, some enriches the text with the user's blog history, and some utilizes the social network. However, none of these approaches addresses all three limitations simultaneously. To solve this problem, we combine these lines of previous work and propose a unified generative model for characterizing microblogs' topics, in which all three limitations are addressed. A collapsed Gibbs sampling method is derived for estimating the parameters. Through both qualitative and quantitative analysis on Twitter, we demonstrate that our approach consistently outperforms previous methods by a significant margin.

Original language: English
Title of host publication: Web-Age Information Management - 14th International Conference, WAIM 2013, Proceedings
Publisher: Springer Verlag
Pages: 583-594
Number of pages: 12
ISBN (Print): 9783642385612
Publication status: Published - 2013
Event: 14th International Conference on Web-Age Information Management, WAIM 2013 - Beidaihe, China
Duration: 14 Jun 2013 to 16 Jun 2013

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 7923 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 14th International Conference on Web-Age Information Management, WAIM 2013
Country/Territory: China
City: Beidaihe
Period: 14/06/13 to 16/06/13
