摘要
The two crucial issues for hot topics discovery based on online reviews are that the sparsity of short text features and the "long tail" phenomenon of hot topics. This paper focuses on these two key issues, and proposes an improved similarity calculation method to calculate the similarity of short texts, and a novel clustering algorithm based on the time factor and dynamic adjustment of comparison times to automatically discard a large number of outliers. Moreover, the validity and advancement of the new method are presented by comparative experiments using real data sets.
源语言 | 英语 |
---|---|
文章编号 | 012046 |
期刊 | Journal of Physics: Conference Series |
卷 | 1288 |
期 | 1 |
DOI | |
出版状态 | 已出版 - 15 8月 2019 |
活动 | 5th Annual International Conference on Network and Information Systems for Computers, ICNISC 2019 - Wuhan, 中国 期限: 19 4月 2019 → 20 4月 2019 |
指纹
探究 'Research on Hot Topics Discovery Based on Short Texts of Online Reviews' 的科研主题。它们共同构成独一无二的指纹。引用此
Liu, Z., Qiu, Y., Zhang, Z., Mao, L., & Zheng, X. (2019). Research on Hot Topics Discovery Based on Short Texts of Online Reviews. Journal of Physics: Conference Series, 1288(1), 文章 012046. https://doi.org/10.1088/1742-6596/1288/1/012046