On improving a microblog ranking

Jidong Li, Xin Li, Mingming Shi, Meng Zhou, Linjing Lai

科研成果: 书/报告/会议事项章节会议稿件同行评审

1 引用 (Scopus)

摘要

Microblog ranking is a hot research topic in recent years. Most of the related works apply TF-IDF metric for calculating content similarity while neglecting their semantic similarity. And most existing search engines which retrieve the microblog list by string matching the search keywords is not competent to provide a reliable list for users when dealing with polysemy and synonym. Besides, treating all the users with same authority for all topics is intuitively not ideal. In this paper, a comprehensive strategy for microblog ranking is proposed. First, we extend the conventional TF-IDF based content similarity with exploiting knowledge from WordNet. Then, we further incorporate a new feature for microblog ranking that is the topical relation between search keyword and its retrieval. Author topical authority is also incorporated into the ranking framework as an important feature for microblog ranking. Gradient Boosting Decision Tree(GBDT), then is employed to train the ranking model with multiple features involved. We conduct thorough experiments on a large-scale real-world Twitter dataset and demonstrate that our proposed approach outperform a number of existing approaches in discovering higher quality and more related microblogs.

源语言英语
主期刊名Proceedings - 2016 IEEE 1st International Conference on Data Science in Cyberspace, DSC 2016
出版商Institute of Electrical and Electronics Engineers Inc.
268-274
页数7
ISBN(电子版)9781509011926
DOI
出版状态已出版 - 27 2月 2017
活动1st IEEE International Conference on Data Science in Cyberspace, DSC 2016 - Changsha, Hunan, 中国
期限: 13 6月 201616 6月 2016

出版系列

姓名Proceedings - 2016 IEEE 1st International Conference on Data Science in Cyberspace, DSC 2016

会议

会议1st IEEE International Conference on Data Science in Cyberspace, DSC 2016
国家/地区中国
Changsha, Hunan
时期13/06/1616/06/16

指纹

探究 'On improving a microblog ranking' 的科研主题。它们共同构成独一无二的指纹。

引用此