A hot topic detection method for Chinese Microblog based on topic words

Jun Zheng*, Yuanjun Li

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Citations (Scopus)

Abstract

Microblog is a kind of new network medium which sprang up quickly. Detection and tracking of hot topics through Microblog has attracted wide attentions from scholars at home and abroad in recent years. The algorithm which aims at finding topics in long text messages such as in traditional news websites and blogs, etc. can't effectively be used in disposing the Microblog data with a property of sparseness. This paper contributes a method, which aims to identify hot topics in Microblog based on the topic words. This method, throughpre-treating the Microblog data and dividing the time-window, extracts topic words in the Microblog data according to the two factors of increasing rate of word frequency and relative word frequency from Microblog data in every time-window. And then extracts and clusters the topic words according to the similarity among them, sieving for a suitable cluster of topic words so as to describe the hot topic and realize the detection of hot topic in Microblog. Through experimental verification, this method can improve the efficiency of detection to a certain extent, and raise the recall ratio and the precision ratio, so as to find hot topic in Microblog effectively and timely.

Original languageEnglish
Title of host publicationProceedings of 2nd International Conference on Information Technology and Electronic Commerce, ICITEC 2014
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages262-266
Number of pages5
ISBN (Electronic)9781479952984
DOIs
Publication statusPublished - 11 May 2014
Event2nd International Conference on Information Technology and Electronic Commerce, ICITEC 2014 - Dalian, China
Duration: 20 Dec 201421 Dec 2014

Publication series

NameProceedings of 2nd International Conference on Information Technology and Electronic Commerce, ICITEC 2014

Conference

Conference2nd International Conference on Information Technology and Electronic Commerce, ICITEC 2014
Country/TerritoryChina
CityDalian
Period20/12/1421/12/14

Keywords

  • Microblog
  • TDT
  • clustering algorithm
  • hot topic
  • topic
  • word

Fingerprint

Dive into the research topics of 'A hot topic detection method for Chinese Microblog based on topic words'. Together they form a unique fingerprint.

Cite this