A method of optimizing LDA result purity based on semantic similarity

Zhu Jingrui, Wang Qinglin, Liu Yu, Li Yuan

科研成果: 书/报告/会议事项章节会议稿件同行评审

7 引用 (Scopus)

摘要

The result purity of traditional LDA (Latent Dirichlet Allocation) is uninterpretable because it is always difficult to summarize the meaning of each LDA result topic which contains multiple irrelevant words. To solve the problem, a method of optimizing LDA result purity based on semantic similarity in streaming news processing is proposed. In this method, the Category Cluster Density (CCD) of each topic is calculated first, and those topics with lower CCD value were dropped to optimize the overall LDA result purity. The news clustering experiment results show that the vague news can be removed effectively and the reserved topics are interpretable than traditional method, which can significant optimize the LDA result purity automatically.

源语言英语
主期刊名Proceedings - 2017 32nd Youth Academic Annual Conference of Chinese Association of Automation, YAC 2017
出版商Institute of Electrical and Electronics Engineers Inc.
361-365
页数5
ISBN(电子版)9781538629017
DOI
出版状态已出版 - 30 6月 2017
活动32nd Youth Academic Annual Conference of Chinese Association of Automation, YAC 2017 - Hefei, 中国
期限: 19 5月 201721 5月 2017

出版系列

姓名Proceedings - 2017 32nd Youth Academic Annual Conference of Chinese Association of Automation, YAC 2017

会议

会议32nd Youth Academic Annual Conference of Chinese Association of Automation, YAC 2017
国家/地区中国
Hefei
时期19/05/1721/05/17

指纹

探究 'A method of optimizing LDA result purity based on semantic similarity' 的科研主题。它们共同构成独一无二的指纹。

引用此