基于改进LDA的在线医疗评论主题挖掘

Translated title of the contribution: Identifying Topics of Online Healthcare Reviews Based on Improved LDA

Hui Ying Gao, Jia Wei Liu, Shu Xin Yang

    Research output: Contribution to journalArticlepeer-review

    17 Citations (Scopus)

    Abstract

    An in-depth research was conducted on the use of topic models to identify the topics of healthcare services. In view of semantic sparseness and the lack of co-occurrence information in the special extraction of healthcare reviews in the LDA topic model, a CO-LDA model was proposed based on word co-occurrence analysis combined with LDA topic model. Firstly, the word co-occurrence analysis method was used to analyze the corpus of the review and the word co-occurrence matrix was obtained. Secondly, the LDA topic model was used to represent corpus reviews, and then the hierarchical clustering algorithm was used to classify the features. Finally, patients' focus on healthcare service quality factors was identified. Based on the average minimum JS distance, the average Kendall correlation coefficient and the average TF-IDF, in this paper the CO-LDA model was compared with the traditional LDA model. The experiment finally shows that the recognition theme consistency of CO-LDA model is better than that of the LDA model. Through the comparison of the experimental results with the "Hospital Evaluation Standards" in China, it is found that the consistency of the former was high, which explains the effectiveness of the CO-LDA-based online medical review topic mining method.

    Translated title of the contributionIdentifying Topics of Online Healthcare Reviews Based on Improved LDA
    Original languageChinese (Traditional)
    Pages (from-to)427-434
    Number of pages8
    JournalBeijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology
    Volume39
    Issue number4
    DOIs
    Publication statusPublished - 1 Apr 2019

    Fingerprint

    Dive into the research topics of 'Identifying Topics of Online Healthcare Reviews Based on Improved LDA'. Together they form a unique fingerprint.

    Cite this