High-order concept associations mining and inferential language modeling for online review spam detection

C. L. Lai*, K. Q. Xu, Raymond Y.K. Lau, Yuefeng Li, Dawei Song

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

18 Citations (Scopus)

Abstract

Despite many incidents about fake online consumer reviews have been reported, very few studies have been conducted to date to examine the trustworthiness of online consumer reviews. One of the reasons is the lack of an effective computational method to separate the untruthful reviews (i.e., spam) from the legitimate ones (i.e., ham) given the fact that prominent spam features are often missing in online reviews. The main contribution of our research work is the development of a novel review spam detection method which is underpinned by an unsupervised inferential language modeling framework. Another contribution of this work is the development of a high-order concept association mining method which provides the essential term association knowledge to bootstrap the performance for untruthful review detection. Our experimental results confirm that the proposed inferential language model equipped with high-order concept association knowledge is effective in untruthful review detection when compared with other baseline methods.

Original languageEnglish
Title of host publicationProceedings - 10th IEEE International Conference on Data Mining Workshops, ICDMW 2010
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1120-1127
Number of pages8
ISBN (Print)9780769542577, 9780769542577
DOIs
Publication statusPublished - 2010
Externally publishedYes

Publication series

NameProceedings - IEEE International Conference on Data Mining, ICDM
ISSN (Print)1550-4786

Keywords

  • Kullback-leibler divergence
  • Language modeling
  • Review spam
  • Spam detection
  • Text mining

Fingerprint

Dive into the research topics of 'High-order concept associations mining and inferential language modeling for online review spam detection'. Together they form a unique fingerprint.

Cite this