Improving top-N recommendation performance using missing data

Xiangyu Zhao*, Zhendong Niu, Kaiyi Wang, Ke Niu, Zhongqiang Liu

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

14 Citations (Scopus)

Abstract

Recommender systems are becoming increasingly important in addressing the information explosion problem. Data sparsity is a major challenge in this area: the large mass of unrated items constitutes missing data, against which there are only a few observed ratings. Most studies treat missing data as unknown information and use only the observed data to learn models and generate recommendations. However, data are not missing at random. Some ratings are missing because users chose not to rate the items, and these missing entries are negative examples of user preferences. Exploiting this information is expected to improve the performance of recommendation algorithms. Unfortunately, negative examples are mixed with unlabeled positive examples in the missing data, and the two are hard to distinguish. In this paper, we propose three schemes for utilizing the negative examples in missing data. The schemes are then combined with SVD++, a state-of-the-art matrix factorization recommendation approach, to generate recommendations. Experimental results on two real datasets show that the proposed approaches achieve better top-N performance than the baselines in both accuracy and diversity.

Original language: English
Article number: 380472
Journal: Mathematical Problems in Engineering
Volume: 2015
Publication status: Published - 2015
