TY - GEN
T1 - Distributed collaborative filtering recommendation model based on two-phase similarity
AU - Wang, C. Q.
AU - Su, H. Y.
AU - Zhu, Y.
AU - Li, F. X.
AU - Yan, B.
N1 - Publisher Copyright:
© 2015 Taylor & Francis Group, London.
PY - 2015
Y1 - 2015
N2 - The recommendation system based on collaborative filtering is one of the most popular recommendation mechanisms. However, with the continuous expansion of the system, several problems that traditional Collaborative Filtering recommendation algorithm (CF) faced such as cold startup, accuracy, and scalability are worsened. In order to address these issues, a Distributed Collaborative Filtering recommendation model based on Two-Phase similarity (DCF-TP) is proposed. DCF-TP is based on Weighted Distance Similarity Measure (WDSM), a new measure created in this paper. According to WDSM, the similarity of the users is calculated and the similarity matrix of users is obtained, meanwhile, in line with the co-occurrence matrix method, the similarity of items is counted, getting the co-occurrence matrix of the items. With regard to the similarity matrix of users, their preferences are endowed with weights and the new preferences matrix of users is received. Besides, on the basis of the co-occurrence matrix of the items and the new preferences matrix of users, the nearest neighbor item is found and a more accurate recommendation to the target user is given. Furthermore, in terms of the parallel computing framework, the distributed implementation of DCF-TP is completed. All these experiments are done on MovieLens dataset. The results show that DCF-TP overcomes the problem of cold startup and has a qualitative leap both in the aspects of precision and recall ratio. With the increasing numbers of the computing nodes, the distributed algorithm has achieved higher linear speedup.
AB - The recommendation system based on collaborative filtering is one of the most popular recommendation mechanisms. However, with the continuous expansion of the system, several problems that traditional Collaborative Filtering recommendation algorithm (CF) faced such as cold startup, accuracy, and scalability are worsened. In order to address these issues, a Distributed Collaborative Filtering recommendation model based on Two-Phase similarity (DCF-TP) is proposed. DCF-TP is based on Weighted Distance Similarity Measure (WDSM), a new measure created in this paper. According to WDSM, the similarity of the users is calculated and the similarity matrix of users is obtained, meanwhile, in line with the co-occurrence matrix method, the similarity of items is counted, getting the co-occurrence matrix of the items. With regard to the similarity matrix of users, their preferences are endowed with weights and the new preferences matrix of users is received. Besides, on the basis of the co-occurrence matrix of the items and the new preferences matrix of users, the nearest neighbor item is found and a more accurate recommendation to the target user is given. Furthermore, in terms of the parallel computing framework, the distributed implementation of DCF-TP is completed. All these experiments are done on MovieLens dataset. The results show that DCF-TP overcomes the problem of cold startup and has a qualitative leap both in the aspects of precision and recall ratio. With the increasing numbers of the computing nodes, the distributed algorithm has achieved higher linear speedup.
KW - Collaborative filtering
KW - Data mining
KW - Distributed applications
KW - Double similarity
UR - http://www.scopus.com/inward/record.url?scp=84940500140&partnerID=8YFLogxK
U2 - 10.1201/b18049-29
DO - 10.1201/b18049-29
M3 - Conference contribution
AN - SCOPUS:84940500140
SN - 9781138026537
T3 - Future Communication, Information and Computer Science - Proceedings of the International Conference on Future Communication, Information and Computer Science, FCICS 2014
SP - 123
EP - 126
BT - Future Communication, Information and Computer Science - Proceedings of the International Conference on Future Communication, Information and Computer Science, FCICS 2014
A2 - Zheng, Dawei
PB - CRC Press/Balkema
T2 - International Conference on Future Communication, Information and Computer Science, FCICS 2014
Y2 - 22 May 2014 through 23 May 2014
ER -