Title :
Improving Collaborative Filtering Based Recommenders Using Topic Modelling
Author :
Wilson, James ; Chaudhury, Santanu ; Lall, Brejesh
Author_Institution :
R&D Dept., Flytxt Mobile Solutions Pvt. Ltd., Trivandrum, India
Abstract :
Standard Collaborative Filtering (CF) algorithms rely solely on interactions between users and items, in the form of implicit or explicit ratings, to generate recommendations. In this setting, similarity among users or items is calculated purely from rating overlap, without considering explicit properties of the users or items involved, which limits the applicability of these algorithms in domains with very sparse rating spaces. In many domains, such as movie, news or electronic commerce recommenders, considerable contextual data in text form describing item properties is available along with the rating data and could be utilized to improve recommendation quality. In this paper, we propose a novel approach to improve standard CF based recommenders by utilizing latent Dirichlet allocation (LDA) to learn latent properties of items, expressed in terms of topic proportions, derived from their textual descriptions. We infer each user's topic preferences, or user profile, in the same latent space, based on her historical ratings. When computing similarity between users, we use a combined similarity measure involving rating overlap as well as similarity in the latent topic space. This approach alleviates the sparsity problem, as it allows similarity between users to be calculated even if they have not rated any items in common. Our experiments on multiple public datasets indicate that the proposed hybrid approach significantly outperforms standard user-based and item-based CF recommenders in terms of classification accuracy metrics such as precision, recall and F-measure.
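The combined similarity measure described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formulas, so everything below is an assumption made for illustration: the user profile is taken as a rating-weighted average of item topic proportions, both similarities are cosine similarities, and the two are blended linearly with a hypothetical weight `alpha`. The item topic proportions are assumed to have been inferred already by an LDA model; the function names are hypothetical.

```python
import numpy as np

def user_topic_profile(ratings, item_topics):
    """Assumed profile rule: rating-weighted average of item topic proportions.

    ratings: shape (n_items,), 0 means "not rated".
    item_topics: shape (n_items, n_topics), rows are LDA topic proportions.
    """
    total = ratings.sum()
    if total == 0:
        return np.zeros(item_topics.shape[1])
    return ratings @ item_topics / total

def cosine(a, b):
    """Cosine similarity, defined as 0.0 when either vector is all zeros."""
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0

def rating_similarity(r_u, r_v):
    """Cosine similarity over co-rated items only; 0.0 when there is no overlap."""
    mask = (r_u > 0) & (r_v > 0)
    if not mask.any():
        return 0.0
    return cosine(r_u[mask], r_v[mask])

def combined_similarity(r_u, r_v, item_topics, alpha=0.5):
    """Assumed blend: alpha * rating similarity + (1 - alpha) * topic similarity."""
    topic_sim = cosine(user_topic_profile(r_u, item_topics),
                       user_topic_profile(r_v, item_topics))
    return alpha * rating_similarity(r_u, r_v) + (1 - alpha) * topic_sim

# Two users with no co-rated items, who rated items with similar topic mixes.
item_topics = np.array([[0.9, 0.1], [0.8, 0.2], [0.1, 0.9], [0.2, 0.8]])
r_u = np.array([5.0, 0.0, 0.0, 0.0])
r_v = np.array([0.0, 4.0, 0.0, 0.0])
print(rating_similarity(r_u, r_v), combined_similarity(r_u, r_v, item_topics))
```

In this toy case the rating-overlap similarity is zero because the two users share no rated items, while the topic-space term still yields a positive combined similarity, which is the sparsity benefit the abstract describes.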
Keywords :
classification; collaborative filtering; recommender systems; LDA; classification accuracy metrics; collaborative filtering based recommenders; contextual data; electronic commerce recommenders; explicit ratings; implicit ratings; item latent properties; latent Dirichlet allocation; movie recommenders; news recommenders; rating data; rating overlap; similarity measure; topic modelling; user profile; user topic preferences; Collaboration; Motion pictures; Standards; Testing; Training; Vectors
Conference_Title :
Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2014 IEEE/WIC/ACM International Joint Conferences on
Conference_Location :
Warsaw
DOI :
10.1109/WI-IAT.2014.54