• DocumentCode
    124172
  • Title

    Improving Collaborative Filtering Based Recommenders Using Topic Modelling

  • Author

    Wilson, James ; Chaudhury, Santanu ; Lall, Brejesh

  • Author_Institution
    R&D Dept., Flytxt Mobile Solutions Pvt. Ltd., Trivandrum, India
  • Volume
    1
  • fYear
    2014
  • fDate
    11-14 Aug. 2014
  • Firstpage
    340
  • Lastpage
    346
  • Abstract
    Standard Collaborative Filtering (CF) algorithms make use of interactions between users and items in the form of implicit or explicit ratings alone for generating recommendations. Similarity among users or items is calculated purely based on rating overlap in this case, without considering explicit properties of users or items involved, limiting their applicability in domains with very sparse rating spaces. In many domains such as movies, news or electronic commerce recommenders, considerable contextual data in text form describing item properties is available along with the rating data, which could be utilized to improve recommendation quality. In this paper, we propose a novel approach to improve standard CF based recommenders by utilizing latent Dirichlet allocation (LDA) to learn latent properties of items, expressed in terms of topic proportions, derived from their textual description. We infer user´s topic preferences or user profile in the same latent space, based on her historical ratings. While computing similarity between users, we make use of a combined similarity measure involving rating overlap as well as similarity in the latent topic space. This approach alleviates sparsity problem as it allows calculation of similarity between users even if they have not rated any items in common. Our experiments on multiple public datasets indicate that the proposed hybrid approach significantly outperforms standard User Based and Item Based CF recommenders in terms of classification accuracy metrics such as precision, recall and F-measure.
  • Keywords
    classification; collaborative filtering; recommender systems; LDA; classification accuracy metrics; collaborative filtering based recommenders; contextual data; electronic commerce recommenders; explicit ratings; implicit ratings; item latent properties; latent Dirichlet allocation; movie recommenders; news recommenders; rating data; rating overlap; similarity measure; topic modelling; user profile; user topic preferences; Collaboration; Motion pictures; Recommender systems; Standards; Testing; Training; Vectors; Collaborative Filtering; Latent Dirichlet allocation (LDA); Recommender systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2014 IEEE/WIC/ACM International Joint Conferences on
  • Conference_Location
    Warsaw
  • Type

    conf

  • DOI
    10.1109/WI-IAT.2014.54
  • Filename
    6927563