Title :
Recommending similar items in large-scale online marketplaces
Author :
Katukuri, Jayasimha ; Konik, Tolga ; Mukherjee, Rohan ; Kolay, Santanu
Author_Institution :
Univ. of Louisiana, Lafayette, LA, USA
Abstract :
We are proposing a new similarity based recommendation system for large-scale dynamic marketplaces. Our solution consists of an offline process, which generates long-term cluster definitions grouping short-lived item listings, and an online system, which utilizes these clusters to first focus on important similarity dimensions and next conducts a trade-off between further similarity and other quality factors such as seller trustworthiness. Our system generates these clusters from several hundred millions of item listings using a large Hadoop map-reduce based system. The clusters are learned using user queries as the main information source and therefore biased towards how users conceptually group items. Our system is deployed on several eBay sites in large-scale and has increased user-engagement and business metrics compared to the previous system. We show that utilizing user queries helps capturing similarity better. We also present experiments demonstrating that adapting the ranking function, which controls the trade-off between similarity and quality, to a specific context improves recommendation performance.
Keywords :
Internet; pattern clustering; query processing; recommender systems; retail data processing; Hadoop Map-Reduce based system; business metrics; eBay sites; information source; large-scale online marketplaces; long-term cluster definitions; quality factors; ranking function; recommendation system; seller trustworthiness; similar items recommendation; user queries; user-engagement; Computational modeling; Context; Data models; Dictionaries; Feature extraction; Q-factor; Runtime; Clustering; Hadoop; Recommender systems; Similarity-based recommendations;
Conference_Titel :
Big Data (Big Data), 2014 IEEE International Conference on
Conference_Location :
Washington, DC
DOI :
10.1109/BigData.2014.7004317