مرکز منطقه ای اطلاع رساني علوم و فناوري - Collaborative filtering recommendation algorithm based on Hadoop and Spark

DocumentCode :

2830859

Title :

Collaborative filtering recommendation algorithm based on Hadoop and Spark

Author :

Kupisz, Bartosz ; Unold, Olgierd

Author_Institution :

Dept. of Comput. Eng., Wroclaw Univ. of Technol., Wroclaw, Poland

fYear :

2015

fDate :

17-19 March 2015

Firstpage :

1510

Lastpage :

1514

Abstract :

The aim of this work was to develop and compare recommendation systems which use the item-based collaborative filtering algorithm, based on Hadoop and Spark. Data for the research were gathered from a real social portal the users of which can express their preferences regarding the applications on offer. The Hadoop version was implemented with the use of the Mahout library which was an element of the Hadoop ecosystem. The authors original solution was implemented with the use of the Apache Spark platform and the Scala programming language. The applied similarity measure was the Tanimoto coefficient which provides the most precise results for the available data. The initial assumptions were confirmed as the solution based on the Apache Spark platform turned out to be more efficient.

Keywords :

collaborative filtering; data handling; portals; recommender systems; Apache Spark platform; Hadoop ecosystem; Hadoop version; Mahout library; Scala programming language; Tanimoto coefficient; collaborative filtering recommendation algorithm; item-based collaborative filtering algorithm; recommendation systems; Big data; Collaboration; Computer languages; Computers; Libraries; Machine learning algorithms; Sparks;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Industrial Technology (ICIT), 2015 IEEE International Conference on

Conference_Location :

Seville

Type :

conf

DOI :

10.1109/ICIT.2015.7125310

Filename :

7125310

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2830859