مرکز منطقه ای اطلاع رساني علوم و فناوري - The k-Nearest Neighbor Algorithm Using MapReduce Paradigm

DocumentCode :

3661632

Title :

The k-Nearest Neighbor Algorithm Using MapReduce Paradigm

Author :

Prajesh P. Anchalia;Kaushik Roy

Author_Institution :

Dept. of Comput. Sci. &

fYear :

2014

Firstpage :

513

Lastpage :

518

Abstract :

Data in any form is a valuable resource but more often than not data collected in the real world is completely random and unstructured. Hence, to utilize the true potential of data as a resource we must transform it in such a manner so as to retrieve meaningful information from it. Data mining fulfills this need. Today there is not only a need for efficient data mining techniques to process large volume of data but also a need for a means to meet the computational requirements to process such huge volume of data. In this paper we implement an effective data mining technique known as the k-Nearest Neighbor method on a distributed computing environment running Apache Hadoop that uses the MapReduce paradigm to process high volume data.

Keywords :

"Data mining","Testing","Classification algorithms","Training data","Training","Distributed computing","Algorithm design and analysis"

Publisher :

ieee

Conference_Titel :

Intelligent Systems, Modelling and Simulation (ISMS), 2014 5th International Conference on

ISSN :

2166-0662

Type :

conf

DOI :

10.1109/ISMS.2014.94

Filename :

7280963

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3661632