DocumentCode :
3724578
Title :
Parallelization of association rule mining: Survey
Author :
Shivani Sharma;Durga Toshniwal
Author_Institution :
Dept. of Computer Science & Engineering, Indian Institute of Technology, Roorkee, Uttarakhand, India 247667
fYear :
2015
Firstpage :
1
Lastpage :
6
Abstract :
In today's big data era, modern applications generate and collect large amounts of data. As a result, data mining faces new challenges and opportunities in designing algorithms that can transform this voluminous data into actionable knowledge effectively and efficiently. Traditional algorithms were designed to run sequentially on a single machine. However, as the volume of data increases, the computational cost of processing it also increases. This makes it difficult to analyse data on a single sequential machine: instead of assisting in data analysis, the processor becomes a bottleneck. Parallel and distributed approaches improve performance in terms of computational cost and scalability, but they face limitations in load balancing, data partitioning, job assignment, monitoring, etc. MapReduce, a parallel programming model, is a newer concept that provides seemingly unlimited computing power and cheap storage, and can overcome the above limitations. This makes it a topic of growing research interest. A detailed literature review of some existing methods is given, along with their pros and cons.
Keywords :
"Sociology","Statistics","Computers","Training","Genetic programming","Conferences"
Publisher :
IEEE
Conference_Titel :
Computing, Communication and Security (ICCCS), 2015 International Conference on
Type :
conf
DOI :
10.1109/CCCS.2015.7374209
Filename :
7374209