Title :
MREclat: An Algorithm for Parallel Mining Frequent Itemsets
Author :
Zhigang Zhang ; Genlin Ji ; Mengmeng Tang
Author_Institution :
Sch. of Comput. Sci. & Technol., Nanjing Normal Univ., Nanjing, China
Abstract :
Algorithm Eclat is a classical algorithm for mining frequent itemsets, which is based on vertical layout databases. It is greatly different from those algorithms based on horizontal layout databases, such as algorithm Apriori and FP-Growth. In order to improve the efficiency of mining frequent itemsets from massive datasets, parallel algorithm MREclat based on Map/Reduce framework is presented. The algorithm also overcomes the problem of memory and computational capability insufficient when mining frequent itemsets from massive datasets. In this paper, the idea of MREclat is introduced and the performance of the algorithm is studied. The experimental results show that algorithm MREclat has high scalability and good speedup.
Keywords :
data mining; parallel algorithms; parallel programming; FP-growth algorithm; MREclat algorithm; MapReduce framework; apriori algorithm; computational capability problem; horizontal layout databases; massive datasets; memory problem; parallel algorithm; parallel frequent itemset mining; vertical layout databases; Algorithm design and analysis; Association rules; Itemsets; Layout; Parallel algorithms; Eclat; Frequent Itemset Mining; Map/Reduce; Parallel Mining Algorithm;
Conference_Titel :
Advanced Cloud and Big Data (CBD), 2013 International Conference on
Conference_Location :
Nanjing
Print_ISBN :
978-1-4799-3260-3
DOI :
10.1109/CBD.2013.22