DocumentCode :
2924320
Title :
A novel frequent pattern mining algorithm for very large databases in cloud computing environments
Author :
Lin, Kawuu W. ; Chen, Pei-Ling ; Chang, Weng-Long
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Kaohsiung Univ. of Appl. Sci., Kaohsiung, Taiwan
fYear :
2011
fDate :
8-10 Nov. 2011
Firstpage :
399
Lastpage :
403
Abstract :
FP-growth is the most famous algorithm for discovering frequent patterns. As the database size growths or the minimum support decreases, however, both of the memory requirement and execution time increase greatly. Many researchers tried to solve this problem by utilizing distributed computing techniques to improve the scalability and execution efficiency. In this paper, we propose a method for discovering frequent patterns from very large database in cloud computing environments. To build the whole FP-Tree, we use the disk as the secondary memory. Because the disk access is much slower than main memory, an efficient data structure for storing and retrieving FP-Tree from disk is also proposed. Through empirical evaluations on various simulation conditions, the proposed method delivers excellent performance in terms of scalability and execution time.
Keywords :
cloud computing; data mining; trees (mathematics); very large databases; FP-Tree; FP-growth; cloud computing environments; data structure; distributed computing techniques; novel frequent pattern mining algorithm; very large databases; Cloud computing; Data mining; Itemsets; Kernel; Memory management; Scalability; Clustering; Data Mining; Distributed Computing formatting; Frequent Pattern Mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Granular Computing (GrC), 2011 IEEE International Conference on
Conference_Location :
Kaohsiung
Print_ISBN :
978-1-4577-0372-0
Type :
conf
DOI :
10.1109/GRC.2011.6122630
Filename :
6122630
Link To Document :
بازگشت