DocumentCode :
2826426
Title :
Mining maximal patterns based on improved FP-tree and array technique
Author :
Wang, Huajin ; Hu, Chun´an ; Chen, Yuhuan
Author_Institution :
Sch. of Inf. Eng., Jiangxi Univ. of Sci. & Technol., Ganzhou, China
Volume :
3
fYear :
2010
fDate :
21-24 May 2010
Abstract :
Mining frequent itemsets is very important for mining association rules. However, because of the inherent complexity, mining complete frequent patterns from a dense database could be impractical, and the quantity of the mined patterns is usually very large. It is hard to understand and make use of them. Maximal frequent patterns contain and compress all frequent patterns, and the memory needed for saving them is much smaller than that needed for saving complete patterns. Thus it is greatly valuable to mine maximal frequent patterns. In this paper, the structure of a traditional FP-tree is improved and an efficient algorithm for mining maximal frequent patterns based on improved FP-tree and array technique, called IAFP-max, is presented. By introducing the concept of postfix sub-tree, the presented algorithm needn´t generate the candidate of maximal frequent patterns in mining process and therefore greatly reduces the memory consume, and it also uses an array-based technique to reduce the traverse time to the improved FP-tree. The experimental evaluation shows that this algorithm outperforms most exiting algorithms MAFIA, GenMax and FPmax.
Keywords :
data mining; pattern classification; trees (mathematics); IAFP-max; array technique; association rule; improved FP-tree technique; maximal pattern mining; Association rules; Character generation; Data mining; Educational technology; Frequency; Itemsets; Testing; Transaction databases; array technique; data mining; improved FP-tree; maximal frequent pattern;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Future Computer and Communication (ICFCC), 2010 2nd International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-5821-9
Type :
conf
DOI :
10.1109/ICFCC.2010.5497458
Filename :
5497458
Link To Document :
بازگشت