Title :
A tree structure frequent pattern mining algorithm based on hybrid search strategy and bitmap
Author :
Qiao, Mei ; Yang, Liu
Author_Institution :
Tianjin Key Lab. of Intell. Comput. & Novel Software Technol., Tianjin Univ. of Technol.(TJUT), Tianjin, China
Abstract :
This paper proposes a novel vertical format-based frequent pattern mining algorithm HBMFP. HBMFP adopts a hybrid search strategy of prefix-depth-first and depth-first searches based on the correlative array, which adequately makes use of the advantages of the both searches to effectively reduce yielded candidates as the same time keeps simplicity and lower memory cost. HBMFP uses bitmaps to store the tidsets of itemsets and adopts bitmap projection to compress bitmaps so as to save the time spent in intersecting bitmaps. HBMFP can output the mining results in the sets of itemsets or in tree structure called frequent pattern tree, the later not only takes up less storage space, but also facilitates implementing efficient pattern matching and the visualization of the mining results. Analyses and experiments show that HBMFP has higher mining efficiency, less memory cost, good scalability and operability. Moreover, it can be used in parallel mining.
Keywords :
data mining; data visualisation; tree data structures; tree searching; bitmap projection; correlative array; frequent pattern tree; hybrid search strategy; mining results visualization; pattern matching; prefix-depth-first search; tree structure frequent pattern mining; vertical format-based frequent pattern mining; Association rules; Clustering algorithms; Costs; Data mining; Itemsets; Paper technology; Pattern matching; Software algorithms; Tree data structures; Visualization; bitmap; frequent itemset mining; frequent pattern tree; hybrid search strategy; prefix-depth-first search;
Conference_Titel :
Intelligent Computing and Intelligent Systems, 2009. ICIS 2009. IEEE International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-4754-1
Electronic_ISBN :
978-1-4244-4738-1
DOI :
10.1109/ICICISYS.2009.5357805