Title :
Mining top-k frequent closed patterns without minimum support
Author :
Han, Jiawei ; Wang, Jianyong ; Lu, Ying ; Tzvetkov, Petre
Author_Institution :
Illinois Univ., Urbana, IL, USA
Abstract :
In this paper, we propose a new mining task: mining top-k frequent closed patterns of length no less than min_ℓ, where k is the desired number of frequent closed patterns to be mined, and min_ℓ is the minimal length of each pattern. An efficient algorithm, called TFP, is developed for mining such patterns without minimum support. Two methods, closed-node-count and descendant-sum are proposed to effectively raise support threshold and prune FP-tree both during and after the construction of FP-tree. During the mining process, a novel top-down and bottom-up combined FP-tree mining strategy is developed to speed-up support-raising and closed frequent pattern discovering. In addition, a fast hash-based closed pattern verification scheme has been employed to check efficiently if a potential closed pattern is really closed. Our performance study shows that in most cases, TFP outperforms CLOSET and CHARM, two efficient frequent closed pattern mining algorithms, even when both are running with the best tuned min-support. Furthermore, the method can be extended to generate association rules and to incorporate user-specified constraints.
Keywords :
data mining; trees (mathematics); FP-tree pruning; TFP; closed frequent pattern discovery; closed-node-count method; descendant-sum method; efficient algorithm; fast hash-based closed pattern verification scheme; minimum support; support threshold; support-raising; top-k frequent closed pattern mining; Association rules; Data mining; Transaction databases;
Conference_Titel :
Data Mining, 2002. ICDM 2003. Proceedings. 2002 IEEE International Conference on
Print_ISBN :
0-7695-1754-4
DOI :
10.1109/ICDM.2002.1183905