• DocumentCode
    1894759
  • Title

    An Efficient Close Frequent Pattern Mining Algorithm

  • Author

    Tan, Jun ; Bu, Yingyong ; Yang, Bo

  • Author_Institution
    Coll. of Mech. & Electr. Eng., Central South Univ., Changsha, China
  • Volume
    1
  • fYear
    2009
  • fDate
    10-11 Oct. 2009
  • Firstpage
    528
  • Lastpage
    531
  • Abstract
    Efficient algorithms for mining frequent itemsets are crucial for mining association rules and for other data mining tasks. FP-growth algorithm has been implemented using a prefix-tree structure, known as a FP-tree, for storing compressed frequency information. Numerous experimental results have demonstrated that the algorithm perform extremely well. But In FP-growth algorithm, two traversals of FP-tree are needed for constructing the new conditional FP-tree. In this paper we present a novel FP-array technique that greatly reduces the need to traverse FP-trees, thus obtaining significantly improved performance for FP-tree based algorithms. We then present a very effective closed frequent pattern algorithm which uses a variation of the FP-tree data structure in combination with the FP-array technique efficiently. In the algorithm, an efficient closeness-testing approach is also given for mining closed frequent itemsets. Experimental results show that the new algorithm outperform other algorithm in not only the speed of algorithms, but also their scalability.
  • Keywords
    data compression; data mining; tree data structures; FP-array technique; FP-tree based algorithm; association rule mining; close frequent pattern mining algorithm; closed FP-growth algorithm; closeness-testing approach; compressed frequency information; prefix-tree structure; traverse FP-tree data structure; Association rules; Automation; Data mining; Data structures; Educational institutions; Electronic mail; Frequency; Itemsets; Scalability; Transaction databases; Closed FP-growth algorithm; Closed FP-tree; FP-array; sparse datasets;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Computation Technology and Automation, 2009. ICICTA '09. Second International Conference on
  • Conference_Location
    Changsha, Hunan
  • Print_ISBN
    978-0-7695-3804-4
  • Type

    conf

  • DOI
    10.1109/ICICTA.2009.134
  • Filename
    5287596