• Title of article

    Efficient single-pass frequent pattern mining using a prefix-tree

  • Author/Authors

    Syed Khairuzzaman Tanbeer، نويسنده , , Chowdhury Farhan Ahmed، نويسنده , , Byeong-Soo Jeong، نويسنده , , Young-Koo Lee، نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2009
  • Pages
    25
  • From page
    559
  • To page
    583
  • Abstract
    The FP-growth algorithm using the FP-tree has been widely studied for frequent pattern mining because it can dramatically improve performance compared to the candidate generation-and-test paradigm of Apriori. However, it still requires two database scans, which are not consistent with efficient data stream processing. In this paper, we present a novel tree structure, called CP-tree (compact pattern tree), that captures database information with one scan (insertion phase) and provides the same mining performance as the FP-growth method (restructuring phase). The CP-tree introduces the concept of dynamic tree restructuring to produce a highly compact frequency-descending tree structure at runtime. An efficient tree restructuring method, called the branch sorting method, that restructures a prefix-tree branch-by-branch, is also proposed in this paper. Moreover, the CP-tree provides full functionality for interactive and incremental mining. Extensive experimental results show that the CP-tree is efficient for frequent pattern mining, interactive, and incremental mining with a single database scan.
  • Keywords
    DATA MINING , Frequent pattern , Association Rule , Incremental mining , Interactive mining
  • Journal title
    Information Sciences
  • Serial Year
    2009
  • Journal title
    Information Sciences
  • Record number

    1213522