• Title of article

    DBV-Miner: A Dynamic Bit-Vector approach for fast mining frequent closed itemsets

  • Author/Authors

    Vo، نويسنده , , Bay and Hong، نويسنده , , Tzung-Pei and Le، نويسنده , , Bac، نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2012
  • Pages
    11
  • From page
    7196
  • To page
    7206
  • Abstract
    Frequent closed itemsets (FCI) play an important role in pruning redundant rules fast. Therefore, a lot of algorithms for mining FCI have been developed. Algorithms based on vertical data formats have some advantages in that they require scan databases once and compute the support of itemsets fast. Recent years, BitTable (Dong & Han, 2007) and IndexBitTable (Song, Yang, & Xu, 2008) approaches have been applied for mining frequent itemsets and results are significant. However, they always use a fixed size of Bit-Vector for each item (equal to number of transactions in a database). It leads to consume more memory for storage Bit-Vectors and the time for computing the intersection among Bit-Vectors. Besides, they only apply for mining frequent itemsets, algorithm for mining FCI based on BitTable is not proposed. This paper introduces a new method for mining FCI from transaction databases. Firstly, Dynamic Bit-Vector (DBV) approach will be presented and algorithms for fast computing the intersection between two DBVs are also proposed. Lookup table is used for fast computing the support (number of bits 1 in a DBV) of itemsets. Next, subsumption concept for memory and computing time saving will be discussed. Finally, an algorithm based on DBV and subsumption concept for mining frequent closed itemsets fast is proposed. We compare our method with CHARM, and recognize that the proposed algorithm is more efficient than CHARM in both the mining time and the memory usage.
  • Keywords
    BitTable , Dynamic Bit-Vector , DATA MINING , Frequent closed itemsets , Vertical data format
  • Journal title
    Expert Systems with Applications
  • Serial Year
    2012
  • Journal title
    Expert Systems with Applications
  • Record number

    2351914