• DocumentCode
    2993660
  • Title

    A Simple But Effective Stream Maximal Frequent Itemset Mining Algorithm

  • Author

    Li, Haifeng ; Zhang, Ning

  • Author_Institution
    Sch. of Inf., Central Univ. of Finance & Econ., Beijing, China
  • fYear
    2011
  • fDate
    3-4 Dec. 2011
  • Firstpage
    1268
  • Lastpage
    1272
  • Abstract
    Maximal frequent item sets are one of several condensed representations of frequent item sets, which store most of the information contained in frequent item sets using less space, thus being more suitable for stream mining. This paper focuses on mining maximal frequent item sets approximately over a stream landmark model. We separate the continuously arriving transactions into sections, and the mining results are indexed by an extended direct update tree, thus, a simple but effective algorithm named SMIS is proposed. In our algorithm, we employ the Chern off Bound to perform the maximal frequent item set mining in a false negative manner, which can reduce the memory cost, as well guarantee our algorithm conducting with an incremental fashion. Our experimental results on two synthetic datasets and two real world datasets show that SMIS achieves much reduced memory cost in comparison with the state-of-the-art algorithm with a 100 percent precision.
  • Keywords
    data mining; set theory; SMIS; chern off bound; stream landmark model; stream maximal frequent itemset mining algorithm; Approximation algorithms; Approximation methods; Data mining; Educational institutions; Indexes; Itemsets; false negative; maximal frequent itemset; stream;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Security (CIS), 2011 Seventh International Conference on
  • Conference_Location
    Hainan
  • Print_ISBN
    978-1-4577-2008-6
  • Type

    conf

  • DOI
    10.1109/CIS.2011.281
  • Filename
    6128440