• DocumentCode
    3286317
  • Title

    A New Fast Frequent Itemsets Mining Algorithm Based on Forest

  • Author

    Hu, Jian ; Yang-Li, Xiang

  • Author_Institution
    Sch. of Manage., Harbin Inst. of Technol., Harbin
  • Volume
    2
  • fYear
    2008
  • fDate
    18-20 Oct. 2008
  • Firstpage
    551
  • Lastpage
    555
  • Abstract
    Frequent itemsets mining is an important data mining task. Until now there are many algorithms to mine frequent itemsets, this paper emphatically analyses existing algorithms´ memory structure of transaction database and realization skill, as well as the usage of memory. On the basis, two new data structures UFP-Tree and FP-Forest are designed, which use multi-trees structure to store data and solve the single FP-tree storage large scale database difficulty problem. At the same time, a fast frequent itemsets mining algorithm, F-Fminer is presented, which adopts divide and conquer strategy to mine frequent itemsets for every UFP-Tree with deepth-first searching method, then performs right shift combination operation for the branches in UFP-Tree. In order to reduce the time spending of allocation and deallocation memory, this paper also designed a high performance memory manager. According to experimentation on real data sets, the algorithm has greatly enhanced frequent itemsets mining efficiency.
  • Keywords
    data mining; tree data structures; F-Fminer; FP-Forest; UFP-Tree; allocation memory; data mining; data structures; deallocation memory; deepth-first searching method; fast frequent itemsets mining algorithm; Algorithm design and analysis; Conference management; Costs; Data mining; Data structures; Itemsets; Knowledge management; Memory management; Technology management; Transaction databases; Data Mining; Frequent Itemsets; UFP-Tree;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery, 2008. FSKD '08. Fifth International Conference on
  • Conference_Location
    Shandong
  • Print_ISBN
    978-0-7695-3305-6
  • Type

    conf

  • DOI
    10.1109/FSKD.2008.214
  • Filename
    4666177