• DocumentCode
    1904467
  • Title

    An FP-split method for fast association rules mining

  • Author

    Lee, Chin-Feng ; Shen, Tsung-Hsien

  • Author_Institution
    Dept. of Inf. Manage., Chaoyang Univ. of Technol., Taichung, Taiwan
  • fYear
    2005
  • fDate
    27-30 June 2005
  • Firstpage
    459
  • Lastpage
    463
  • Abstract
    Recently, most of the studies on association rules mining focused on improving the efficiency of frequent itemsets generation. To our best knowledge, the FP-growth algorithm, which is based on the FP-tree to generate frequent itemsets is time-efficient. Currently, relevant studies are introduced to improve the FP-growth algorithm. However, they ignore the fact that the FP-tree construction may spend much time. Therefore, the goal of our research is to propose a fast algorithm called frequent pattern split, simply FP-split, for improving the process of the FP-tree construction. The proposed FP-split algorithm contains two main steps. The first step is to scan a transaction database only once for generating equivalence classes of frequent items. The second step is to sort these equivalence classes of frequent items in descending order so as to construct the FP-split tree. Through detailed experimental evaluations under various system conditions, our method shows excellent performance in terms of execution efficiency and scalability.
  • Keywords
    data mining; equivalence classes; sorting; transaction processing; tree data structures; trees (mathematics); FP-split method; FP-tree; association rule mining; equivalence classes; frequent itemsets generation; frequent pattern; sorting; transaction database scanning; Association rules; Chaos; Data mining; Electronic mail; Information analysis; Information management; Information technology; Itemsets; Scalability; Transaction databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Technology: Research and Education, 2005. ITRE 2005. 3rd International Conference on
  • Print_ISBN
    0-7803-8932-8
  • Type

    conf

  • DOI
    10.1109/ITRE.2005.1503165
  • Filename
    1503165