An FP-split method for fast association rules mining

Author

Lee, Chin-Feng ; Shen, Tsung-Hsien

Author_Institution

Dept. of Inf. Manage., Chaoyang Univ. of Technol., Taichung, Taiwan

fYear

2005

fDate

27-30 June 2005

Firstpage

459

Lastpage

463

Abstract

Recently, most of the studies on association rules mining focused on improving the efficiency of frequent itemsets generation. To our best knowledge, the FP-growth algorithm, which is based on the FP-tree to generate frequent itemsets is time-efficient. Currently, relevant studies are introduced to improve the FP-growth algorithm. However, they ignore the fact that the FP-tree construction may spend much time. Therefore, the goal of our research is to propose a fast algorithm called frequent pattern split, simply FP-split, for improving the process of the FP-tree construction. The proposed FP-split algorithm contains two main steps. The first step is to scan a transaction database only once for generating equivalence classes of frequent items. The second step is to sort these equivalence classes of frequent items in descending order so as to construct the FP-split tree. Through detailed experimental evaluations under various system conditions, our method shows excellent performance in terms of execution efficiency and scalability.

Keywords

data mining; equivalence classes; sorting; transaction processing; tree data structures; trees (mathematics); FP-split method; FP-tree; association rule mining; equivalence classes; frequent itemsets generation; frequent pattern; sorting; transaction database scanning; Association rules; Chaos; Data mining; Electronic mail; Information analysis; Information management; Information technology; Itemsets; Scalability; Transaction databases;

fLanguage

English

Publisher

ieee

Conference_Titel

Information Technology: Research and Education, 2005. ITRE 2005. 3rd International Conference on

Print_ISBN

0-7803-8932-8

Type

conf

DOI

10.1109/ITRE.2005.1503165

Filename

1503165