DocumentCode
1904467
Title
An FP-split method for fast association rules mining
Author
Lee, Chin-Feng ; Shen, Tsung-Hsien
Author_Institution
Dept. of Inf. Manage., Chaoyang Univ. of Technol., Taichung, Taiwan
fYear
2005
fDate
27-30 June 2005
Firstpage
459
Lastpage
463
Abstract
Recently, most of the studies on association rules mining focused on improving the efficiency of frequent itemsets generation. To our best knowledge, the FP-growth algorithm, which is based on the FP-tree to generate frequent itemsets is time-efficient. Currently, relevant studies are introduced to improve the FP-growth algorithm. However, they ignore the fact that the FP-tree construction may spend much time. Therefore, the goal of our research is to propose a fast algorithm called frequent pattern split, simply FP-split, for improving the process of the FP-tree construction. The proposed FP-split algorithm contains two main steps. The first step is to scan a transaction database only once for generating equivalence classes of frequent items. The second step is to sort these equivalence classes of frequent items in descending order so as to construct the FP-split tree. Through detailed experimental evaluations under various system conditions, our method shows excellent performance in terms of execution efficiency and scalability.
Keywords
data mining; equivalence classes; sorting; transaction processing; tree data structures; trees (mathematics); FP-split method; FP-tree; association rule mining; equivalence classes; frequent itemsets generation; frequent pattern; sorting; transaction database scanning; Association rules; Chaos; Data mining; Electronic mail; Information analysis; Information management; Information technology; Itemsets; Scalability; Transaction databases;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Technology: Research and Education, 2005. ITRE 2005. 3rd International Conference on
Print_ISBN
0-7803-8932-8
Type
conf
DOI
10.1109/ITRE.2005.1503165
Filename
1503165
Link To Document