DocumentCode
402886
Title
An equivalence-class-based algorithm for the maximal number of candidate itemsets
Author
Wang, Yun-Lan ; Li, Zeng-Zhi ; Qu, Ke-Wen
Author_Institution
Dept. of Comput. Sci. & Technol., Xi´´an Jiaotong Univ., China
Volume
1
fYear
2003
fDate
2-5 Nov. 2003
Firstpage
272
Abstract
Mining association rules is one of the most important problems in the field of data mining. To device an algorithm that can reduce the number of database scans and without taking the risk of getting a combinatorial explosion of the number of candidate itemsets, we must study what is the maximal number of candidate itemsets that can be generated. In this paper, the theory of itemset equivalence class is proposed. The property of itemset equivalence class is explored and some useful lemmas are presented. Based on the foregoing theory and a priori property, we derive some theorems about the maximal number of candidate itemsets. Based on these theorems, we device an algorithm EC for calculating the maximal number of candidate itemsets, Furthermore, the performance study shows that algorithm EC is more accurate than algorithm KK and the cost for computing the maximal number of negligible compared to the cost of the complete algorithm for association rules.
Keywords
data mining; database theory; set theory; candidate itemsets; data mining; database scans; equivalence-class-based algorithm; itemset equivalence class; mining association rules; Association rules; Computer science; Costs; Data mining; Explosions; Itemsets; Partitioning algorithms; Switches; Transaction databases; Upper bound;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2003 International Conference on
Print_ISBN
0-7803-8131-9
Type
conf
DOI
10.1109/ICMLC.2003.1264485
Filename
1264485
Link To Document