DocumentCode :
2507707
Title :
Generalized closed itemsets for association rule mining
Author :
Pudi, Vikram ; Haritsa, Jayant R.
Author_Institution :
Database Syst. Lab, Indian Inst. of Sci., Bangalore, India
fYear :
2003
fDate :
5-8 March 2003
Firstpage :
714
Lastpage :
716
Abstract :
The output of Boolean association rule mining algorithms is often too large for manual examination. For dense datasets, it is often impractical to even generate all frequent itemsets. The closed itemset approach handles this information overload by pruning "uninteresting" rules following the observation that most rules can be derived from other rules. We propose a new framework, namely, the generalized closed (or g-closed) itemset framework. By allowing for a small tolerance in the accuracy of itemset supports, we show that the number of such redundant rules is far more than what was previously estimated. Our scheme can be integrated into both levelwise algorithms (Apriori) and two-pass algorithms (ARMOR). We evaluate its performance by measuring the reduction in output size as well as in response time. Our experiments show that incorporating g-closed itemsets provides significant performance improvements on a variety of databases.
Keywords :
data integrity; data mining; database management systems; ARMOR algorithm; Apriori algorithm; association rule mining; data integration; databases; generalized closed itemsets; levelwise algorithms; output size; redundant rules; response time; two-pass algorithms; Association rules; Data mining; Database systems; Delay; Identity management systems; Itemsets; Robustness; Size measurement; Time measurement; Transaction databases;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2003. Proceedings. 19th International Conference on
Print_ISBN :
0-7803-7665-X
Type :
conf
DOI :
10.1109/ICDE.2003.1260845
Filename :
1260845
Link To Document :
بازگشت