DocumentCode :
1688176
Title :
Efficiently mining maximal frequent sets for discovering association rules
Author :
Srikumar, Krishnamoorthy ; Bhasker, Bharat
Author_Institution :
Indian Inst. of Manage., Lucknow, India
fYear :
2003
Firstpage :
104
Lastpage :
110
Abstract :
We present Metamorphosis, an algorithm for mining maximal frequent sets (MFS) using data transformations. Metamorphosis efficiently transforms the dataset to maximum collapsible and compressible (MC2) format and employs a top down strategy with phased bottom up search for mining MFS. Using the chess and connect dataset [benchmark datasets created by Univ. of California, Irvine], we demonstrate that our algorithm offers better performance in mining MFS compared to dGenMax (an algorithm that offers better performance compared to other known algorithms) at higher support levels. Furthermore, we evaluate our algorithm for mining Top-K maximal frequent sets in chess and connect datasets. Our algorithm is especially efficient when the maximal frequent sets are longer.
Keywords :
associative processing; data mining; data structures; pattern recognition; program verification; tree searching; MC2 format; MFS mining; Top-K maximal frequent set; algorithm evaluation; association rule discovery; benchmark dataset; chess and connect dataset; dGenMax; data transformation; dataset transformation; depth first searching; maximal frequent set mining; maximum collapsible and compressible format; metamorphosis algorithm; phased bottom up searching; top down strategy; Association rules; Data engineering; Data mining; Electronic mail; Frequency; Itemsets; Transaction databases;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Database Engineering and Applications Symposium, 2003. Proceedings. Seventh International
ISSN :
1098-8068
Print_ISBN :
0-7695-1981-4
Type :
conf
DOI :
10.1109/IDEAS.2003.1214916
Filename :
1214916
Link To Document :
بازگشت