DocumentCode :
3352367
Title :
Approximate frequent itemsets compression using dynamic clustering method
Author :
Yan, Hua ; Sang, Yongsheng
Author_Institution :
Sch. of Comput. Sci. & Eng., Univ. of Electron. Sci. & Technol. of China, Chengdu
fYear :
2008
fDate :
21-24 Sept. 2008
Firstpage :
1061
Lastpage :
1066
Abstract :
Frequent-itemsets mining often faces the problem of generating a large collection of frequent itemsets, which is too large to be carefully examined and understood by the users. To reduce the output size of frequent itemsets, we propose using a dynamic clustering method to compress the frequent itemsets approximately in this paper. Concretely, two frequent itemsets intra-cluster similarities, expression similarity and support similarity, are defined according to the specific requirements of frequent itemsets compression. Based on the above two similarity measures, the frequent itemsets clustering criterion and its related clustering algorithm are developed. Specially, our method has two features: 1)users neednpsilat specify the number of frequent itemsets clusters explicitly; 2)userpsilas expectation of compression ratio is incorporated. Our initial experimental results show that our approximate frequent itemsets method is feasible and the compression quality is good.
Keywords :
data compression; data mining; pattern clustering; dynamic clustering method; expression similarity; frequent itemsets compression; frequent-itemsets mining; intra-cluster similarities; support similarity; Algorithm design and analysis; Clustering algorithms; Clustering methods; Computational intelligence; Computer science; Data mining; Greedy algorithms; Itemsets; Laboratories; Transaction databases;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cybernetics and Intelligent Systems, 2008 IEEE Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-1673-8
Electronic_ISBN :
978-1-4244-1674-5
Type :
conf
DOI :
10.1109/ICCIS.2008.4670945
Filename :
4670945
Link To Document :
بازگشت