DocumentCode :
467801
Title :
Mining Frequent Closed Itemsets in Large Databases by Hierarchical Partitioning
Author :
Tseng, Fan-chen
Author_Institution :
Kainan Univ., Taoyuan
Volume :
4
fYear :
2007
fDate :
19-22 Aug. 2007
Firstpage :
1832
Lastpage :
1837
Abstract :
The mining of frequent itemsets has been extensively studied in data mining, and many methods have been proposed for this problem. However, mining all the frequent itemsets will lead to a huge number of itemsets and numerous redundant association rules. Fortunately, this problem can be cured by mining only frequent closed itemsets (FCIs), which results in a much smaller number of itemsets. Nevertheless, it is still difficult to find FCIs when the database becomes too large to allow a memory-resident representation. In this paper, a methodology called hierarchical partitioning is proposed for dividing the database into a set of multi-leveled sub-databases of manageable sizes to fit into memory. The advantage of hierarchical partitioning is that the FCIs can be found directly from sub-databases without rescanning the original database for support and subset checking.
Keywords :
data mining; database management systems; data mining; frequent closed itemset mining; hierarchical partitioning; memory-resident representation; multileveled subdatabases; redundant association rules; subset checking; Association rules; Cybernetics; Data mining; Data structures; Electronic commerce; Electronic mail; Frequency; Itemsets; Machine learning; Transaction databases; Frequent closed itemset; Frequent pattern list; Hierarchical partitioning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics, 2007 International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-0973-0
Electronic_ISBN :
978-1-4244-0973-0
Type :
conf
DOI :
10.1109/ICMLC.2007.4370446
Filename :
4370446
Link To Document :
بازگشت