Title :
Searching for closed itemset with formal concept analysis
Author :
Qi, Hong ; Liu, Da-you ; Hu, Cheng-Quan ; Lu, Ming ; Zhao, Liang
Author_Institution :
Coll. of Comput. Sci. & Technol., Jilin Univ., Changchun, China
Abstract :
The problem of mining frequent patterns plays an essential role in many important data mining tasks. However, it often generates a very large number of frequent itemsets. The set of frequent closed itemsets uniquely determines the exact frequency of all itemsets, yet it can be orders of magnitude smaller than the set of all frequent itemsets. An algorithm Closearcher based on formal concept analysis for closed itemset searching is proposed. This algorithm divides the whole search space of closed itemsets into several subspaces in accordance with criterions prescribed ahead, and introduces an efficient scheme to recognize the valid ones, in which the search for closed itemsets is bounded. An intermediate structure is employed to judge the validity of a subspace and search closed itemsets more efficiently. The algorithm is experimental evaluated and compared with the famous NextClosure algorithm proposed by Ganter for random generated data, as well as for real application data. The results show that our algorithm performs much better than the later.
Keywords :
data mining; pattern recognition; set theory; Closearcher; NextClosure algorithm; closed itemset searching; formal concept analysis; frequent patterns mining; Algorithm design and analysis; Computer science; Data mining; Databases; Educational institutions; Educational technology; Frequency; Itemsets; Machine learning algorithms; Pattern analysis;
Conference_Titel :
Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
Print_ISBN :
0-7803-8403-2
DOI :
10.1109/ICMLC.2004.1382381