Title :
Improve Frequent Closed Itemsets Mining over Data Stream with Bitmap
Author :
Li, Haifeng ; Chen, Hong
Author_Institution :
Sch. of Inf., Renmin Univ. of China, Beijing
Abstract :
Frequent itemset mining is an important problem in data mining. Frequent closed itemset mining provides complete yet condensed information for frequent pattern analysis, and thus reduces memory cost without loss of accuracy. As stream applications become more widespread, more research is focusing on stream mining. A stream is fast and unbounded, so its data must be stored in limited memory; saving running time and memory usage is therefore the most important goal. In this paper, we propose CLIMB (closed itemset mining with bitmap), an improved frequent closed itemset mining method over a stream's sliding window that extends the traditional stream mining algorithm CFI-stream with bitmap coding. The distinct items are maintained in memory in lexicographic order, and each itemset is coded as a bit-sequence following that item order; moreover, the bit-sequence is split into sections that are recoded to reduce memory cost. Experimental results on real-life data show that the CLIMB algorithm is effective and efficient.
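The bitmap coding described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how sections are recoded, so the sketch assumes a fixed section width (the hypothetical `SECTION_BITS`) and simply drops all-zero sections. The helper names `build_index`, `encode`, `split_sections`, `join_sections`, and `is_subset` are likewise invented for illustration.

```python
from typing import Dict, Iterable

SECTION_BITS = 8  # assumed section width; the abstract does not specify one


def build_index(items: Iterable[str]) -> Dict[str, int]:
    """Map each distinct item to a bit position, in lexicographic order."""
    return {item: pos for pos, item in enumerate(sorted(set(items)))}


def encode(itemset: Iterable[str], index: Dict[str, int]) -> int:
    """Encode an itemset as an integer whose bit i is set when item i is present."""
    bits = 0
    for item in itemset:
        bits |= 1 << index[item]
    return bits


def split_sections(bits: int, section_bits: int = SECTION_BITS) -> Dict[int, int]:
    """Split a bit-sequence into fixed-width sections, keeping only non-zero ones.

    Dropping empty sections is an assumed recoding scheme, chosen because
    itemsets over a large item universe are usually sparse.
    """
    mask = (1 << section_bits) - 1
    sections: Dict[int, int] = {}
    section_no = 0
    while bits:
        chunk = bits & mask
        if chunk:
            sections[section_no] = chunk
        bits >>= section_bits
        section_no += 1
    return sections


def join_sections(sections: Dict[int, int], section_bits: int = SECTION_BITS) -> int:
    """Rebuild the full bit-sequence from its stored sections."""
    bits = 0
    for section_no, chunk in sections.items():
        bits |= chunk << (section_no * section_bits)
    return bits


def is_subset(a: int, b: int) -> bool:
    """Return True when every item encoded in a is also encoded in b."""
    return a & b == a
```

With this encoding, the subset tests that closed itemset checking relies on reduce to a single bitwise AND, and a sparse itemset occupies only as many sections as contain at least one of its items.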
Keywords :
data mining; CLIMB algorithm; bitmap coding; data stream; frequent closed itemsets mining; frequent pattern analysis; sliding window; stream mining; Artificial intelligence; Costs; Data engineering; Data mining; Distributed computing; Itemsets; Knowledge engineering; Laboratories; Software engineering; Transaction databases; bitmap; frequent closed itemsets; stream
Conference_Title :
Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, 2008. SNPD '08. Ninth ACIS International Conference on
Conference_Location :
Phuket
Print_ISBN :
978-0-7695-3263-9
DOI :
10.1109/SNPD.2008.31