Title :
An Efficient Approximate Approach to Mining Frequent Itemsets over High Speed Transactional Data Streams
Author :
Kuen-Fang Jea ; Chao-Wei Li ; Tsui-Ping Chang
Author_Institution :
Dept. of Comput. Sci. & Eng., Nat. Chung-Hsing Univ., Taichung
Abstract :
A data stream is a massive and unbounded sequence of data elements that are continuously generated at a fast speed. Compared with traditional data mining, knowledge discovery in data streams is more challenging since several requirements need to be satisfied. In this paper we propose a mining algorithm for finding frequent itemsets over a transactional data stream. Unlike most of existing algorithms, our method works based on the theory of Approximate Inclusion-Exclusion to approximate the itemsets´ counts. Some techniques are designed and integrated into the algorithm for performance improvement. And the performance of the proposed algorithm is tested and analyzed through several experiments.
Keywords :
data mining; approximate inclusion-exclusion; data mining; frequent itemset mining algorithm; knowledge discovery; transactional data streams; Algorithm design and analysis; Application software; Chaos; Computer science; Data engineering; Data mining; Data structures; Design engineering; Intelligent systems; Itemsets; approximate approach; combinatorial approximation; data stream; frequent itemsets mining;
Conference_Titel :
Intelligent Systems Design and Applications, 2008. ISDA '08. Eighth International Conference on
Conference_Location :
Kaohsiung
Print_ISBN :
978-0-7695-3382-7
DOI :
10.1109/ISDA.2008.74