DocumentCode
1636943
Title
An Efficient Approximate Approach to Mining Frequent Itemsets over High Speed Transactional Data Streams
Author
Kuen-Fang Jea ; Chao-Wei Li ; Tsui-Ping Chang
Author_Institution
Dept. of Comput. Sci. & Eng., Nat. Chung-Hsing Univ., Taichung
Volume
3
fYear
2008
Firstpage
275
Lastpage
280
Abstract
A data stream is a massive and unbounded sequence of data elements that are continuously generated at a fast speed. Compared with traditional data mining, knowledge discovery in data streams is more challenging since several requirements need to be satisfied. In this paper we propose a mining algorithm for finding frequent itemsets over a transactional data stream. Unlike most of existing algorithms, our method works based on the theory of Approximate Inclusion-Exclusion to approximate the itemsets´ counts. Some techniques are designed and integrated into the algorithm for performance improvement. And the performance of the proposed algorithm is tested and analyzed through several experiments.
Keywords
data mining; approximate inclusion-exclusion; data mining; frequent itemset mining algorithm; knowledge discovery; transactional data streams; Algorithm design and analysis; Application software; Chaos; Computer science; Data engineering; Data mining; Data structures; Design engineering; Intelligent systems; Itemsets; approximate approach; combinatorial approximation; data stream; frequent itemsets mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Systems Design and Applications, 2008. ISDA '08. Eighth International Conference on
Conference_Location
Kaohsiung
Print_ISBN
978-0-7695-3382-7
Type
conf
DOI
10.1109/ISDA.2008.74
Filename
4696474
Link To Document