DocumentCode
3024809
Title
A Mining Maximal Frequent Itemsets over the Entire History of Data Streams
Author
Mao, Yinmin ; Li, Hong ; Yang, Lumin ; Chen, Zhigang ; Liu, Lixin
Author_Institution
Sch. of Inf. Sci. & Eng., Central South Univ., Changsha, China
fYear
2009
fDate
25-26 April 2009
Firstpage
413
Lastpage
417
Abstract
Mining maximal frequent itemsets has been widely concerned. However, mining data streams is more difficult than mining static databases because of the huge, high-speed and continuous characteristics of streaming data. This paper presents an algorithm, called IDSM-MFI. The algorithm uses a synopsis data structure to store the items of transactions embedded data streams so far. It adopts a top-bottom and bottom-top method to mine the set of all maximal frequent itemsets in landmark windows over data stream, which can be output in real time based on users´ specified thresholds. Theoretical analysis and experimental results show that our algorithm is efficient and scalable for mining the set of all maximal frequent itemsets over the entire history of data stream.
Keywords
data mining; data structures; IDSM-MFI algorithm; bottom-top method; data streams mining; landmark windows; maximal frequent itemset mining; synopsis data structure; top-bottom method; Algorithm design and analysis; Data engineering; Data mining; Data structures; Frequency; History; Information science; Itemsets; Sensor phenomena and characterization; Transaction databases; data mining; data streams; maximal frequent itemsets;
fLanguage
English
Publisher
ieee
Conference_Titel
Database Technology and Applications, 2009 First International Workshop on
Conference_Location
Wuhan, Hubei
Print_ISBN
978-0-7695-3604-0
Type
conf
DOI
10.1109/DBTA.2009.125
Filename
5207728
Link To Document