Title :
A New Algorithm for Mining Global Frequent Itemsets in a Stream
Author :
Guo, Lichao ; Su, Hongye ; Qu, Yu
Author_Institution :
State Key Lab. of Ind. Control Technol., Zhejiang Univ., Hangzhou, China
Abstract :
To find global frequent itemsets in a multiple, continuous, rapid and time-varying data stream, a fast, incremental, real-time, and little-memory-cost algorithm should be used. Based on the max-frequency window model, a BHS summary structure and a novel algorithm called GGFI-MFW are proposed. It merely updates the summaries for subsets of the data new arrived and could directly generate the max-frequency for a given item set without scanning the whole summary. Experiment results indicate that the proposed algorithm could efficiently find global frequent itemsets over a data stream with a small memory and perform overwhelming superiority for a large number of distinct items.
Keywords :
data mining; BHS summary structure; GGFI-MFW; little-memory-cost algorithm; max-frequency window model; mining global frequent itemsets; time-varying data stream; Data mining; Frequency measurement; Frequency shift keying; Fuzzy systems; Ice; Industrial control; Itemsets; Marketing and sales; Research and development;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
Conference_Location :
Tianjin
Print_ISBN :
978-0-7695-3735-1
DOI :
10.1109/FSKD.2009.265