DocumentCode
175846
Title
Parallel frequent itemset mining on streaming data
Author
Yanshan He ; Min Yue
Author_Institution
Electron. & Inf. Sci. Dept., Lanzhou Jiaotong Univ., Lanzhou, China
fYear
2014
fDate
19-21 Aug. 2014
Firstpage
725
Lastpage
730
Abstract
Owing to the widely used of data stream, frequent itemset mining on data stream have received more attention. Data stream is fast changing, massive, and potentially infinite. Therefore, we have to establish new data structure and algorithm to mine it. On the base of our previous work, we propose a new paralleled frequent itemset mining algorithm for data stream based on sliding window, which is called PFIMSD. The algorithm compresses whole data in current window into PSD-trees on paralleled processor only by one-scan. Increment method is used to append or delete related branch on PSD-tree when window is sliding. The experiment shows PFIMSD algorithm has good performance on efficiency and expansibility.
Keywords
data compression; data mining; parallel processing; tree data structures; PFIMSD algorithm; PSD-trees; branch appending; branch deletion; data compression; data streaming; data structure; increment method; parallel frequent itemset mining; paralleled processor; sliding window; Algorithm design and analysis; Approximation algorithms; Data mining; Data structures; Itemsets; Parallel algorithms; Frequent Itemset Mining; Frequent Pattern; High Performance; Paralleled; Streaming Data;
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Computation (ICNC), 2014 10th International Conference on
Conference_Location
Xiamen
Print_ISBN
978-1-4799-5150-5
Type
conf
DOI
10.1109/ICNC.2014.6975926
Filename
6975926
Link To Document