DocumentCode :
3457754
Title :
The FP-Tree Algorithm Used for Data Stream
Author :
Tu, Qiang ; Lu, Jian-Feng ; Tang, Jiu-Bin ; Yang, Jing-Yu
Author_Institution :
Dept. Of Comput. Sci., Nanjing Univ. Of Sci. & Technol., Nanjing, China
fYear :
2010
fDate :
21-23 Oct. 2010
Firstpage :
1
Lastpage :
5
Abstract :
FP-growth is a classical algorithm in frequent pattern mining, which is often used in static data mining. Some researches have been done on using FP-growth algorithm to analyze streaming data. However, the double-scan-of-database manner in FP-tree creation is a serious bottleneck in streaming data analysis. Sliding window technique could solve this problem in certain degree, but it still can lead to the inaccuracy of FP-tree creation, which may impact the consequent data mining. In this paper, a new FP-tree algorithm is presented for streaming data, which creates the FP-tree by a single-pass scanning (SPSFP) throughout the database. Compared with the traditional FP-tree creation, the new method scans the database only once and doesn´t need to store the whole data set into memory, which not only saves the memory space but also makes it possible to mine frequent pattern accurately in streaming data. Furthermore, the time cost of the new algorithm is almost equivalent to the traditional one.
Keywords :
data analysis; data mining; database management systems; trees (mathematics); FP growth algorithm; FP tree algorithm; data streaming analysis; double-scan-of-database; pattern mining; single pass scanning; sliding window technique; static data mining; Algorithm design and analysis; Data mining; Databases; Electronic mail; Information science; Next generation networking; Presses;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (CCPR), 2010 Chinese Conference on
Conference_Location :
Chongqing
Print_ISBN :
978-1-4244-7209-3
Electronic_ISBN :
978-1-4244-7210-9
Type :
conf
DOI :
10.1109/CCPR.2010.5659233
Filename :
5659233
Link To Document :
بازگشت