DocumentCode
3116331
Title
A change detector for mining frequent patterns over evolving data streams
Author
Ng, Willie ; Dash, Manoranjan
Author_Institution
Centre for Adv. Inf. Syst., Nanyang Technol. Univ., Singapore
fYear
2008
fDate
12-15 Oct. 2008
Firstpage
2407
Lastpage
2412
Abstract
Mining data streams for frequent patterns is important in many applications. Unlike traditional static databases, the underlying process that generates the data streams evolves over time. Past data may become outdated and of little use when compared to the most recent one. When a significant change occurs, much harm is done to the mining result if it is not properly handled. In this paper, an online algorithm for change detection in frequent pattern mining is proposed. Although there have been several studies mainly for adapting to changes, we contend that this is not enough. The ability to detect and characterize change is essential in many applications. A novel test strategy is designed to gather the ldquoevidencerdquo sufficient to conclude on whether the current sample differ significantly from a reference sample. Different statistical tests are evaluated and our study shows that the chi-square test is the most suitable for enumerated or count data.
Keywords
data mining; statistical testing; change detector; chi-square test; data streams; frequent pattern mining; statistical tests; Application software; Change detection algorithms; Data mining; Databases; Detectors; Feeds; Information systems; Itemsets; Sampling methods; Testing; Change Detection; Data Stream; Frequent Pattern Mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man and Cybernetics, 2008. SMC 2008. IEEE International Conference on
Conference_Location
Singapore
ISSN
1062-922X
Print_ISBN
978-1-4244-2383-5
Electronic_ISBN
1062-922X
Type
conf
DOI
10.1109/ICSMC.2008.4811655
Filename
4811655
Link To Document