DocumentCode :
3112988
Title :
Mining Interesting Ratio Patterns over a Stream Sliding Window
Author :
Fan, Wei ; Watanabe, Toshio ; Asakura, Koichi
Author_Institution :
Nagoya Univ., Nagoya, Japan
fYear :
2009
fDate :
13-15 Dec. 2009
Firstpage :
731
Lastpage :
734
Abstract :
In this paper, we aim to capture dependency among high correlated attributes in a sliding window of data streams at any time. Naive solutions cannot address combinatorial explosion problems. In order to generate highly correlated sets, we propose a candidate-generation rule based on Jaccard distance, and a generation method based on min-hashing functions to estimate the Jaccard distance without requiring to count supports explicitly. For mining frequent ratios, we introduce a concept of ratio range to reduce the candidate ratio sets efficiently. Additionally, the downward closure property of ratio patterns is also available to prune candidate sets. Our extensive experiments on both real and synthetic datasets verify the efficiency of our algorithms.
Keywords :
data mining; file organisation; Jaccard distance; candidate ratio sets; candidate-generation rule; data stream sliding window; downward closure property; frequent ratio mining; highly correlated sets; min-hashing functions; ratio pattern mining; Eigenvalues and eigenfunctions; Explosions; Machine learning; Mutual information; Time factors; Transaction databases; Ratio pattern; correlation mining; data stream;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Applications, 2009. ICMLA '09. International Conference on
Conference_Location :
Miami Beach, FL
Print_ISBN :
978-0-7695-3926-3
Type :
conf
DOI :
10.1109/ICMLA.2009.85
Filename :
5381325
Link To Document :
بازگشت