Title :
Efficient Incremental Subspace Clustering in Data Streams
Author :
Kontaki, Maria ; Papadopoulos, Apostolos N. ; Manolopoulos, Yannis
Author_Institution :
Dept. of Informatics, Aristotle Univ., Thessaloniki
Abstract :
Performing data mining tasks in streaming data is considered a challenging research direction, due to the continuous data evolution. In this work, we focus on the problem of clustering streaming time series, based on the sliding window paradigm. More specifically, we use the concept of alpha-clusters in each time instance separately. A subspace alpha-cluster consists of a set of streams, whose value difference is less than a in a consecutive number of time instances (dimensions). The clusters can be continuously and incrementally updated as the streaming time series evolve. The proposed technique is based on a careful examination of pair-wise stream similarities for a subset of dimensions and then, it is generalized for more streams per cluster. Performance evaluation results show that the proposed pruning criteria are important for search space reduction, and that the cost of incremental cluster monitoring is computationally more efficient than reclustering
Keywords :
data mining; pattern clustering; data mining; data streaming; incremental cluster monitoring; incremental subspace clustering; search space reduction; sliding window paradigm; time series; Clustering algorithms; Costs; Data analysis; Data engineering; Data mining; Databases; Informatics; Monitoring; Query processing;
Conference_Titel :
Database Engineering and Applications Symposium, 2006. IDEAS '06. 10th International
Conference_Location :
Delhi
Print_ISBN :
0-7695-2577-6
DOI :
10.1109/IDEAS.2006.19