DocumentCode
2973137
Title
Efficient Incremental Subspace Clustering in Data Streams
Author
Kontaki, Maria ; Papadopoulos, Apostolos N. ; Manolopoulos, Yannis
Author_Institution
Dept. of Informatics, Aristotle Univ., Thessaloniki
fYear
2006
fDate
Dec. 2006
Firstpage
53
Lastpage
60
Abstract
Performing data mining tasks in streaming data is considered a challenging research direction, due to the continuous data evolution. In this work, we focus on the problem of clustering streaming time series, based on the sliding window paradigm. More specifically, we use the concept of alpha-clusters in each time instance separately. A subspace alpha-cluster consists of a set of streams, whose value difference is less than a in a consecutive number of time instances (dimensions). The clusters can be continuously and incrementally updated as the streaming time series evolve. The proposed technique is based on a careful examination of pair-wise stream similarities for a subset of dimensions and then, it is generalized for more streams per cluster. Performance evaluation results show that the proposed pruning criteria are important for search space reduction, and that the cost of incremental cluster monitoring is computationally more efficient than reclustering
Keywords
data mining; pattern clustering; data mining; data streaming; incremental cluster monitoring; incremental subspace clustering; search space reduction; sliding window paradigm; time series; Clustering algorithms; Costs; Data analysis; Data engineering; Data mining; Databases; Informatics; Monitoring; Query processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Database Engineering and Applications Symposium, 2006. IDEAS '06. 10th International
Conference_Location
Delhi
ISSN
1098-8068
Print_ISBN
0-7695-2577-6
Type
conf
DOI
10.1109/IDEAS.2006.19
Filename
4041603
Link To Document