DocumentCode
3167037
Title
Incremental Subspace Clustering over Multiple Data Streams
Author
Zhang, Qi ; Liu, Jinze ; Wang, Wei
Author_Institution
Univ. of North Carolina, Charlotte
fYear
2007
fDate
28-31 Oct. 2007
Firstpage
727
Lastpage
732
Abstract
Data streams are often locally correlated, with a subset of streams exhibiting coherent patterns over a subset of time points. Subspace clustering can discover clusters of objects in different subspaces. However, traditional sub- space clustering algorithms for static data sets are not readily used for incremental clustering, and is very expensive for frequent re-clustering over dynamically changing stream data. In this paper, we present an efficient incremental sub- space clustering algorithm for multiple streams over sliding windows. Our algorithm detects all the delta-CC-Clusters, which capture the coherent changing patterns among a set of streams over a set of time points. delta-CC´-Cluster s are incrementally generated by traversing a directed acyclic graph pDAG. We propose efficient insertion and deletion operations to update the pDAG dynamically. In addition, effective pruning techniques are applied to reduce the search space. Experiments on real data sets demonstrate the performance of our algorithm.
Keywords
data analysis; pattern clustering; delta-CC-Clusters; effective pruning technique; incremental subspace clustering; multiple data streams; sliding windows; static data sets; Algorithm design and analysis; Boolean functions; Clustering algorithms; Clustering methods; Data analysis; Data mining; Data structures; Monitoring; Subspace constraints; Telecommunication traffic;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2007. ICDM 2007. Seventh IEEE International Conference on
Conference_Location
Omaha, NE
ISSN
1550-4786
Print_ISBN
978-0-7695-3018-5
Type
conf
DOI
10.1109/ICDM.2007.100
Filename
4470318
Link To Document