DocumentCode :
3026597
Title :
Density-Based Data Streams Subspace Clustering over Weighted Sliding Windows
Author :
Ren, Jiadong ; Cao, Shiyuan ; Hu, Changzhen
Author_Institution :
Coll. of Inf. Sci. & Eng., Yanshan Univ., Qinhuangdao, China
fYear :
2010
fDate :
23-24 Oct. 2010
Firstpage :
212
Lastpage :
216
Abstract :
Most real-world data sets are characterized by a high dimensinal, inherely sparse data space. In this paper, we present a novel density-based approach to the subspace clustering problem. A new framework for data stream mining is introduced, called the weighted sliding window. In the online component, the structure of Exponential Histogram of Cluster Feature(EHCF) is improved to maintain the micro-clusters. The concepts of potential core-micro-cluster and outlier micro-cluster are applied to distinguish the potential clusters and outliers. A novel pruning strategy is proposed to decrease the number of micro-clusters. In the offline component, the final clusters are generated by SUBCLU algorithm. Our performance study demonstrates the effectiveness and efficiency of our algorithm.
Keywords :
data mining; pattern clustering; SUBCLU algorithm; core-microclusters; data stream mining; density-based data streams subspace clustering; exponential histogram of cluster feature; outlier microcluster; pruning strategy; sparse data space; weighted sliding windows; Algorithm design and analysis; Clustering algorithms; Data mining; Data models; Finite element methods; Histograms; Knowledge engineering; data stream; density-based; subspace clustering; weighted sliding windows;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cryptography and Network Security, Data Mining and Knowledge Discovery, E-Commerce & Its Applications and Embedded Systems (CDEE), 2010 First ACIS International Symposium on
Conference_Location :
Qinhuangdao
Print_ISBN :
978-1-4244-9595-5
Type :
conf
DOI :
10.1109/CDEE.2010.48
Filename :
5759375
Link To Document :
بازگشت