DocumentCode :
2243427
Title :
Clustering algorithm for probabilistic data stream over sliding windows
Author :
Hu, Wei-Cheng ; Cheng, Zhuan-Liu
Author_Institution :
Coll. of Comput. Sci., Hefei Technol. Univ., Hefei, China
Volume :
4
fYear :
2010
fDate :
11-14 July 2010
Firstpage :
2065
Lastpage :
2070
Abstract :
An effective clustering algorithm called PWStream for probabilistic data stream over sliding window is developed in this paper. The algorithm uses exponential histogram of cluster feature to store the summary information of the most recently arrived tuples, and outdated information is deleted within a certain guaranteed range of error. For the uncertain tuples in data stream, the concepts of strong cluster, transitional cluster and weak cluster are proposed in the PWStream. With these concepts, an effective strategy of choosing cluster based on distance and existence probability is designed, which can find more strong clusters. Theoretical analysis and comprehensive experimental results demonstrate that the proposed method is of high quality and fast processing rate.
Keywords :
pattern clustering; probability; PWStream; clustering algorithm; probabilistic data stream; sliding windows; Algorithm design and analysis; Clustering algorithms; Cybernetics; Histograms; Machine learning; Machine learning algorithms; Probabilistic logic; Clustering; Histogram; Probabilistic data stream; Sliding window;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics (ICMLC), 2010 International Conference on
Conference_Location :
Qingdao
Print_ISBN :
978-1-4244-6526-2
Type :
conf
DOI :
10.1109/ICMLC.2010.5580503
Filename :
5580503
Link To Document :
بازگشت