DocumentCode :
2539547
Title :
An Improved Online Stream Data Clustering Algorithm
Author :
Li, Lingjuan ; Li, Xiong
Author_Institution :
Coll. of Comput., Nanjing Univ. of Posts & Telecommun., Nanjing, China
fYear :
2012
fDate :
12-14 Oct. 2012
Firstpage :
526
Lastpage :
529
Abstract :
The stream data mining is a hot research topic in recent years. In order to improve the efficiency of stream data mining, this paper designs an online stream data clustering algorithm IStrAP. IStrAP considers the features of stream data, such as potentially infinity, rapidness, and inability to scan historical data repeatedly, and introduces a method of eliminating outliers to the existing algorithm StrAP. IStrAP does statistical analysis of the data in reservoir (a temporary storage area) to get the statistics and the parameters that can reflect the data characteristics, removes the abnormal data from the reservoir according to the statistical properties, and then clusters the residuary data in the reservoir. The experimental results show that IStrAP can effectively eliminate outliers, and it not only has higher clustering accuracy and lower time complexity than existing StrAP algorithm, but also has better dynamic adaptability for the stream data.
Keywords :
computational complexity; data mining; pattern clustering; statistical analysis; storage management; IStrAP algorithm; clustering accuracy; data characteristics; data reservoir; historical data; online stream data clustering algorithm; residuary data; statistical analysis; statistical properties; stream data mining; temporary storage area; time complexity; Accuracy; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Data mining; Data models; Reservoirs; StrAP; clustering; outliers; stream data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Business Computing and Global Informatization (BCGIN), 2012 Second International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4673-4469-2
Type :
conf
DOI :
10.1109/BCGIN.2012.143
Filename :
6382584
Link To Document :
بازگشت