DocumentCode :
2405128
Title :
Streaming-data algorithms for high-quality clustering
Author :
Callaghan, Liadan O. ; Mishra, Nina ; Meyerson, A. ; Guha, Sudipto ; Motwani, Rajeev
Author_Institution :
Dept. of Comput. Sci., Stanford Univ., CA, USA
fYear :
2002
fDate :
2002
Firstpage :
685
Lastpage :
694
Abstract :
Streaming data analysis has recently attracted attention in numerous applications including telephone records, Web documents and click streams. For such analysis, single-pass algorithms that consume a small amount of memory are critical. We describe such a streaming algorithm that effectively clusters large data streams. We also provide empirical evidence of the algorithm´s performance on synthetic and real data streams
Keywords :
data analysis; pattern clustering; Web documents; click streams; high-quality clustering; large data stream clusters; single-pass algorithms; streaming data analysis; streaming-data algorithms; telephone records; Algorithm design and analysis; Clustering algorithms; Computer science; Data analysis; Data engineering; Lab-on-a-chip; Laboratories; Partitioning algorithms; Telecommunications; Telephony;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2002. Proceedings. 18th International Conference on
Conference_Location :
San Jose, CA
ISSN :
1063-6382
Print_ISBN :
0-7695-1531-2
Type :
conf
DOI :
10.1109/ICDE.2002.994785
Filename :
994785
Link To Document :
بازگشت