DocumentCode :
2973938
Title :
PBIRCH: A Scalable Parallel Clustering algorithm for Incremental Data
Author :
Garg, Ashwani ; Mangla, Ashish ; Gupta, Neelima ; Bhatnagar, Vasudha
Author_Institution :
Dept. of Comput. Sci., Delhi Univ.
fYear :
2006
fDate :
Dec. 2006
Firstpage :
315
Lastpage :
316
Abstract :
We present a parallel version of BIRCH with the objective of enhancing the scalability without compromising on the quality of clustering. The incoming data is distributed in a cyclic manner (or block cyclic manner if the data is bursty) to balance the load among processors. The algorithm is implemented on a message passing share-nothing model. Experiments show that for very large data sets the algorithm scales nearly linearly with the increasing number of processors. Experiments also show that clusters obtained by PBIRCH are comparable to those obtained using BIRCH
Keywords :
message passing; parallel algorithms; pattern clustering; resource allocation; very large databases; PBIRCH scalable parallel clustering algorithm; incremental data; load balance; massive dataset clustering; message passing share-nothing model; Algorithm design and analysis; Broadcasting; Clustering algorithms; Computer science; Delay; Memory management; Message passing; Partitioning algorithms; Scalability; Time factors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Database Engineering and Applications Symposium, 2006. IDEAS '06. 10th International
Conference_Location :
Delhi
ISSN :
1098-8068
Print_ISBN :
0-7695-2577-6
Type :
conf
DOI :
10.1109/IDEAS.2006.36
Filename :
4041640
Link To Document :
بازگشت