Title :
Implementation of distributed ROCK algorithm for clustering of large categorical datasets and its performance analysis
Author :
Patidar, Anil ; Joshi, Ritesh ; Mishra, Surendra
Author_Institution :
MCA, MITM, Indore, India
Abstract :
Clustering in data mining is useful for discovering distribution patterns in the underlying data. ROCK is one such hierarchical clustering algorithm, which works on sampled data. We show that the sequential ROCK algorithm is time-consuming for large datasets. Instead, we present distributed algorithms with better performance than known algorithms. We develop a distributed version of the robust hierarchical clustering algorithm ROCK, called DROCK, in which preliminary calculations are performed on different processors. In addition to presenting detailed complexity results for DROCK, we also conduct an experimental study with real-life data sets to demonstrate the effectiveness of our technique.
Keywords :
data mining; distributed algorithms; pattern clustering; distributed ROCK algorithm; distribution pattern discovery; large categorical dataset clustering; performance analysis; robust hierarchical clustering algorithm; sequential ROCK algorithm; Algorithm design and analysis; Clustering algorithms; Data mining; Engines; Program processors; Robustness; Rocks; Categorical Dataset; Clustering; Distributed Computing; ROCK
Conference_Title :
2011 3rd International Conference on Electronics Computer Technology (ICECT)
Conference_Location :
Kanyakumari
Print_ISBN :
978-1-4244-8678-6
Electronic_ISBN :
978-1-4244-8679-3
DOI :
10.1109/ICECTECH.2011.5941659