DocumentCode :
3107208
Title :
A Simple Yet Effective Data Clustering Algorithm
Author :
Vadapalli, Soujanya ; Valluri, Satyanarayana R. ; Karlapalem, Kamalakar
Author_Institution :
Center for Data Eng., IIIT, Hyderabad
fYear :
2006
fDate :
18-22 Dec. 2006
Firstpage :
1108
Lastpage :
1112
Abstract :
In this paper, we use a simple concept based on k-reverse nearest neighbor digraphs, to develop a framework RECORD for clustering and outlier detection. We developed three algorithms - (i) RECORD algorithm (requires one parameter), (ii) Agglomerative RECORD algorithm (no parameters required) and (iii) Stability-based RECORD algorithm (no parameters required). Our experimental results with published datasets, synthetic and real-life datasets show that RECORD not only handles noisy data, but also identifies the relevant clusters. Our results are as good as (if not better than) the results got from other algorithms.
Keywords :
data handling; directed graphs; pattern clustering; agglomerative RECORD algorithm; data clustering; k-reverse nearest neighbor digraph; outlier detection; real-life dataset; stability-based RECORD algorithm; synthetic dataset; Clustering algorithms; Data engineering; Data mining; Detection algorithms; Merging; Nearest neighbor searches; Noise shaping; Optical sensors; Partitioning algorithms; Stability;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2006. ICDM '06. Sixth International Conference on
Conference_Location :
Hong Kong
ISSN :
1550-4786
Print_ISBN :
0-7695-2701-7
Type :
conf
DOI :
10.1109/ICDM.2006.9
Filename :
4053162
Link To Document :
بازگشت