DocumentCode
3107208
Title
A Simple Yet Effective Data Clustering Algorithm
Author
Vadapalli, Soujanya ; Valluri, Satyanarayana R. ; Karlapalem, Kamalakar
Author_Institution
Center for Data Eng., IIIT, Hyderabad
fYear
2006
fDate
18-22 Dec. 2006
Firstpage
1108
Lastpage
1112
Abstract
In this paper, we use a simple concept based on k-reverse nearest neighbor digraphs to develop RECORD, a framework for clustering and outlier detection. We develop three algorithms: (i) the RECORD algorithm (requires one parameter), (ii) the Agglomerative RECORD algorithm (no parameters required), and (iii) the Stability-based RECORD algorithm (no parameters required). Our experimental results on published, synthetic, and real-life datasets show that RECORD not only handles noisy data but also identifies the relevant clusters. Our results are as good as, if not better than, those obtained from other algorithms.
Keywords
data handling; directed graphs; pattern clustering; agglomerative RECORD algorithm; data clustering; k-reverse nearest neighbor digraph; outlier detection; real-life dataset; stability-based RECORD algorithm; synthetic dataset; Clustering algorithms; Data engineering; Data mining; Detection algorithms; Merging; Nearest neighbor searches; Noise shaping; Optical sensors; Partitioning algorithms; Stability;
fLanguage
English
Publisher
IEEE
Conference_Titel
Data Mining, 2006. ICDM '06. Sixth International Conference on
Conference_Location
Hong Kong
ISSN
1550-4786
Print_ISBN
0-7695-2701-7
Type
conf
DOI
10.1109/ICDM.2006.9
Filename
4053162