• DocumentCode
    3107208
  • Title

    A Simple Yet Effective Data Clustering Algorithm

  • Author

    Vadapalli, Soujanya ; Valluri, Satyanarayana R. ; Karlapalem, Kamalakar

  • Author_Institution
    Center for Data Eng., IIIT, Hyderabad
  • fYear
    2006
  • fDate
    18-22 Dec. 2006
  • Firstpage
    1108
  • Lastpage
    1112
  • Abstract
    In this paper, we use a simple concept based on k-reverse nearest neighbor digraphs, to develop a framework RECORD for clustering and outlier detection. We developed three algorithms - (i) RECORD algorithm (requires one parameter), (ii) Agglomerative RECORD algorithm (no parameters required) and (iii) Stability-based RECORD algorithm (no parameters required). Our experimental results with published datasets, synthetic and real-life datasets show that RECORD not only handles noisy data, but also identifies the relevant clusters. Our results are as good as (if not better than) the results got from other algorithms.
  • Keywords
    data handling; directed graphs; pattern clustering; agglomerative RECORD algorithm; data clustering; k-reverse nearest neighbor digraph; outlier detection; real-life dataset; stability-based RECORD algorithm; synthetic dataset; Clustering algorithms; Data engineering; Data mining; Detection algorithms; Merging; Nearest neighbor searches; Noise shaping; Optical sensors; Partitioning algorithms; Stability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2006. ICDM '06. Sixth International Conference on
  • Conference_Location
    Hong Kong
  • ISSN
    1550-4786
  • Print_ISBN
    0-7695-2701-7
  • Type

    conf

  • DOI
    10.1109/ICDM.2006.9
  • Filename
    4053162