• DocumentCode
    2554697
  • Title

    An Efficient Clustering Algorithm for 2D Multi-density Dataset in Large Database

  • Author

    Xia, Ying ; Wang, Guoyin ; Gao, Song

  • Author_Institution
    Southwest Jiaotong Univ., Chengdu
  • fYear
    2007
  • fDate
    26-28 April 2007
  • Firstpage
    78
  • Lastpage
    82
  • Abstract
    Spatial clustering is an important component of spatial data mining. The requirement of detecting clusters of points arises in many applications. One of the challenges in spatial clustering is to find clusters on multi-density dataset. In this paper, a grid-based density-confidence-interval clustering algorithm for 2-dimensional multi-density dataset is proposed, called GDCIC. The proposed algorithm combines the density confidence interval with grid-based clustering, and produces accurate density estimation in local areas for local density thresholds. Local dense areas are distinguished from sparse areas or outliers according to these thresholds. Experiments based on both synthetic and real datasets verify that the algorithm is efficiently for multi-data sets and handle outliers effectively.
  • Keywords
    data mining; grid computing; pattern clustering; very large databases; 2D multidensity dataset; clustering algorithm; grid-based density-confidence-interval clustering algorithm; large database; local density thresholds; spatial clustering; spatial data mining; Application software; Clustering algorithms; Computer science; Data analysis; Data mining; Equations; Information science; Partitioning algorithms; Sampling methods; Spatial databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Ubiquitous Engineering, 2007. MUE '07. International Conference on
  • Conference_Location
    Seoul
  • Print_ISBN
    0-7695-2777-9
  • Type

    conf

  • DOI
    10.1109/MUE.2007.67
  • Filename
    4197253