• DocumentCode
    2765777
  • Title

    A fast clustering algorithm based on grid and density

  • Author

    Sun, Zhiwei ; Zhao, Zheng ; Wang, Hongmei ; Ma, Maode ; Zhang, Lianfang ; Shu, Yantai

  • Author_Institution
    Sch. of Electron. & Inf. Eng., Tianjin Univ.
  • fYear
    2005
  • fDate
    1-4 May 2005
  • Firstpage
    2276
  • Lastpage
    2279
  • Abstract
    The efficiency of data mining algorithms is a very important issue as data becoming larger and larger. Density-based clustering analysis can discover clusters with arbitrary shape and is insensitive to noise data. The advantage of grid-based clustering method is linear time complexity. In this paper, we present a new clustering algorithm CLUGD relying on grid and density. We first construct a grid of relevant portion. Then the algorithm finds references by grid and classifies these references to core references and bound references. Then it attaches the data of the bound references to the nearest core references and aggregation the core references in neighboring portions. At last, in-direct graph is used to classify these core references and maps cluster to original data. We performed an experimental evaluation of effectiveness and efficiency of CLUGD using synthetic data and the data of the SEQUOIA 2000 Benchmark. Both theory analysis and experimental results confirm that CLUGD can discover clusters with arbitrary shape and is insensitive to noise data. In the meanwhile, its executing efficiency is much higher than DBSCAN algorithm based on R*-tree
  • Keywords
    data mining; graph theory; pattern clustering; CLUGD; R*-tree; SEQUOIA 2000 Benchmark; bound references; core references; data mining algorithms; density-based clustering analysis; fast clustering algorithm; grid-based clustering method; in-direct graph; linear time complexity; Algorithm design and analysis; Clustering algorithms; Clustering methods; Data engineering; Data mining; Noise shaping; Partitioning algorithms; Performance evaluation; Shape; Sun;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical and Computer Engineering, 2005. Canadian Conference on
  • Conference_Location
    Saskatoon, Sask.
  • ISSN
    0840-7789
  • Print_ISBN
    0-7803-8885-2
  • Type

    conf

  • DOI
    10.1109/CCECE.2005.1557443
  • Filename
    1557443