• DocumentCode
    424330
  • Title

    Data clustering algorithm based on binary subspace division

  • Author

    Wang, Hong-Bin ; Wang, Cheng-Bo ; Zhang, Li-Feng ; Zhou, Dong-Ru

  • Author_Institution
    Sch. of Comput., Wuhan Univ., China
  • Volume
    2
  • fYear
    2004
  • fDate
    26-29 Aug. 2004
  • Firstpage
    1249
  • Abstract
    Clustering is an important data analyzing method in data mining. We analyzed existing clustering algorithm and raised a new grid density clustering algorithm based on binary subspace division. Region quadtree is a type of spatial data structure based on binary division, we used this structure to 2-dimensional clustering. We also gave out the construction algorithm of region-density tree (RD-quadtree), the region merging algorithm, and the algorithm of calculating the connect component of RD-Quadtree, then extended the algorithm to high-dimensional data space and analyzed the space and time complexity of the RD-quadtree based clustering algorithm. We further proved that the RD-quadtree based clustering algorithm only did grid division of the non-empty space in the high-dimensional data space. It will lower the number of the grid unit drastically and gain higher space and time efficiency.
  • Keywords
    data analysis; data mining; pattern clustering; quadtrees; spatial data structures; binary division; binary subspace division; data analyzing method; data clustering algorithm; data mining; grid density clustering algorithm; region quadtree; region-density tree; spatial data structure; Algorithm design and analysis; Clustering algorithms; Data analysis; Data mining; Data structures; Electronic mail; Machine learning; Machine learning algorithms; Merging; Statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
  • Print_ISBN
    0-7803-8403-2
  • Type

    conf

  • DOI
    10.1109/ICMLC.2004.1382383
  • Filename
    1382383