DocumentCode :
424330
Title :
Data clustering algorithm based on binary subspace division
Author :
Wang, Hong-Bin ; Wang, Cheng-Bo ; Zhang, Li-Feng ; Zhou, Dong-Ru
Author_Institution :
Sch. of Comput., Wuhan Univ., China
Volume :
2
fYear :
2004
fDate :
26-29 Aug. 2004
Firstpage :
1249
Abstract :
Clustering is an important data analyzing method in data mining. We analyzed existing clustering algorithm and raised a new grid density clustering algorithm based on binary subspace division. Region quadtree is a type of spatial data structure based on binary division, we used this structure to 2-dimensional clustering. We also gave out the construction algorithm of region-density tree (RD-quadtree), the region merging algorithm, and the algorithm of calculating the connect component of RD-Quadtree, then extended the algorithm to high-dimensional data space and analyzed the space and time complexity of the RD-quadtree based clustering algorithm. We further proved that the RD-quadtree based clustering algorithm only did grid division of the non-empty space in the high-dimensional data space. It will lower the number of the grid unit drastically and gain higher space and time efficiency.
Keywords :
data analysis; data mining; pattern clustering; quadtrees; spatial data structures; binary division; binary subspace division; data analyzing method; data clustering algorithm; data mining; grid density clustering algorithm; region quadtree; region-density tree; spatial data structure; Algorithm design and analysis; Clustering algorithms; Data analysis; Data mining; Data structures; Electronic mail; Machine learning; Machine learning algorithms; Merging; Statistics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
Print_ISBN :
0-7803-8403-2
Type :
conf
DOI :
10.1109/ICMLC.2004.1382383
Filename :
1382383
Link To Document :
بازگشت