Title :
Data clustering algorithm based on digital search tree
Author :
Xiao-Heng Zhou ; Wang, Hong-Bin ; Dong-Ru Zhou ; Meng, Bo
Author_Institution :
Sch. of Comput., Wuhan Univ., China
Abstract :
Clustering is an important data analyzing method in data mining. In this paper, we analyzed existing clustering algorithms and raised a new density-based grid clustering algorithm based on digital search tree (DST). We classified DST as a new kind of clustering method and gave out the construction algorithm of the region-density digital search tree (RD-DST) and its clustering algorithm. We then extended the algorithm to high-dimensional data space and analyzed the space and time complexities of the RD-DST based clustering algorithm. We further proved that the RD-DST based clustering algorithm only did grid division of the non-empty space in the high-dimensional data space. It lowers the number of the grid unit drastically and gain higher space and time efficiency.
Keywords :
computational complexity; data mining; tree searching; data analyzing method; data clustering; data mining; density-based grid clustering algorithm; digital search tree; grid division; high-dimensional data space; nonempty space; time complexities; Algorithm design and analysis; Classification tree analysis; Clustering algorithms; Clustering methods; Data analysis; Data mining; Electronic mail; Encoding; Partitioning algorithms; Statistics;
Conference_Titel :
Machine Learning and Cybernetics, 2003 International Conference on
Print_ISBN :
0-7803-8131-9
DOI :
10.1109/ICMLC.2003.1259781