Title :
Study of high dimensional clustering algorithm based on graph partition
Author_Institution :
Sch. of Electron. & Comput. Sci. & Technol., North Univ. of China, Taiyuan, China
Abstract :
In many clustering applications, the data sets are high-dimensional, sparse and binary, resulting to the failure of traditional algorithms in handling these data. In this paper, we present a new clustering algorithm based on graph partition for high-dimensional data, which, by defining the feature vector of attribute-value distribution and the similarity of attribution-value distribution, and creating a sequence of smaller and smaller coarse graphs from the original base graph. The smallest coarse graph is then partitioned using a spectral method, and this partition is propagated back through the hierarchy of graphs. Thus, the corresponding data items in each partition are highly related. The analysis demonstrates that this algorithm is effective in clustering knowledge discover.
Keywords :
data handling; data mining; graph theory; pattern clustering; attribute value distribution; data handling; graph partition; high dimensional clustering algorithm; knowledge discover; Algorithm design and analysis; Clustering algorithms; Computational modeling; Data models; Marketing and sales; Partitioning algorithms; Data mining; Graph partition; High-dimensional clustering;
Conference_Titel :
Computer Application and System Modeling (ICCASM), 2010 International Conference on
Conference_Location :
Taiyuan
Print_ISBN :
978-1-4244-7235-2
Electronic_ISBN :
978-1-4244-7237-6
DOI :
10.1109/ICCASM.2010.5622890