DocumentCode :
533148
Title :
Study of high dimensional clustering algorithm based on graph partition
Author :
Yuan Gao
Author_Institution :
Sch. of Electron. & Comput. Sci. & Technol., North Univ. of China, Taiyuan, China
Volume :
13
fYear :
2010
fDate :
22-24 Oct. 2010
Abstract :
In many clustering applications, the data sets are high-dimensional, sparse and binary, resulting to the failure of traditional algorithms in handling these data. In this paper, we present a new clustering algorithm based on graph partition for high-dimensional data, which, by defining the feature vector of attribute-value distribution and the similarity of attribution-value distribution, and creating a sequence of smaller and smaller coarse graphs from the original base graph. The smallest coarse graph is then partitioned using a spectral method, and this partition is propagated back through the hierarchy of graphs. Thus, the corresponding data items in each partition are highly related. The analysis demonstrates that this algorithm is effective in clustering knowledge discover.
Keywords :
data handling; data mining; graph theory; pattern clustering; attribute value distribution; data handling; graph partition; high dimensional clustering algorithm; knowledge discover; Algorithm design and analysis; Clustering algorithms; Computational modeling; Data models; Marketing and sales; Partitioning algorithms; Data mining; Graph partition; High-dimensional clustering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Application and System Modeling (ICCASM), 2010 International Conference on
Conference_Location :
Taiyuan
Print_ISBN :
978-1-4244-7235-2
Electronic_ISBN :
978-1-4244-7237-6
Type :
conf
DOI :
10.1109/ICCASM.2010.5622890
Filename :
5622890
Link To Document :
بازگشت