DocumentCode
2710259
Title
A Probability Model for Projective Clustering on High Dimensional Data
Author
Chen, Lifei ; Jiang, Qingshan ; Wang, Shengrui
Author_Institution
Dept. of Comput. Sci., Fujian Normal Univ., Fuzhou
fYear
2008
fDate
15-19 Dec. 2008
Firstpage
755
Lastpage
760
Abstract
Clustering high dimensional data is a big challenge in data mining due to the curse of dimensionality. To solve this problem, projective clustering has been defined as an extension of traditional clustering that seeks to find projected clusters in subsets of dimensions of a data space. In this paper, the problem of modeling projected clusters is first discussed, and an extended Gaussian model is proposed. Second, a general objective criterion used with k-means type projective clustering is presented based on the model. Finally, the expressions to learn model parameters are derived and then used in a new algorithm named FPC to perform fuzzy clustering on high dimensional data. The experimental results on document clustering show the effectiveness of the proposed clustering model.
Keywords
Gaussian processes; data mining; fuzzy set theory; learning (artificial intelligence); pattern clustering; probability; Gaussian model; data mining; fuzzy clustering; high dimensional data; learning algorithm; probability model; projective clustering; Clustering algorithms; Clustering methods; Computer science; Data mining; Extraterrestrial phenomena; Flexible printed circuits; Los Angeles Council; Monte Carlo methods; Partitioning algorithms; Random variables;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2008. ICDM '08. Eighth IEEE International Conference on
Conference_Location
Pisa
ISSN
1550-4786
Print_ISBN
978-0-7695-3502-9
Type
conf
DOI
10.1109/ICDM.2008.15
Filename
4781174
Link To Document