Title :
Research of Correction Method in the Feature Space on Text Clustering
Author :
Jiang, Xueying ; Shi, Yingjin ; Li, Shiyao
Author_Institution :
Northeastern Univ. at Qinhuangdao, Qinhuangdao, China
Abstract :
For the feature space of high-dimensional data on text clustering contains many redundant features, even "noise" features. The author proposed a feature space correction method, combine with a supervised feature selection methods and K-means clustering method. By analyzing the significance of the features in the clustering process and selecting the features that have more significance, to amend the initial feature space to exclude the less important features, give prominence to the main features, reduce the noise and improve the clustering effect.
Keywords :
pattern clustering; text analysis; K-means clustering method; feature selection methods; feature space correction method; high-dimensional data; noise features; text clustering; Classification algorithms; Clustering algorithms; Clustering methods; Data mining; Frequency measurement; Noise; Spatial databases;
Conference_Titel :
Computer Science & Service System (CSSS), 2012 International Conference on
Conference_Location :
Nanjing
Print_ISBN :
978-1-4673-0721-5
DOI :
10.1109/CSSS.2012.505