Title :
Refining Spherical K-Means for Clustering Documents
Author :
Peng, Jiming ; Zhu, Jiaping
Author_Institution :
McMaster Univ., Hamilton
Abstract :
Spherical k-means is a popular algorithm for document clustering. However, it may still yield poor performance in some circumstances. In this paper, we consider a discrete optimization model for spherical k-means. By using the convexity of objective function and specific structure of constraint set, we first reformulate the discrete problem as an equivalent convex maximization problem with linear constraints. Then we characterize the local optimality of relaxed problem. Based on the characteristics, we refine the spherical k-means algorithm by alternatively performing spherical k-means and switching data points between clusters. This strategy guarantees that the refined algorithm can always attain a local optimal solution.
Keywords :
document handling; pattern clustering; convex maximization problem; discrete optimization model; document clustering; linear constraints; spherical k-means refining; Clustering algorithms; Clustering methods; Data mining; Euclidean distance; Frequency; Information retrieval; Mathematics; Standards development; Text categorization; Text mining;
Conference_Titel :
Neural Networks, 2006. IJCNN '06. International Joint Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
0-7803-9490-9
DOI :
10.1109/IJCNN.2006.246962