DocumentCode
1864453
Title
Attributes Scaling for K-means Algorithm Controlled by Misclassification of All Clusters
Author
Siriseriwan, Wacharasak ; Sinapiromsaran, Krung
Author_Institution
Dept. of Math., Chulalongkorn Univ., Bangkok, Thailand
fYear
2010
fDate
9-10 Jan. 2010
Firstpage
220
Lastpage
223
Abstract
K-means clustering is one of the well-known distance-based clustering methods which partitions data into distinct groups. To implement an automatic attribute-scaled K-mean algorithm, the concept of classification has been integrated. Data points which belong to the same target class are considered similar in K-means clustering. In this paper, we explore and determine the optimal attribute-scaled vector that minimizes misclassification error of the target class. This paper uses the non-linear unconstrained optimization technique in attribute-scaled space, called the cyclic coordinate method together with the golden section line search to find the optimal vector. Our experiments show that the methods can provide the optimal scaling vectors which effectively reduce the misclassification error of supervised K-means clustering and lead to the effective supervised clustering in some data sets.
Keywords
data handling; learning (artificial intelligence); optimisation; pattern clustering; attribute-scaled space; attributes scaling; automatic attribute-scaled k-mean algorithm; cyclic coordinate method; data partitioning; distance-based clustering methods; golden section line search; misclassification error reduction; nonlinear unconstrained optimization; optimal attribute-scaled vector; optimal scaling vectors; supervised k-means clustering; Automatic control; Clustering algorithms; Data mining; Electronic mail; Feedback; Iterative algorithms; Mathematics; Partitioning algorithms; Supervised learning; Unsupervised learning; attribute-scaled space; cyclic coordinate method; golden section line search; k-means clustering; misclassification error;
fLanguage
English
Publisher
ieee
Conference_Titel
Knowledge Discovery and Data Mining, 2010. WKDD '10. Third International Conference on
Conference_Location
Phuket
Print_ISBN
978-1-4244-5397-9
Electronic_ISBN
978-1-4244-5398-6
Type
conf
DOI
10.1109/WKDD.2010.90
Filename
5432656
Link To Document