Title :
Clustering based on distortion-ratio criterion
Author_Institution :
Dept. of Inf. & Comput. Sci., Nara Women´´s Univ., Nara, Japan
Abstract :
Clustering based on a distortion-ratio criterion for data sets with large statistical differences of class data is treated, where K-Means algorithm (KMA) cannot necessarily reveal the good performance. After obtaining cluster centers attaining the minimum squared-error distortion or its approximation by KMA, clustering based on the distortion-ratio criterion, whose criterion provides a measure of the validity of clustering, is introduced. When the criterion is not satisfied, a split and merge procedure based on the distortion-ratio measure is executed. Focusing on an interesting data set which is not resolved by KMA, clustering based on the distortion-ratio criterion and the split and merge procedure is investigated.
Keywords :
distortion; least squares approximations; pattern classification; pattern clustering; statistical analysis; data set clustering; distortion-ratio criterion; k-means algorithm clustering approximation; minimum squared-error distortion; pattern classification; split-and-merge procedure; statistical difference; Clustering algorithms; Data mining; Distortion measurement; Image processing; Industrial electronics; Partitioning algorithms; Pattern recognition; Statistical distributions;
Conference_Titel :
Industrial Electronics, 2009. ISIE 2009. IEEE International Symposium on
Conference_Location :
Seoul
Print_ISBN :
978-1-4244-4347-5
Electronic_ISBN :
978-1-4244-4349-9
DOI :
10.1109/ISIE.2009.5214510