Title :
Comparison of scaling behavior between fuzzy c-means based classifier with many parameters and LibSVM
Author :
Ichihashi, Hidetomo ; Honda, Katsuhiro ; Notsu, Akira
Author_Institution :
Dept. of Comput. Sci. & Intell. Syst., Osaka Prefecture Univ., Sakai, Japan
Abstract :
This paper reports the scaling behavior of the fuzzy c-means based classifier (FCMC) with many parameters. FCMC is a classifier based on clustering approaches. The classification accuracy on test sets (i.e., the generalization capability) is not necessarily improved by increasing the number of clusters. Especially when the number of training samples is relatively small, not only the classification boundary over-fits the data, but also covariance matrices and cluster centers are computed incorrectly, since the number of samples in each cluster becomes smaller. Hence, the test set accuracy deteriorates. The performance of FCMC with two clusters in each class and the number of training samples less than 1000, was reported in the literature. This paper reports the scaling behavior of FCMC by testing with variously-sized training samples. The number of clusters of FCMC is increased up to eight. The number of clusters used in this paper is not very large but the number of parameters is relatively large. So, the parameters are optimized to training sets. LibSVM is one of the widely known state of the art tools for support vector machines (SVM). The test set accuracy, training time and testing time (i.e., the detection time) of FCMC are compared with LibSVM by varying the size of training sets. FCMC shows a good generalization capability, though the parameters are optimized to training sets. When the number of training samples is increased by 10 times, the training time of FCMC increases by 10 times, but that of LibSVM increases by a factor of 100. The testing time is also much shorter than LibSVM when the size of the training set is large.
Keywords :
covariance matrices; fuzzy set theory; pattern classification; pattern clustering; support vector machines; FCMC; LibSVM; classification boundary; cluster centers; clustering approach; covariance matrices; fuzzy c-means based classifier; scaling behavior testing; support vector machine; Accuracy; Covariance matrix; Error analysis; Principal component analysis; Support vector machines; Training; classifier; clustering; large data set; scaling behavior;
Conference_Titel :
Fuzzy Systems (FUZZ), 2011 IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-7315-1
Electronic_ISBN :
1098-7584
DOI :
10.1109/FUZZY.2011.6007352