• DocumentCode
    2310652
  • Title

    Semi-hard c-means clustering with application to classifier design

  • Author

    Ichihashi, H. ; Notsu, A. ; Honda, K.

  • Author_Institution
    Dept. of Comput. Sci. & Intell. Syst., Osaka Prefecture Univ., Sakai, Japan
  • fYear
    2010
  • fDate
    18-23 July 2010
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Fuzzy c-means (FCM) clustering is the method for partitioning data into clusters by minimizing an objective function. Therefore, it is important to devise an objective function from which a simple clustering algorithm can be derived. An entropy term was introduced by S. Miyamoto in the FCM objective function. We proposed an objective function of the fuzzy counterpart of Gaussian mixture models (GMMs) clustering. The objective function is based on Kullback-Leibler divergence instead of the entropy. S. Miyamoto derived a hard clustering algorithm by linearizing the K-L divergence term of the objective function. In the hard c-means (HCM) clustering approach, covariance matrices are decision variables. For quick and stable convergence of FCM-like clustering, this paper proposes the semi-hard clustering approach by constraining the membership in an interval [ab]. The semi-hard clustering result is used for a classifier design. The membership function suggested by the generalized FCM and K-L based FCM is used for the classifier. The values of hyperparameters are searched by particle swarm optimization (PSO). In terms of classification performance on UCI benchmark data, the classifier is comparable to the support vector machine (SVM) and surpasses the k-nearest neighbor (k-NN) classifier. The computation time of FCM classifier for training a large scale data set is smaller than that of the SVM classifier with decomposition and working set selection algorithms.
  • Keywords
    Gaussian processes; fuzzy set theory; particle swarm optimisation; pattern classification; Gaussian mixture model clustering; Kullback-Leibler divergence; UCI benchmark data; classifier design; covariance matrice; data partitioning; objective function; particle swarm optimization; semihard c-means clustering; Clustering algorithms; Covariance matrix; Entropy; Error analysis; Partitioning algorithms; Silicon; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems (FUZZ), 2010 IEEE International Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1098-7584
  • Print_ISBN
    978-1-4244-6919-2
  • Type

    conf

  • DOI
    10.1109/FUZZY.2010.5584553
  • Filename
    5584553