• DocumentCode
    2936001
  • Title

    A new distance measure for hierarchical clustering

  • Author

    Yavuz, Hasan Serhan ; Çevikalp, Hakan

  • Author_Institution
    Elektrik ve Elektron. Muhendisligi Bolumu, Eskisehir Osmangazi Univ., Eskisehir
  • fYear
    2008
  • fDate
    20-22 April 2008
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Support vector machine (SVM) classifier formulation is originally designed for binary classification, and the extension of it to the multi-class case is still an open research problem. Classical approaches such as one-against-one or one-against-all have been used to address the multi-class problem, but these approaches become less appealing when the number of classes in the training set is too large. Recent approaches use hierarchical based classification for the multi-class problems since they scale well with the number of classes. SVM based hierarchical classifiers involve the partition of data samples through a clustering algorithm, and classification performance of the overall system heavily depends on the generated clusters. The clustering methods such as k-means, kernel k-means, spherical shells and balanced subset clustering have been used for this goal, but their distance measures, which are used for partitioning the data samples, are not compatible with the SVM classification goal. This paper introduces a new distance measure for partition of data samples for SVM based hierarchical classification. Unlike other clustering methods used for this goal, our proposed method is suitable when SVMs are used as the base classifier. As demonstrated in the experiments, integrating the proposed clustering scheme into the hierarchical SVM classifiers significantly improves the computational efficiency with a small decrease in the recognition accuracy.
  • Keywords
    pattern classification; pattern clustering; support vector machines; binary classification; data samples partition; hierarchical SVM classifiers; hierarchical classification; hierarchical clustering; support vector machine classifier formulation; Classification algorithms; Clustering algorithms; Clustering methods; Computational efficiency; Kernel; Partitioning algorithms; Support vector machine classification; Support vector machines; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing, Communication and Applications Conference, 2008. SIU 2008. IEEE 16th
  • Conference_Location
    Aydin
  • Print_ISBN
    978-1-4244-1998-2
  • Electronic_ISBN
    978-1-4244-1999-9
  • Type

    conf

  • DOI
    10.1109/SIU.2008.4632558
  • Filename
    4632558