• DocumentCode
    2620349
  • Title

    A weighting scheme based on emerging patterns for weighted support vector machines

  • Author

    Fan, Hongjian ; Ramamohanarao, Kotagiri

  • Author_Institution
    Dept. of Comput. Sci. & Software Eng., Melbourne Univ., Vic., Australia
  • Volume
    2
  • fYear
    2005
  • fDate
    25-27 July 2005
  • Firstpage
    435
  • Abstract
    Support vector machines (SVMs) are powerful tools for solving classification problems and have been applied to many application fields, such as pattern recognition and data mining, in the past decade. Weighted support vector machines (weighted SVMs) extend SVMs by considering that different input vectors make different contributions to the learning of decision surface. An important issue in training weighted SVMs is how to develop a reliable weighting model to reflect the true noise distribution in the training data, i.e., noise and outliers should have low weights. In this paper, we propose to use emerging patterns (EPs) to construct such a model. EPs are those itemsets whose supports in one class are significantly higher than their supports in the other class. Since EPs of a given class represent the discriminating knowledge unique to their home class, noise and outliers should contain no EPs or EPs of the both contradicting classes, while a representative instance of the class should contain strong EPs of the same class. We calculate numeric scores for each instance based on EPs, and then assign weights to the training data using those scores. An extensive experiment carried out on a large number of benchmark datasets show that our weighting scheme often improves the performance of weighted SVMs over SVMs. We argue that the improvement is due to the ability of our model to approximate the true distribution of data points.
  • Keywords
    pattern classification; support vector machines; classification problem; emerging pattern; noise distribution; weighted support vector machine; weighting model; Application software; Computer science; Data mining; Kernel; Machine learning; Pattern recognition; Software engineering; Support vector machine classification; Support vector machines; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Granular Computing, 2005 IEEE International Conference on
  • Print_ISBN
    0-7803-9017-2
  • Type

    conf

  • DOI
    10.1109/GRC.2005.1547329
  • Filename
    1547329