• DocumentCode
    1797956
  • Title

    A Transductive Support Vector Machine with adjustable quasi-linear kernel for semi-supervised data classification

  • Author

    Bo Zhou ; Chenlong Hu ; Benhui Chen ; Jinglu Hu

  • Author_Institution
    Grad. Sch. of Inf., Production & Syst., Waseda Univ. of Hibikino, Kitakyushu, Japan
  • fYear
    2014
  • fDate
    6-11 July 2014
  • Firstpage
    1409
  • Lastpage
    1415
  • Abstract
    This paper focuses on the semi-supervised classification problem using a Transductive Support Vector Machine (TSVM). A traditional TSVM for semi-supervised classification first trains an SVM model on the labeled data, then uses the model to predict labels for the unlabeled data, and finally optimizes those predictions to retrain the SVM. A TSVM always uses a predefined kernel with fixed parameters during the optimization procedure, and it also suffers from a potential over-fitting problem. In this paper we introduce a proposed quasi-linear kernel into the TSVM. An SVM with a quasi-linear kernel realizes an approximate nonlinear separation boundary by interpolating multiple local linear boundaries. Applying the quasi-linear kernel to semi-supervised classification can avoid potential over-fitting and provide more accurate predictions for the unlabeled data. After the unlabeled data predictions are optimized, the quasi-linear kernel can be further adjusted by using the potential boundary data distribution as prior knowledge. We also introduce a minimal set method for optimizing the unlabeled data predictions. The minimal set method follows the clustering assumption of semi-supervised learning, and pairwise label switching is allowed between minimal sets. This can speed up the optimization procedure and reduce the influence of the label constraint in the TSVM. Experimental results on benchmark gene datasets show that the proposed method is effective and improves classification performance.
  • Keywords
    approximation theory; data handling; interpolation; learning (artificial intelligence); optimisation; pattern classification; pattern clustering; support vector machines; SVM model; TSVM; adjustable quasi-linear kernel; clustering assumption; interpolation; minimal set method; multilocal linear boundaries; nonlinear separation boundary; optimization procedure; pairwise label switching; potential boundary data distribution; potential over-fitting problem; semisupervised data classification problem; semisupervised learning; transductive support vector machine; unlabeled data prediction optimization; Data models; Kernel; Optimization; Predictive models; Support vector machines; Switches; Training;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Neural Networks (IJCNN), 2014 International Joint Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4799-6627-1
  • Type

    conf

  • DOI
    10.1109/IJCNN.2014.6889703
  • Filename
    6889703
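
The abstract mentions pairwise label switching on the unlabeled data. As a rough illustration only, the sketch below shows the classical pairwise switching rule commonly used in TSVM training, not the minimal set method of this paper, whose details are not given in this record. The function names `hinge_slack` and `switch_labels` are hypothetical, and the decision values are assumed to come from an already trained SVM that is held fixed here (a real TSVM would retrain after switching).

```python
def hinge_slack(label, score):
    """Hinge slack of one point: max(0, 1 - y * f(x))."""
    return max(0.0, 1.0 - label * score)


def switch_labels(scores, labels):
    """Greedy pairwise label switching on unlabeled points.

    scores: decision values f(x) from the current SVM (assumed given, fixed).
    labels: tentative labels in {+1, -1}; a modified copy is returned.

    A pair with opposite labels is swapped when both slacks are positive
    and their sum exceeds 2. With fixed scores this strictly lowers the
    total hinge loss, and the number of positive labels stays the same.
    """
    labels = list(labels)
    swapped = True
    while swapped:  # repeat until a full pass makes no swap
        swapped = False
        for i in range(len(labels)):
            for j in range(i + 1, len(labels)):
                if labels[i] == labels[j]:
                    continue  # only opposite-label pairs may be switched
                slack_i = hinge_slack(labels[i], scores[i])
                slack_j = hinge_slack(labels[j], scores[j])
                if slack_i > 0 and slack_j > 0 and slack_i + slack_j > 2:
                    labels[i], labels[j] = labels[j], labels[i]
                    swapped = True
    return labels


if __name__ == "__main__":
    # Toy decision values and deliberately poor tentative labels.
    print(switch_labels([2.0, -1.5, 0.5, -0.2], [-1, 1, 1, -1]))
```

Because labels are only exchanged within a pair, the class ratio assigned to the unlabeled data is preserved, which is the label constraint the abstract refers to.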