Title :
A Transductive Support Vector Machine with adjustable quasi-linear kernel for semi-supervised data classification
Author :
Bo Zhou ; Chenlong Hu ; Benhui Chen ; Jinglu Hu
Author_Institution :
Grad. Sch. of Inf., Production & Syst., Waseda Univ. of Hibikino, Kitakyushu, Japan
Abstract :
This paper focuses on semi-supervised classification problem by using Transductive Support Vector Machine. Traditional TSVM for semi-supervised classification firstly train an SVM model with labeled data. Then use the model to predict unlabeled data and optimize unlabeled data prediction to retrain the SVM. TSVM always uses a predefined kernel and fixed parameters during the optimization procedure and they also suffers potential over-fitting problem. In this paper we introduce proposed quasi-linear kernel to the TSVM. An SVM with quasi-linear kernel realizes an approximate nonlinear separation boundary by multi-local linear boundaries with interpolation. By applying quasi-linear kernel to semi-supervised classification it can avoid potential over-fitting and provide more accurate unlabeled data prediction. After unlabeled data prediction optimization, the quasi-linear kernel can be further adjusted considering the potential boundary data distribution as prior knowledge. We also introduce a minimal set method for optimizing unlabeled data prediction. The minimal set method follows the clustering assumption of semi-supervised learning. The pairwise label switching is allowed between minimal sets. It can speed up optimization procedure and reduce influence from label constrain in TSVM. Experiment results on benchmark gene datasets show that the proposed method is effective and improves classification performances.
Keywords :
approximation theory; data handling; interpolation; learning (artificial intelligence); optimisation; pattern classification; pattern clustering; support vector machines; SVM model; TSVM; adjustable quasi-linear kernel; clustering assumption; interpolation; minimal set method; multilocal linear boundaries; nonlinear separation boundary; optimization procedure; pairwise label switching; potential boundary data distribution; potential over-fitting problem; semisupervised data classification problem; semisupervised learning; transductive support vector machine; unlabeled data prediction optimization; Data models; Kernel; Optimization; Predictive models; Support vector machines; Switches; Training;
Conference_Titel :
Neural Networks (IJCNN), 2014 International Joint Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4799-6627-1
DOI :
10.1109/IJCNN.2014.6889703