Regional Information Center for Science and Technology - A Transductive Support Vector Machine with adjustable quasi-linear kernel for semi-supervised data classification

DocumentCode :

1797956

Title :

A Transductive Support Vector Machine with adjustable quasi-linear kernel for semi-supervised data classification

Author :

Bo Zhou ; Chenlong Hu ; Benhui Chen ; Jinglu Hu

Author_Institution :

Grad. Sch. of Inf., Production & Syst., Waseda Univ. of Hibikino, Kitakyushu, Japan

fYear :

2014

fDate :

6-11 July 2014

Firstpage :

1409

Lastpage :

1415

Abstract :

This paper addresses the semi-supervised classification problem using the Transductive Support Vector Machine (TSVM). A traditional TSVM for semi-supervised classification first trains an SVM model on the labeled data, then uses the model to predict labels for the unlabeled data, and finally optimizes those predictions to retrain the SVM. A TSVM typically uses a predefined kernel with fixed parameters throughout the optimization procedure, and it can also suffer from over-fitting. In this paper we introduce the proposed quasi-linear kernel into the TSVM. An SVM with a quasi-linear kernel realizes an approximately nonlinear separation boundary by interpolating multiple local linear boundaries. Applying the quasi-linear kernel to semi-supervised classification can avoid potential over-fitting and provide more accurate predictions for the unlabeled data. After the unlabeled data predictions have been optimized, the quasi-linear kernel can be further adjusted by treating the potential boundary data distribution as prior knowledge. We also introduce a minimal set method for optimizing the unlabeled data predictions. The minimal set method follows the clustering assumption of semi-supervised learning, and pairwise label switching is allowed between minimal sets. This speeds up the optimization procedure and reduces the influence of the label constraint in TSVM. Experimental results on benchmark gene datasets show that the proposed method is effective and improves classification performance.
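The baseline loop summarized in the abstract (train on labeled data, predict the unlabeled data, retrain on both) can be illustrated with a minimal sketch. This is not the paper's method: it is plain self-training with a linear SVM fitted by subgradient descent, and it leaves out the quasi-linear kernel, the minimal set method, pairwise label switching, and the TSVM label constraint. All function names, parameter values, and the toy data are hypothetical and chosen only for illustration.

```python
# Hypothetical sketch of the train / predict / retrain loop (not the paper's code).
# Assumption: a linear SVM trained by subgradient descent stands in for the
# kernel SVM; labels are +1 / -1.

def decision(w, b, x):
    """Signed score of point x under the linear model (w, b)."""
    return sum(wj * xj for wj, xj in zip(w, x)) + b


def train_linear_svm(X, y, lam=0.01, eta=0.1, epochs=200):
    """Minimise the L2-regularised hinge loss with a fixed step size."""
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            margin = yi * decision(w, b, xi)
            # Shrink step coming from the regulariser.
            w = [(1.0 - eta * lam) * wj for wj in w]
            # Hinge-loss step only when the margin is violated.
            if margin < 1.0:
                w = [wj + eta * yi * xj for wj, xj in zip(w, xi)]
                b += eta * yi
    return w, b


def predict(w, b, X):
    """Assign +1 / -1 labels from the sign of the decision score."""
    return [1 if decision(w, b, x) >= 0.0 else -1 for x in X]


def self_train(X_lab, y_lab, X_unl, max_rounds=5):
    """Train on labeled data, label the unlabeled data, retrain on both."""
    w, b = train_linear_svm(X_lab, y_lab)
    y_unl = predict(w, b, X_unl)
    for _ in range(max_rounds):
        w, b = train_linear_svm(X_lab + X_unl, y_lab + y_unl)
        new_labels = predict(w, b, X_unl)
        if new_labels == y_unl:  # predictions are stable, stop early
            break
        y_unl = new_labels
    return w, b, y_unl


# Toy data (made up): two labeled points and four unlabeled points.
X_labeled = [[-2.0, -1.0], [2.0, 1.0]]
y_labeled = [-1, 1]
X_unlabeled = [[-1.5, -0.5], [-1.0, -1.5], [1.0, 1.5], [1.5, 0.5]]

w, b, y_unlabeled = self_train(X_labeled, y_labeled, X_unlabeled)
print(y_unlabeled)
```

A full TSVM additionally constrains the proportion of positive and negative labels assigned to the unlabeled data and swaps labels between pairs of points to reduce the objective; the sketch only re-predicts and retrains until the labels stop changing.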

Keywords :

approximation theory; data handling; interpolation; learning (artificial intelligence); optimisation; pattern classification; pattern clustering; support vector machines; SVM model; TSVM; adjustable quasi-linear kernel; clustering assumption; interpolation; minimal set method; multilocal linear boundaries; nonlinear separation boundary; optimization procedure; pairwise label switching; potential boundary data distribution; potential over-fitting problem; semisupervised data classification problem; semisupervised learning; transductive support vector machine; unlabeled data prediction optimization; Data models; Kernel; Optimization; Predictive models; Support vector machines; Switches; Training;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Neural Networks (IJCNN), 2014 International Joint Conference on

Conference_Location :

Beijing

Print_ISBN :

978-1-4799-6627-1

Type :

conf

DOI :

10.1109/IJCNN.2014.6889703

Filename :

6889703

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1797956