DocumentCode :
1550377
Title :
In-Sample and Out-of-Sample Model Selection and Error Estimation for Support Vector Machines
Author :
Anguita, Davide ; Ghio, A. ; Oneto, L. ; Ridella, S.
Author_Institution :
DITEN, Univ. of Genova, Genova, Italy
Volume :
23
Issue :
9
fYear :
2012
Firstpage :
1390
Lastpage :
1406
Abstract :
In-sample approaches to model selection and error estimation of support vector machines (SVMs) are not as widespread as out-of-sample methods, where part of the data is removed from the training set for validation and testing purposes, mainly because their practical application is not straightforward and the latter provide, in many cases, satisfactory results. In this paper, we survey some recent and not-so-recent results of the data-dependent structural risk minimization framework and propose a proper reformulation of the SVM learning algorithm, so that the in-sample approach can be effectively applied. The experiments, performed both on simulated and real-world datasets, show that our in-sample approach can be favorably compared to out-of-sample methods, especially in cases where the latter ones provide questionable results. In particular, when the number of samples is small compared to their dimensionality, like in classification of microarray data, our proposal can outperform conventional out-of-sample approaches such as the cross validation, the leave-one-out, or the Bootstrap methods.
Keywords :
learning (artificial intelligence); statistical analysis; support vector machines; Bootstrap methods; SVM learning algorithm; data-dependent structural risk minimization framework; error estimation; in-sample model selection; microarray data; out-of-sample model selection; real-world datasets; simulated datasets; support vector machines; Complexity theory; Data models; Error analysis; Random variables; Risk management; Support vector machines; Training; Bootstrap; cross validation; error estimation; leave one out; model selection; statistical learning theory (SLT); structural risk minimization (SRM); support vector machine (SVM);
fLanguage :
English
Journal_Title :
Neural Networks and Learning Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
2162-237X
Type :
jour
DOI :
10.1109/TNNLS.2012.2202401
Filename :
6228541
Link To Document :
بازگشت