DocumentCode
1550377
Title
In-Sample and Out-of-Sample Model Selection and Error Estimation for Support Vector Machines
Author
Anguita, Davide ; Ghio, A. ; Oneto, L. ; Ridella, S.
Author_Institution
DITEN, Univ. of Genova, Genova, Italy
Volume
23
Issue
9
fYear
2012
Firstpage
1390
Lastpage
1406
Abstract
In-sample approaches to model selection and error estimation of support vector machines (SVMs) are not as widespread as out-of-sample methods, where part of the data is removed from the training set for validation and testing purposes, mainly because their practical application is not straightforward and the latter provide, in many cases, satisfactory results. In this paper, we survey some recent and not-so-recent results of the data-dependent structural risk minimization framework and propose a proper reformulation of the SVM learning algorithm, so that the in-sample approach can be effectively applied. The experiments, performed both on simulated and real-world datasets, show that our in-sample approach can be favorably compared to out-of-sample methods, especially in cases where the latter ones provide questionable results. In particular, when the number of samples is small compared to their dimensionality, like in classification of microarray data, our proposal can outperform conventional out-of-sample approaches such as the cross validation, the leave-one-out, or the Bootstrap methods.
Keywords
learning (artificial intelligence); statistical analysis; support vector machines; Bootstrap methods; SVM learning algorithm; data-dependent structural risk minimization framework; error estimation; in-sample model selection; microarray data; out-of-sample model selection; real-world datasets; simulated datasets; support vector machines; Complexity theory; Data models; Error analysis; Random variables; Risk management; Support vector machines; Training; Bootstrap; cross validation; error estimation; leave one out; model selection; statistical learning theory (SLT); structural risk minimization (SRM); support vector machine (SVM);
fLanguage
English
Journal_Title
Neural Networks and Learning Systems, IEEE Transactions on
Publisher
ieee
ISSN
2162-237X
Type
jour
DOI
10.1109/TNNLS.2012.2202401
Filename
6228541
Link To Document