DocumentCode
508108
Title
The Relationship between Generalization Error and the Training Sample Number of SVM
Author
Bai, Junqing ; Yan, Guirong ; Mao, Wentao
Author_Institution
Key Lab. of Strength & Vibration of Minist. of Educ., Xi´´an Jiaotong Univ., Xi´´an, China
Volume
1
fYear
2009
fDate
14-16 Aug. 2009
Firstpage
574
Lastpage
577
Abstract
It is very important to construct the training set and determine the sample number in the regression problem. In this paper, a new idea of constructing the training set is elaborated. The key point of this idea is to choose the hyper-parameters before determining the training set. More importantly, a heuristic approach is proposed to select samples of support vector machine (SVM). Using these methods, the relationship between generalization error and the number of training samples on a given confidence level is computed. The empirical results on benchmark data (Boston Housing) and engineering data indicate that the proposed approach can give a reference to construct the proper training set. Moreover, the proposed approach has practical significance for other parametric learning machine.
Keywords
generalisation (artificial intelligence); learning (artificial intelligence); regression analysis; support vector machines; SVM training sample; generalization error; heuristic approach; parametric learning machine; regression problem; sample number; support vector machine samples selection; training set construction; Accuracy; Benchmark testing; Data engineering; Laboratories; Least squares methods; Machine learning; Predictive models; Support vector machines; System performance; System testing; Generalization Error; Training Sample; support vector machine;
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Computation, 2009. ICNC '09. Fifth International Conference on
Conference_Location
Tianjin
Print_ISBN
978-0-7695-3736-8
Type
conf
DOI
10.1109/ICNC.2009.479
Filename
5365471
Link To Document