DocumentCode
2328170
Title
Systematic testing of generalization level during training in regression-type learning scenarios
Author
Zegers, Pablo ; Sundareshan, Malur K.
Author_Institution
Fac. de Ingenieria, Univ. of de los Andes Santiago, Chile
Volume
4
fYear
2004
fDate
25-29 July 2004
Firstpage
2807
Abstract
In training a learning machine (LM) with unlimited data samples available in the training set, it is important to be able to determine when the LM has attained an adequate level of generalization in order to stop the training process. While this is a problem that has not yet achieved a satisfactory solution, aiding the determination of the generalization level is the observation that as the LM becomes consistent and reaches an acceptable generalization threshold, finding samples from the training set that would make the system fail and trigger a new cycle of the training algorithm to be implemented becomes more infrequent. In a statistical sense, the number of samples that can be tested as having no new information (i.e. information not already learnt from training cycles already completed) between two successive triggers of training events asymptotically displays a faster than exponential growth behavior, which in turn provides a telltale sign of a LM reaching consistency and thus attaining a desired generalization level. This work employs some ideas taken from statistical learning theory to conjecture the existence of such exponential behavior and designs a new approach to implementing the training steps that can exploit this behavior in order to systematically test the generalization level during the training process. Examples of nonlinear regression problems are included to illustrate the ideas and to validate the methods. The obtained results are general and are independent of the configuration of the LM, its architecture, and the specific training algorithm used; hence, they are applicable to a broad class of supervised learning problems.
Keywords
generalisation (artificial intelligence); learning (artificial intelligence); regression analysis; generalization threshold; learning machine; nonlinear regression problem; regression type learning scenarios; statistical learning theory; systematic testing; Approximation algorithms; Bayesian methods; Design methodology; Displays; Learning automata; Machine learning; Statistical learning; Supervised learning; System testing; Virtual colonoscopy;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
ISSN
1098-7576
Print_ISBN
0-7803-8359-1
Type
conf
DOI
10.1109/IJCNN.2004.1381101
Filename
1381101
Link To Document