Systematic testing of generalization level during training in regression-type learning scenarios

Author

Zegers, Pablo ; Sundareshan, Malur K.

Author_Institution

Fac. de Ingenieria, Univ. of de los Andes Santiago, Chile

Volume

4

fYear

2004

fDate

25-29 July 2004

Firstpage

2807

Abstract

In training a learning machine (LM) with unlimited data samples available in the training set, it is important to be able to determine when the LM has attained an adequate level of generalization in order to stop the training process. While this is a problem that has not yet achieved a satisfactory solution, aiding the determination of the generalization level is the observation that as the LM becomes consistent and reaches an acceptable generalization threshold, finding samples from the training set that would make the system fail and trigger a new cycle of the training algorithm to be implemented becomes more infrequent. In a statistical sense, the number of samples that can be tested as having no new information (i.e. information not already learnt from training cycles already completed) between two successive triggers of training events asymptotically displays a faster than exponential growth behavior, which in turn provides a telltale sign of a LM reaching consistency and thus attaining a desired generalization level. This work employs some ideas taken from statistical learning theory to conjecture the existence of such exponential behavior and designs a new approach to implementing the training steps that can exploit this behavior in order to systematically test the generalization level during the training process. Examples of nonlinear regression problems are included to illustrate the ideas and to validate the methods. The obtained results are general and are independent of the configuration of the LM, its architecture, and the specific training algorithm used; hence, they are applicable to a broad class of supervised learning problems.

Keywords

generalisation (artificial intelligence); learning (artificial intelligence); regression analysis; generalization threshold; learning machine; nonlinear regression problem; regression type learning scenarios; statistical learning theory; systematic testing; Approximation algorithms; Bayesian methods; Design methodology; Displays; Learning automata; Machine learning; Statistical learning; Supervised learning; System testing; Virtual colonoscopy;

fLanguage

English

Publisher

ieee

Conference_Titel

Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on

ISSN

1098-7576

Print_ISBN

0-7803-8359-1

Type

conf

DOI

10.1109/IJCNN.2004.1381101

Filename

1381101