DocumentCode :
1660021
Title :
Training error, generalization error and learning curves in neural learning
Author :
Amari, Shun-Ichi
Author_Institution :
RIKEN Frontier Res. Program, Tokyo Univ., Japan
fYear :
1995
Firstpage :
4
Lastpage :
5
Abstract :
A neural network is trained by using a set of available examples to minimize the training error such that the network parameters fit the examples well. However, it is desired to minimize the generalization error to which no direct access is possible. There are discrepancies between the training error and the generalization error due to the statistical fluctuation of examples. The article focuses on this problem from the statistical point of view. When the number of training examples is large, we have a universal asymptotic evaluation on the discrepancies of the two errors. This can be used for model selection based on the information criterion. When the number of training examples is small, their discrepancies are big, causing a serious overfitting or overtraining problem. We analyze this phenomenon by using a simple model. It is surprising that the generalization error even increases as the number of examples increases in a certain range. This shows the adequacy of the minimum training error learning method. We evaluate various means of overcoming the overtraining such as cross validated early stopping of training, introduction of the regularization terms, model selection and others
Keywords :
generalisation (artificial intelligence); learning (artificial intelligence); minimisation; neural nets; cross validated early stopping; generalization error; information criterion; learning curves; minimum training error learning method; model selection; network parameters; neural learning; neural network; overtraining problem; regularization terms; statistical fluctuation; training error; universal asymptotic evaluation; Covariance matrix; Feedforward neural networks; Feeds; Fluctuations; Gaussian noise; Least squares methods; Neural networks; Probability distribution; Stochastic processes; Stochastic resonance;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Artificial Neural Networks and Expert Systems, 1995. Proceedings., Second New Zealand International Two-Stream Conference on
Conference_Location :
Dunedin
Print_ISBN :
0-8186-7174-2
Type :
conf
DOI :
10.1109/ANNES.1995.499426
Filename :
499426
Link To Document :
بازگشت