• DocumentCode
    288348
  • Title

    A note on the estimation of the generalization error and the prevention of overfitting [machine learning]

  • Author

    Pados, Dimitris A. ; Papantoni-Kazakos, P.

  • Author_Institution
    Dept. of Electr. Eng., Virginia Univ., Charlottesville, VA, USA
  • Volume
    1
  • fYear
    1994
  • fDate
    27 Jun-2 Jul 1994
  • Firstpage
    321
  • Abstract
    Valid generalization and overfitting are closely related issues in the theory of machine learning. In the context of multilayer perceptrons (MLPs) it is usually assumed that improved generalization can be achieved by reducing the functionality of the network. Overfitting is battled by cross-validation methods. A different look on these subjects is proposed by this paper. A Radial-Basis-Function (RBF) network is developed that provides statistically consistent estimates of the expected generalization error induced by arbitrary MLPs. Using the RBF network as a teacher for the MLP, a new batch backpropagation-type learning algorithm is developed that minimizes directly the estimated generalization error. The on-line version of this algorithm calls for a random generalization of the training set. It is concluded that the problem of improved generalization performance is equivalent to the problem of valid generalization of the training data set. Therefore, both problems fall within the statistical context of nonparametric density estimation
  • Keywords
    backpropagation; feedforward neural nets; multilayer perceptrons; backpropagation-type learning algorithm; functionality; generalization error; multilayer perceptrons; nonparametric density estimation; prevention of overfitting; radial-basis-function network; random generalization; training data set; valid generalization; Backpropagation algorithms; Contracts; Logic functions; Machine learning; Multilayer perceptrons; Neural networks; Nonhomogeneous media; Radial basis function networks; Supervised learning; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 1994. IEEE World Congress on Computational Intelligence., 1994 IEEE International Conference on
  • Conference_Location
    Orlando, FL
  • Print_ISBN
    0-7803-1901-X
  • Type

    conf

  • DOI
    10.1109/ICNN.1994.374183
  • Filename
    374183