• DocumentCode
    1442036
  • Title

    A tight bound on concept learning

  • Author

    Takahashi, Haruhisa ; Gu, Hanzhong

  • Author_Institution
    Dept. of Commun. & Syst. Eng., Univ. of Electro-Commun., Chofu, Japan
  • Volume
    9
  • Issue
    6
  • fYear
    1998
  • fDate
    11/1/1998 12:00:00 AM
  • Firstpage
    1191
  • Lastpage
    1202
  • Abstract
    A tight bound on the generalization performance of concept learning is shown by a novel approach. Unlike existing theories, the new approach uses no assumption on large sample size as in Bayesian approach and does not consider the uniform learnability as in the VC dimension analysis. We analyze the generalization performance of some particular learning algorithm that is not necessarily well behaved, in the hope that once learning curves or sample complexity of this algorithm is obtained, it is applicable to real learning situations. The result is expressed in a dimension called Boolean interpolation dimension, and is tight in the sense that it meets the lower bound requirement of Baum and Haussler (1989). The Boolean interpolation dimension is not greater than the number of modifiable system parameters, and definable for almost all the real-world networks such as backpropagation networks and linear threshold multilayer networks. It is shown that the generalization error follows from a beta distribution of parameters m, the number of training examples, and d, the Boolean interpolation dimension. This implies that for large d, the learning results tend to the average-case result, known as the self-averaging property of the learning. The bound is shown to be applicable to the practical learning algorithms that can be modeled by the Gibbs algorithm with a uniform prior. The result is also extended to the case of inconsistent learning
  • Keywords
    generalisation (artificial intelligence); interpolation; learning (artificial intelligence); neural nets; Boolean interpolation dimension; Gibbs algorithm; backpropagation networks; beta distribution; concept learning; generalization error; generalization performance; inconsistent learning; learning curves; linear threshold multilayer networks; sample complexity; self-averaging property; tight bound; Algorithm design and analysis; Backpropagation algorithms; Bayesian methods; Interpolation; Machine learning; Multi-layer neural network; Neural networks; Nonhomogeneous media; Performance analysis; Virtual colonoscopy;
  • fLanguage
    English
  • Journal_Title
    Neural Networks, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9227
  • Type

    jour

  • DOI
    10.1109/72.728362
  • Filename
    728362