A tight bound on concept learning

Author

Takahashi, Haruhisa ; Gu, Hanzhong

Author_Institution

Dept. of Commun. & Syst. Eng., Univ. of Electro-Commun., Chofu, Japan

Volume

9

Issue

6

fYear

1998

fDate

11/1/1998 12:00:00 AM

Firstpage

1191

Lastpage

1202

Abstract

A tight bound on the generalization performance of concept learning is shown by a novel approach. Unlike existing theories, the new approach uses no assumption on large sample size as in Bayesian approach and does not consider the uniform learnability as in the VC dimension analysis. We analyze the generalization performance of some particular learning algorithm that is not necessarily well behaved, in the hope that once learning curves or sample complexity of this algorithm is obtained, it is applicable to real learning situations. The result is expressed in a dimension called Boolean interpolation dimension, and is tight in the sense that it meets the lower bound requirement of Baum and Haussler (1989). The Boolean interpolation dimension is not greater than the number of modifiable system parameters, and definable for almost all the real-world networks such as backpropagation networks and linear threshold multilayer networks. It is shown that the generalization error follows from a beta distribution of parameters m, the number of training examples, and d, the Boolean interpolation dimension. This implies that for large d, the learning results tend to the average-case result, known as the self-averaging property of the learning. The bound is shown to be applicable to the practical learning algorithms that can be modeled by the Gibbs algorithm with a uniform prior. The result is also extended to the case of inconsistent learning

Keywords

generalisation (artificial intelligence); interpolation; learning (artificial intelligence); neural nets; Boolean interpolation dimension; Gibbs algorithm; backpropagation networks; beta distribution; concept learning; generalization error; generalization performance; inconsistent learning; learning curves; linear threshold multilayer networks; sample complexity; self-averaging property; tight bound; Algorithm design and analysis; Backpropagation algorithms; Bayesian methods; Interpolation; Machine learning; Multi-layer neural network; Neural networks; Nonhomogeneous media; Performance analysis; Virtual colonoscopy;

fLanguage

English

Journal_Title

Neural Networks, IEEE Transactions on

Publisher

ieee

ISSN

1045-9227

Type

jour

DOI

10.1109/72.728362

Filename

728362