Title :
Capacity control in linear classifiers for pattern recognition
Author :
Guyon, I. ; Vapnik, V. ; Boser, B. ; Bottou, L. ; Solla, S.A.
Author_Institution :
AT&T Bell Lab., Holmdel, NJ, USA
fDate :
30 Aug-3 Sep 1992
Abstract :
Achieving good performance in statistical pattern recognition requires matching the capacity of the classifier to the amount of training data. If the classifier has too many adjustable parameters (large capacity), it is likely to learn the training data without difficulty, but will probably not generalize properly to patterns that do not belong to the training set. Conversely, if the capacity of the classifier is not large enough, it might not be able to learn the task at all. In between, there is an optimal classifier capacity which ensures the best expected generalization for a given amount of training data. The method of structural risk minimization (SRM) refers to tuning the capacity of the classifier to the available amount of training data. This paper illustrates the method of SRM with several examples of algorithms. Experiments confirm theoretical predictions of performance improvement in application to handwritten digit recognition
Keywords :
character recognition; learning (artificial intelligence); SRM; handwritten digit recognition; linear classifiers; optimal classifier capacity; pattern recognition; structural risk minimization; training data; training set; tuning; Capacity planning; Error correction; Frequency; Handwriting recognition; Pattern matching; Pattern recognition; Risk management; Symmetric matrices; Testing; Training data;
Conference_Titel :
Pattern Recognition, 1992. Vol.II. Conference B: Pattern Recognition Methodology and Systems, Proceedings., 11th IAPR International Conference on
Conference_Location :
The Hague
Print_ISBN :
0-8186-2915-0
DOI :
10.1109/ICPR.1992.201798