DocumentCode :
1153375
Title :
A PAC-Bayesian margin bound for linear classifiers
Author :
Herbrich, Ralf ; Graepel, Thore
Author_Institution :
Dept. of Stat. & Bus. Math., Technische Univ. Berlin, Germany
Volume :
48
Issue :
12
fYear :
2002
fDate :
12/1/2002
Firstpage :
3140
Lastpage :
3150
Abstract :
We present a bound on the generalization error of linear classifiers in terms of a refined margin quantity on the training sample. The result is obtained in a probably approximately correct (PAC)-Bayesian framework and is based on geometrical arguments in the space of linear classifiers. The new bound constitutes an exponential improvement over the tightest margin bound known so far, which was developed in the luckiness framework, and it scales logarithmically in the inverse margin. Even with fewer training examples than input dimensions, sufficiently large margins lead to nontrivial bound values and, for maximum margins, to a vanishing complexity term. In contrast to previous results, however, the new bound does depend on the dimensionality of the feature space. The analysis shows that the classical margin is too coarse a measure of the essential quantity that controls the generalization error: the fraction of hypothesis space consistent with the training sample. The practical relevance of the result lies in the fact that the well-known support vector machine (SVM) is optimal with respect to the new bound only if the feature vectors in the training sample are all of the same length. As a consequence, we recommend using SVMs on normalized feature vectors only. Numerical simulations support this recommendation and demonstrate that the new error bound can be used for model selection.
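Editorial note (a schematic sketch, not verbatim from the paper): a bound of the form consistent with the abstract's claims, for a training sample of size m, feature-space dimension d, normalized margin \Gamma \in (0, 1] (the margin of w divided by the lengths of w and the feature vectors), and confidence 1 - \delta, is

  R(w) \lesssim \frac{2}{m}\left( d \ln\frac{1}{1 - \sqrt{1 - \Gamma^2}} + 2\ln m + \ln\frac{1}{\delta} + 2 \right).

Since 1 - \sqrt{1 - \Gamma^2} \approx \Gamma^2 / 2 for small \Gamma, the complexity term grows only logarithmically in the inverse margin 1/\Gamma; at the maximum margin \Gamma = 1 the logarithm equals \ln 1 = 0 and the complexity term vanishes, matching the abstract. The dependence on d is explicit, and for large \Gamma the bound stays nontrivial even when m < d.

The practical recommendation, training SVMs on length-normalized feature vectors only, can be sketched as follows; the data and parameters are illustrative, and scikit-learn is assumed purely for convenience:

  # Minimal sketch of the abstract's recommendation: normalize each feature
  # vector to unit length before fitting an SVM. Toy data, not from the paper.
  import numpy as np
  from sklearn.pipeline import make_pipeline
  from sklearn.preprocessing import Normalizer  # maps each sample x to x / ||x||_2
  from sklearn.svm import SVC

  rng = np.random.default_rng(0)
  X = rng.normal(size=(100, 20))                # 100 samples, 20 features
  y = np.sign(X[:, 0] + 0.1 * rng.normal(size=100))

  # With all training vectors of the same (unit) length, the SVM solution is
  # optimal with respect to the new bound, per the abstract.
  clf = make_pipeline(Normalizer(norm="l2"), SVC(kernel="linear", C=1.0))
  clf.fit(X, y)
  print(clf.score(X, y))                        # training accuracy of the sketch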
Keywords :
Bayes methods; error analysis; learning (artificial intelligence); probability; signal classification; PAC-Bayesian margin bound; complexity; error bound; exponential improvement; feature vectors; generalization error; geometrical arguments; hypothesis space; input dimensions; inverse margin; linear classifiers; luckiness framework; machine learning; maximum margins; model selection; nontrivial bound values; normalized feature vectors; numerical simulations; probably approximately correct; refined margin quantity; support vector machines; tightest margin bound; training sample;
fLanguage :
English
Journal_Title :
IEEE Transactions on Information Theory
Publisher :
IEEE
ISSN :
0018-9448
Type :
jour
DOI :
10.1109/TIT.2002.805090
Filename :
1077809