DocumentCode :
52123
Title :
Improving the Generalization Capacity of Cascade Classifiers
Author :
Ludwig, Oswaldo ; Nunes, U. ; Ribeiro, Bernardete ; Premebida, Cristiano
Author_Institution :
Inst. of Syst. & Robot., Univ. of Coimbra Polo II, Coimbra, Portugal
Volume :
43
Issue :
6
fYear :
2013
fDate :
Dec. 2013
Firstpage :
2135
Lastpage :
2146
Abstract :
The cascade classifier is a usual approach in object detection based on vision, since it successively rejects negative occurrences, e.g., background images, in a cascade structure, keeping the processing time suitable for on-the-fly applications. On the other hand, similar to other classifier ensembles, cascade classifiers are likely to have high Vapnik-Chervonenkis (VC) dimension, which may lead to overfitting the training data. Therefore, this work aims at improving the generalization capacity of the cascade classifier by controlling its complexity, which depends on the model of their classifier stages, the number of stages, and the feature space dimension of each stage, which can be controlled by integrating the parameter setting of the feature extractor (in our case an image descriptor) into the maximum-margin framework of support vector machine training, as will be shown in this paper. Moreover, to set the number of cascade stages, bounds on the false positive rate (FP) and on the true positive rate (TP) of cascade classifiers are derived based on a VC-style analysis. These bounds are applied to compose an enveloping receiver operating curve (EROC), i.e., a new curve in the TP-FP space in which each point is an ordered pair of upper bound on the FP and lower bound on the TP. The optimal number of cascade stages is forecasted by comparing EROCs of cascades with different numbers of stages.
Keywords :
feature extraction; generalisation (artificial intelligence); image classification; learning (artificial intelligence); support vector machines; EROC; FP; TP; VC dimension; Vapnik-Chervonenkis dimension; background images; cascade classifiers; cascade structure; classifier ensembles; classifier stages; enveloping receiver operating curve; false positive rate; feature extraction; feature space dimension; generalization capacity; maximum-margin framework; object detection; support vector machine training; true positive rate; Biological cells; Complexity theory; Feature extraction; Object detection; Support vector machines; Training; Training data; Cascade classifier; maximal margin principle; pattern recognition; pedestrian detection; statistical learning;
fLanguage :
English
Journal_Title :
Cybernetics, IEEE Transactions on
Publisher :
ieee
ISSN :
2168-2267
Type :
jour
DOI :
10.1109/TCYB.2013.2240678
Filename :
6459567
Link To Document :
بازگشت