Title :
Tree pruning for output coded ensembles
Author :
Windeatt, T. ; Ardeshir, G.
Author_Institution :
Centre for Vision, Speech & Signal Process., Surrey Univ., Guildford, UK
Abstract :
Output coding is a method of converting a multiclass problem into several binary subproblems and gives an ensemble of binary classifiers. Like other ensemble methods, its performance depends on the accuracy and diversity of base classifiers. If a decision tree is chosen as base classifier the issue of tree pruning needs to be addressed. In this paper we investigate the effect of six methods of pruning on ensembles of trees generated by error-correcting output code (ECOC). Our results show that error-based pruning outperforms on most datasets but it is better not to prune than to select a single pruning strategy for all datasets.
Keywords :
error correction codes; probability; tree data structures; binary classifiers; binary subproblems; error-based pruning; error-correcting output code; multiclass problem; output coded ensembles; tree pruning; Classification tree analysis; Computer vision; Decision trees; Error correction codes; Mathematics; Matrix decomposition; Signal processing; Speech coding; Speech processing; Testing;
Conference_Titel :
Pattern Recognition, 2002. Proceedings. 16th International Conference on
Print_ISBN :
0-7695-1695-X
DOI :
10.1109/ICPR.2002.1048245