Title :
A case study on bagging, boosting and basic ensembles of neural networks for OCR
Author_Institution :
IBM Almaden Res. Center, San Jose, CA, USA
Abstract :
We study the effectiveness of three neural network ensembles in improving OCR performance: basic, bagging, and boosting. Three random character degradation models are introduced for training individual networks in order to reduce error correlation between individual network and to improve the generalization ability of neural networks. We compare the recognition accuracies of these three ensembles at various rejection rates. It is shown that although the boosting ensemble is slightly more accurate than the basic and bagging ensembles at zero rejection rate, the advantage of the boosting training over the basic and bagging ensembles quickly disappears as more patterns are rejected. Eventually the basic and bagging ensembles outperform the boosting ensemble at high rejection rates. Explanation of such a phenomenon is provided. We also apply the optimal linear combiner to each of the three ensembles to capture different error correlation characteristics of the three ensembles
Keywords :
error analysis; feedforward neural nets; generalisation (artificial intelligence); learning (artificial intelligence); optical character recognition; OCR; bagging ensemble; basic ensemble; boosting ensemble; error correlation; feedforward neural networks; generalization; learning; optical character recognition; random character degradation models; Bagging; Boosting; Computer aided software engineering; Degradation; Least squares methods; Mean square error methods; Neural networks; Optical character recognition software; Robustness; Training data;
Conference_Titel :
Neural Networks Proceedings, 1998. IEEE World Congress on Computational Intelligence. The 1998 IEEE International Joint Conference on
Conference_Location :
Anchorage, AK
Print_ISBN :
0-7803-4859-1
DOI :
10.1109/IJCNN.1998.687135