DocumentCode
2196355
Title
Strategies for Training Robust Neural Network Based Digit Recognizers on Unbalanced Data Sets
Author
Vajda, Szilárd ; Fink, Gernot A.
Author_Institution
Dept. of Comput. Sci., Tech. Univ. Dortmund, Dortmund, Germany
fYear
2010
fDate
16-18 Nov. 2010
Firstpage
148
Lastpage
153
Abstract
The performance of a neural network in a pattern recognition task may be influenced by several factors. One of these factors is related to the considerable difference between the number of examples belonging to each class to be recognized. The effect called imbalanced data can negatively influence the ability of a recognizer to learn the concept of the minority class. In this work we propose an under-sampling strategy based on selecting samples lying around the decision surface and an over-sampling strategy which uses kernel density estimation to populate the minority class. The experimental results on Roman and Bangla digit data using a neural network based recognizer confirm the effectiveness of the proposed solutions.
Keywords
digital arithmetic; neural nets; pattern recognition; signal sampling; text analysis; decision surface; digit recognizers; kernel density estimation; neural network; over sampling strategy; pattern recognition; training; unbalanced data sets; under-sampling strategy; digit recognition; kernel density estimation; unbalanced data;
fLanguage
English
Publisher
ieee
Conference_Titel
Frontiers in Handwriting Recognition (ICFHR), 2010 International Conference on
Conference_Location
Kolkata
Print_ISBN
978-1-4244-8353-2
Type
conf
DOI
10.1109/ICFHR.2010.30
Filename
5693515
Link To Document