Title :
Automatic Labeling for Scene Text Database
Author :
Iwamura, Mikio ; Tsukada, Manabu ; Kise, Kenji
Author_Institution :
Grad. Sch. of Eng., Osaka Prefecture Univ., Sakai, Japan
Abstract :
It is thought that a large quantity of data improve quality of recognition. A large database, however, is not easy to obtain. The hardest task is labeling (also known as ground truthing), which usually requires human intervention. Since labeling by human is laborious and costly, labeling without human (automatic labeling) or minimization of human intervention (semi-automatic labeling) are ideal scenarios. As a step toward realization of the scenarios, knowing how much an automatic labeling system can perform without human intervention is important. In the current paper we propose a comprehensive automatic labeling technique for a scene text database, which performs segmentation and labeling for unsegmented and unlabeled character images. To our best knowledge, this is the first method to realize the comprehensive process for automatic labeling for scene text databases In experiments, we confirmed that the proposed method could add new unlabeled data in parallel with improving recognition performance of the classifier.
Keywords :
character recognition; image classification; image segmentation; visual databases; automatic labeling system; classifier; ground truthing; scene text database; unlabeled character image segmentation; unsegmented character image labelling; Character recognition; Databases; Estimation; Feature extraction; Image segmentation; Labeling; Training; Automatic Labeling; Labeling by Recognition; Scene Character Recognition; Segmentation by Recognition; Semi-supervised Learning;
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
DOI :
10.1109/ICDAR.2013.276