Title :
Stroke Detector and Structure Based Models for Character Recognition: A Comparative Study
Author :
Cun-Zhao Shi ; Song Gao ; Meng-Tao Liu ; Cheng-Zuo Qi ; Chun-Heng Wang ; Bai-Hua Xiao
Author_Institution :
State Key Lab. of Manage. & Control for Complex Syst., Inst. of Autom., Beijing, China
Abstract :
Characters, which are man-made symbols composed of strokes arranged in a certain structure, could provide semantic information and play an indispensable role in our daily life. In this paper, we try to make use of the intrinsic characteristics of characters and explore the stroke and structure-based methods for character recognition. First, we introduce two existing part-based models to recognize characters by detecting the elastic strokelike parts. In order to utilize strokes of various scales, we propose to learn the discriminative multi-scale stroke detector-based representation (DMSDR) for characters. However, the part-based models and DMSDR need to manually label the parts or key points for training. In order to learn the discriminative stroke detectors automatically, we further propose the discriminative spatiality embedded dictionary learning-based representation (DSEDR) for character recognition. We make a comparative study of the performance of the tree-structured model (TSM), mixtures-of-parts TSM, DMSDR, and DSEDR for character recognition on three challenging scene character recognition (SCR) data sets as well as two handwritten digits recognition data sets. A series of experiments is done on these data sets with various experimental setup. The experimental results demonstrate the suitability of stroke detector-based models for recognizing characters with deformations and distortions, especially in the case of limited training samples.
Keywords :
handwritten character recognition; trees (mathematics); DMSDR; DSEDR; SCR data sets; TSM; discriminative multiscale stroke detector-based representation; discriminative spatiality embedded dictionary learning-based representation; handwritten digits recognition data sets; man-made symbols; part-based models; scene character recognition data sets; structure based models; tree-structured model; Character recognition; Deformable models; Detectors; Feature extraction; Text recognition; Thyristors; Training; Character recognition; character recognition; part-based model; spatiality embedded codeword; stroke detector; structure; tree-structure;
Journal_Title :
Image Processing, IEEE Transactions on
DOI :
10.1109/TIP.2015.2473105