Title :
Semi-automatic Tibetan Component Annotation from Online Handwritten Tibetan Character Database by Optimizing Segmentation Hypotheses
Author :
Long-Long Ma ; Jian Wu
Author_Institution :
Nat. Eng. Res. Center of Fundamental Software, Inst. of Software, Beijing, China
Abstract :
One of important steps in hybrid statistical-structural recognition method for handwritten characters is to label primitives for classifier training and label structural position information for structural recognition. In this paper, we propose a semi-automatic component (primitive) annotation method for online handwritten Tibetan character database. All samples of each character class are over-segmented into sub-structure block sequences. We select correct segmentation points from one of segmented character samples and get component templates of this character class. Other samples of the same character class with sub-structure block sequences are matched with the component templates by optimizing segmentation hypotheses strategy. Character samples segmented by error are re-annotated with minimal human effort at semi-automatic re-annotation module. At last we measure the performance of our component-based recognition method on the character database with component annotation for reference.
Keywords :
handwriting recognition; image segmentation; natural language processing; optimisation; visual databases; component based recognition method; handwritten characters; hybrid statistical structural recognition method; online handwritten Tibetan character database; optimizing segmentation hypotheses; semiautomatic Tibetan component annotation; structural position information; Accuracy; Character recognition; Databases; Feature extraction; Handwriting recognition; Optimization; component; optimizing segmentation hypotheses; over-segmentation; semi-automatic annotation;
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
DOI :
10.1109/ICDAR.2013.271