Title :
Integrating Language Model in Handwritten Chinese Text Recognition
Author :
Wang, Qiu-Feng ; Yin, Fei ; Liu, Cheng-Lin
Author_Institution :
Nat. Lab. of Pattern Recognition (NLPR), Chinese Acad. of Sci., Beijing, China
Abstract :
This paper describes a system for handwritten Chinese text recognition integrating language model. On a text line image, the system generates character segmentation and word segmentation candidates, and the candidate paths are evaluated by character recognition scores and language model. The optimal path, giving segmentation and recognition result, is found using a pruned dynamic programming search method. We evaluate various language models, including the character-based n-gram, word-based n-gram, and hybrid n-gram models. Experimental results on the HIT-HW database show that the language models improve the recognition performance remarkably.
Keywords :
dynamic programming; handwritten character recognition; image recognition; image segmentation; natural languages; search problems; statistical analysis; text analysis; HIT-HW database; character recognition score; character segmentation candidate; character-based n-gram; handwritten Chinese text recognition; hybrid n-gram model; pruned dynamic programming search method; statistical language model; text line image; word segmentation candidate; word-based n-gram; Character generation; Character recognition; Dynamic programming; Handwriting recognition; Image segmentation; Lattices; Natural languages; Pattern recognition; Text analysis; Text recognition;
Conference_Titel :
Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4244-4500-4
Electronic_ISBN :
1520-5363
DOI :
10.1109/ICDAR.2009.96