DocumentCode :
3180268
Title :
A Bayesian-based probabilistic model for unconstrained handwritten offline Chinese text line recognition
Author :
Li, Nanxi ; Jin, Lianwen
Author_Institution :
Sch. of Electron. & Inf. Eng., South China Univ. of Technol., Guangzhou, China
fYear :
2010
fDate :
10-13 Oct. 2010
Firstpage :
3664
Lastpage :
3668
Abstract :
A Bayesian-based probabilistic model is presented for unconstrained handwritten offline Chinese text line recognition. After pre-segmentation of a text line, plenty of invalid characters are produced which heavily interfere in the process of text line recognition. The proposed probabilistic model can incorporate isolated character recognition, character sample verification, and n-gram language model in a simple way, leading to more reliable recognition of a text line. When testing on HIT-MW database, experiments show that the proposed method can achieve character-level recognition accuracies of 63.19% without language model and 73.97% with bi-gram language model, respectively, outperforming the most recent results testing on the same dataset.
Keywords :
belief networks; handwritten character recognition; natural language processing; probability; text analysis; Bayesian-based probabilistic model; HIT-MW database; isolated character recognition; unconstrained handwritten offline Chinese text line recognition; Bayesian methods; Handwriting recognition; Text recognition; Chinese text line recognition; confidence measurement; handwritten Chinese character recognition; invalid character; verification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems Man and Cybernetics (SMC), 2010 IEEE International Conference on
Conference_Location :
Istanbul
ISSN :
1062-922X
Print_ISBN :
978-1-4244-6586-6
Type :
conf
DOI :
10.1109/ICSMC.2010.5641873
Filename :
5641873
Link To Document :
بازگشت