DocumentCode :
594767
Title :
A confidence-based method for keyword spotting in online Chinese handwritten documents
Author :
Heng Zhang ; Da-Han Wang ; Cheng-Lin Liu
Author_Institution :
Nat. Lab. of Pattern Recognition (NLPR), Inst. of Autom., Beijing, China
fYear :
2012
fDate :
11-15 Nov. 2012
Firstpage :
525
Lastpage :
528
Abstract :
In keyword spotting from handwritten documents, the word similarity is usually computed by combining character similarities. Converting similarity to probabilistic confidence is beneficial for context fusion and threshold selection. In this paper, we propose to directly estimate the posterior probability of candidate characters based on the N-best paths from the segmentation-recognitioin candidate lattice. The N-best path scores are converted to confidence measure (CM) using soft-max, and the posterior probability of candidate characters is the summation of confidence measures of paths that pass the candidate character. The parameter for CM is optimized using the binary cross-entropy criterion. Experimental results on database CASIA-OLHWDB demonstrate the effectiveness of the proposed method.
Keywords :
document handling; handwritten character recognition; image segmentation; natural language processing; probability; string matching; N-best paths; binary cross-entropy criterion; candidate character; confidence measurement; confidence-based method; context fusion; database CASIA-OLHWDB; keyword spotting; online Chinese handwritten documents; posterior probability; posterior probability estimation; probabilistic confidence; segmentation-recognitioin candidate lattice; soft-max; threshold selection; word similarity; Character recognition; Context; Handwriting recognition; Lattices; Text recognition; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2012 21st International Conference on
Conference_Location :
Tsukuba
ISSN :
1051-4651
Print_ISBN :
978-1-4673-2216-4
Type :
conf
Filename :
6460187
Link To Document :
بازگشت