DocumentCode
594767
Title
A confidence-based method for keyword spotting in online Chinese handwritten documents
Author
Heng Zhang ; Da-Han Wang ; Cheng-Lin Liu
Author_Institution
Nat. Lab. of Pattern Recognition (NLPR), Inst. of Autom., Beijing, China
fYear
2012
fDate
11-15 Nov. 2012
Firstpage
525
Lastpage
528
Abstract
In keyword spotting from handwritten documents, the word similarity is usually computed by combining character similarities. Converting similarity to probabilistic confidence is beneficial for context fusion and threshold selection. In this paper, we propose to directly estimate the posterior probability of candidate characters based on the N-best paths from the segmentation-recognitioin candidate lattice. The N-best path scores are converted to confidence measure (CM) using soft-max, and the posterior probability of candidate characters is the summation of confidence measures of paths that pass the candidate character. The parameter for CM is optimized using the binary cross-entropy criterion. Experimental results on database CASIA-OLHWDB demonstrate the effectiveness of the proposed method.
Keywords
document handling; handwritten character recognition; image segmentation; natural language processing; probability; string matching; N-best paths; binary cross-entropy criterion; candidate character; confidence measurement; confidence-based method; context fusion; database CASIA-OLHWDB; keyword spotting; online Chinese handwritten documents; posterior probability; posterior probability estimation; probabilistic confidence; segmentation-recognitioin candidate lattice; soft-max; threshold selection; word similarity; Character recognition; Context; Handwriting recognition; Lattices; Text recognition; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition (ICPR), 2012 21st International Conference on
Conference_Location
Tsukuba
ISSN
1051-4651
Print_ISBN
978-1-4673-2216-4
Type
conf
Filename
6460187
Link To Document