• DocumentCode
    594767
  • Title

    A confidence-based method for keyword spotting in online Chinese handwritten documents

  • Author

    Heng Zhang ; Da-Han Wang ; Cheng-Lin Liu

  • Author_Institution
    Nat. Lab. of Pattern Recognition (NLPR), Inst. of Autom., Beijing, China
  • fYear
    2012
  • fDate
    11-15 Nov. 2012
  • Firstpage
    525
  • Lastpage
    528
  • Abstract
    In keyword spotting from handwritten documents, the word similarity is usually computed by combining character similarities. Converting similarity to probabilistic confidence is beneficial for context fusion and threshold selection. In this paper, we propose to directly estimate the posterior probability of candidate characters based on the N-best paths from the segmentation-recognitioin candidate lattice. The N-best path scores are converted to confidence measure (CM) using soft-max, and the posterior probability of candidate characters is the summation of confidence measures of paths that pass the candidate character. The parameter for CM is optimized using the binary cross-entropy criterion. Experimental results on database CASIA-OLHWDB demonstrate the effectiveness of the proposed method.
  • Keywords
    document handling; handwritten character recognition; image segmentation; natural language processing; probability; string matching; N-best paths; binary cross-entropy criterion; candidate character; confidence measurement; confidence-based method; context fusion; database CASIA-OLHWDB; keyword spotting; online Chinese handwritten documents; posterior probability; posterior probability estimation; probabilistic confidence; segmentation-recognitioin candidate lattice; soft-max; threshold selection; word similarity; Character recognition; Context; Handwriting recognition; Lattices; Text recognition; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2012 21st International Conference on
  • Conference_Location
    Tsukuba
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4673-2216-4
  • Type

    conf

  • Filename
    6460187