• DocumentCode
    3487313
  • Title

    Keyword Spotting in Online Chinese Handwritten Documents with Candidate Scoring Based on Semi-CRF Model

  • Author

    Heng Zhang ; Xiang-Dong Zhou ; Cheng-Lin Liu

  • Author_Institution
    Inst. of Autom., Nat. Lab. of Pattern Recognition, Beijing, China
  • fYear
    2013
  • fDate
    25-28 Aug. 2013
  • Firstpage
    567
  • Lastpage
    571
  • Abstract
    For text-query-based keyword spotting from handwritten Chinese documents, the index is usually organized as a candidate lattice to overcome the ambiguity of character segmentation. Each edge in the lattice denotes a candidate character associated with a candidate class. Character similarity (between character and class) scores are calculated on each edge, and the similarity between a query word and handwriting is obtained by combining these edge scores. In this paper, we propose a document indexing method using semi-Markov conditional random fields (semi-CRFs), which provide a principled framework for fusing the information of different contexts. For fast retrieval and to save storage space, the lattice is first purged by a forward-backward pruning approach. On the reduced lattice, we estimate the character similarity scores based on the semi-CRF model. Experimental results on a large handwriting database CASIAOLHWDB justify the effectiveness of the proposed method.
  • Keywords
    Markov processes; document image processing; handwritten character recognition; indexing; lattice theory; query processing; sensor fusion; text analysis; CASIA-OLHWDB database; candidate scoring; character similarity scores; document indexing method; forward-backward pruning; handwriting database; information fusion; lattice; online Chinese handwritten documents; semiCRF model; semiMarkov conditional random fields; text-query-based keyword spotting; Character recognition; Computational modeling; Context; Indexes; Lattices; Pragmatics; Online Chinese handwritten documents; keyword spotting; semi-Markov conditional random fields;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2013.118
  • Filename
    6628683