• DocumentCode
    3487079
  • Title

    Fast HMM-Filler Approach for Key Word Spotting in Handwritten Documents

  • Author

    Toselli, Alejandro Hector ; Vidal, Enrique

  • Author_Institution
    Univ. Politec. de Valencia, Valencia, Spain
  • fYear
    2013
  • fDate
    25-28 Aug. 2013
  • Firstpage
    501
  • Lastpage
    505
  • Abstract
    The so-called filler or garbage Hidden Markov Models (HMM) are among the most widely used models for lexicon-free, query by string key word spotting in the fields of speech recognition and (lately) handwritten text recognition. An important drawback of this approach is the large computational cost of the keyword-specific HMM Viterbi decoding process needed to obtain the confidence scores of each word to be spotted. This paper presents a novel way to compute such confidence scores, directly from character lattices produced during a single Viterbi decoding process using only the "filler" model (i.e. no explicit keyword-specific decoding is needed). Experiments show that, as compared with the classical HMM-filler approach, the proposed method obtains essentially the same spotting results, while requiring between one and two orders of magnitude less query computing time.
  • Keywords
    Viterbi decoding; document image processing; handwriting recognition; hidden Markov models; information retrieval; HMM Viterbi decoding process; character lattices; fast HMM-filler approach; garbage Hidden Markov Models; handwritten documents; handwritten text recognition; query computing time; single Viterbi decoding process; speech recognition; string key word spotting; Computational modeling; Decoding; Handwriting recognition; Hidden Markov models; Indexing; Training; Viterbi algorithm; Character Lattice; HMM-Filler Model; Spotting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2013.106
  • Filename
    6628671