Fast HMM-Filler Approach for Key Word Spotting in Handwritten Documents

Author

Toselli, Alejandro Hector ; Vidal, Enrique

Author_Institution

Univ. Politec. de Valencia, Valencia, Spain

fYear

2013

fDate

25-28 Aug. 2013

Firstpage

501

Lastpage

505

Abstract

The so-called filler or garbage Hidden Markov Models (HMM) are among the most widely used models for lexicon-free, query by string key word spotting in the fields of speech recognition and (lately) handwritten text recognition. An important drawback of this approach is the large computational cost of the keyword-specific HMM Viterbi decoding process needed to obtain the confidence scores of each word to be spotted. This paper presents a novel way to compute such confidence scores, directly from character lattices produced during a single Viterbi decoding process using only the "filler" model (i.e. no explicit keyword-specific decoding is needed). Experiments show that, as compared with the classical HMM-filler approach, the proposed method obtains essentially the same spotting results, while requiring between one and two orders of magnitude less query computing time.

Keywords

Viterbi decoding; document image processing; handwriting recognition; hidden Markov models; information retrieval; HMM Viterbi decoding process; character lattices; fast HMM-filler approach; garbage Hidden Markov Models; handwritten documents; handwritten text recognition; query computing time; single Viterbi decoding process; speech recognition; string key word spotting; Computational modeling; Decoding; Handwriting recognition; Hidden Markov models; Indexing; Training; Viterbi algorithm; Character Lattice; HMM-Filler Model; Spotting;

fLanguage

English

Publisher

ieee

Conference_Titel

Document Analysis and Recognition (ICDAR), 2013 12th International Conference on

Conference_Location

Washington, DC

ISSN

1520-5363

Type

conf

DOI

10.1109/ICDAR.2013.106

Filename

6628671