• DocumentCode
    3489969
  • Title

    Bag-of-Features HMMs for Segmentation-Free Word Spotting in Handwritten Documents

  • Author

    Rothacker, Leonard ; Rusinol, Marcal ; Fink, Glenn A.

  • Author_Institution
    Dept. of Comput. Sci., Tech. Univ. Dortmund, Dortmund, Germany
  • fYear
    2013
  • fDate
    25-28 Aug. 2013
  • Firstpage
    1305
  • Lastpage
    1309
  • Abstract
    Recent HMM-based approaches to handwritten word spotting require large amounts of learning samples and mostly rely on a prior segmentation of the document. We propose to use Bag-of-Features HMMs in a patch-based segmentation-free framework that are estimated by a single sample. Bag-of-Features HMMs use statistics of local image feature representatives. Therefore they can be considered as a variant of discrete HMMs allowing to model the observation of a number of features at a point in time. The discrete nature enables us to estimate a query model with only a single example of the query provided by the user. This makes our method very flexible with respect to the availability of training data. Furthermore, we are able to outperform state-of-the-art results on the George Washington dataset.
  • Keywords
    document image processing; feature extraction; handwriting recognition; hidden Markov models; image representation; image retrieval; image segmentation; George Washington dataset; bag-of-feature HMM; discrete HMM; handwritten document; handwritten word spotting; hidden Markov models; local image feature representatives; patch-based segmentation-free framework; query model; segmentation-free word spotting; training data; Feature extraction; Handwriting recognition; Hidden Markov models; Image segmentation; Training; Vectors; Visualization; Bag-of-Features; Handwritten Word Spotting; Hidden Markov Models; Segmentation-free Word Spotting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2013.264
  • Filename
    6628825