• DocumentCode
    2509699
  • Title

    HMM-based Word Spotting in Handwritten Documents Using Subword Models

  • Author

    Fischer, Andreas ; Keller, Andreas ; Frinken, Volkmar ; Bunke, Horst

  • Author_Institution
    Inst. of Comput. Sci. & Appl. Math., Univ. of Bern, Bern, Switzerland
  • fYear
    2010
  • fDate
    23-26 Aug. 2010
  • Firstpage
    3416
  • Lastpage
    3419
  • Abstract
    Handwritten word spotting aims at making document images amenable to browsing and searching by keyword retrieval. In this paper, we present a word spotting system based on Hidden Markov Models (HMM) that uses trained subword models to spot keywords. With the proposed method, arbitrary keywords can be spotted that do not need to be present in the training set. Also, no text line segmentation is required. On the modern IAM off-line database and the historical George Washington database we show that the proposed system outperforms a standard template matching approach based on dynamic time warping (DTW).
  • Keywords
    handwriting recognition; hidden Markov models; information retrieval; word processing; HMM-based word spotting system; IAM offline database; arbitrary keywords; dynamic time warping; handwritten documents; hidden Markov models; historical George Washington database; keyword retrieval; subword models; Adaptation model; Databases; Feature extraction; Handwriting recognition; Hidden Markov models; Image segmentation; Training; Handwriting recognition; Hidden Markov models;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2010 20th International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4244-7542-1
  • Type

    conf

  • DOI
    10.1109/ICPR.2010.834
  • Filename
    5597524