• DocumentCode
    2263682
  • Title

    Character prototype selection for handwriting recognition in historical documents

  • Author

    Fischer, Andreas ; Bunke, Horst

  • Author_Institution
    Inst. of Comput. Sci. & Appl. Math., Univ. of Bern, Bern, Switzerland
  • fYear
    2011
  • fDate
    Aug. 29 2011-Sept. 2 2011
  • Firstpage
    1435
  • Lastpage
    1439
  • Abstract
    Handwriting recognition in historical documents is vital for making scanned manuscript images amenable to searching and browsing in digital libraries. A valuable source of information is given by the basic character shapes that vary greatly for different manuscripts. Typically, character prototype images are extracted manually for bootstrapping a recognition system. This process, however, is time-consuming and the resulting prototypes may not cover all writing styles. In this paper, we propose an automatic character prototype selection method based on a forced alignment using Hidden Markov Models (HMM) and graph matching. Besides the predominant character shape given by the median or center graph, structurally different additional prototypes are retrieved with spanning and k-centers prototype selection. On the historical Parzival data set, it is demonstrated that the proposed automatic selection outperforms a manual selection for handwriting recognition with graph similarity features.
  • Keywords
    digital libraries; document image processing; graph theory; handwriting recognition; handwritten character recognition; hidden Markov models; history; image matching; online front-ends; optical character recognition; statistical analysis; HMM; automatic character prototype selection method; bootstrapping; center graph matching; character prototype images; character prototype selection; digital libraries; forced alignment; handwriting recognition; hidden Markov model; historical documents; k-center prototype selection; scanned manuscript image feature extraction; Character recognition; Feature extraction; Handwriting recognition; Hidden Markov models; Manuals; Prototypes; Shape;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2011 19th European
  • Conference_Location
    Barcelona
  • ISSN
    2076-1465
  • Type

    conf

  • Filename
    7073854