DocumentCode
2263682
Title
Character prototype selection for handwriting recognition in historical documents
Author
Fischer, Andreas ; Bunke, Horst
Author_Institution
Inst. of Comput. Sci. & Appl. Math., Univ. of Bern, Bern, Switzerland
fYear
2011
fDate
Aug. 29 2011-Sept. 2 2011
Firstpage
1435
Lastpage
1439
Abstract
Handwriting recognition in historical documents is vital for making scanned manuscript images amenable to searching and browsing in digital libraries. A valuable source of information is given by the basic character shapes that vary greatly for different manuscripts. Typically, character prototype images are extracted manually for bootstrapping a recognition system. This process, however, is time-consuming and the resulting prototypes may not cover all writing styles. In this paper, we propose an automatic character prototype selection method based on a forced alignment using Hidden Markov Models (HMM) and graph matching. Besides the predominant character shape given by the median or center graph, structurally different additional prototypes are retrieved with spanning and k-centers prototype selection. On the historical Parzival data set, it is demonstrated that the proposed automatic selection outperforms a manual selection for handwriting recognition with graph similarity features.
Keywords
digital libraries; document image processing; graph theory; handwriting recognition; handwritten character recognition; hidden Markov models; history; image matching; online front-ends; optical character recognition; statistical analysis; HMM; automatic character prototype selection method; bootstrapping; center graph matching; character prototype images; character prototype selection; digital libraries; forced alignment; handwriting recognition; hidden Markov model; historical documents; k-center prototype selection; scanned manuscript image feature extraction; Character recognition; Feature extraction; Handwriting recognition; Hidden Markov models; Manuals; Prototypes; Shape;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2011 19th European
Conference_Location
Barcelona
ISSN
2076-1465
Type
conf
Filename
7073854
Link To Document