DocumentCode
2199550
Title
Lexicon reduction in an framework based on quantized feature vectors
Author
Kaufmann, G. ; Bunke, H. ; Hadorn, M.
Author_Institution
Inst. fur Inf. und Angewandte Math., Berne Univ., Switzerland
Volume
2
fYear
1997
fDate
18-20 Aug 1997
Firstpage
1097
Abstract
For many applications in cursive script recognition the vocabulary is restricted to a small and fixed set of words. In a recognition approach such as Hidden Markov models, for each of these words a model is constructed and trained. In the recognition task an unknown pattern is matched with all of these models to find the most likely class. In this paper, we describe a method for reducing the size of vocabulary depending on the actual input. In contrast to many other techniques, our approach does not use any topological features. The reduction system is directly based on the quantized feature vectors which are used as input for the HMMs. Thus very little additional work is required for lexicon reduction. The proposed approach was successfully tested on two different systems with small lexicons. In both cases the lexicon could be reduced to 25% of its original size without increasing the error rate
Keywords
feature extraction; hidden Markov models; optical character recognition; cursive script recognition; framework; hidden Markov models; lexicon reduction; quantized feature vectors; reduction system; vocabulary; vocabulary depending approach; Error analysis; Feature extraction; Hidden Markov models; Pattern matching; Pattern recognition; System testing; Text recognition; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 1997., Proceedings of the Fourth International Conference on
Conference_Location
Ulm
Print_ISBN
0-8186-7898-4
Type
conf
DOI
10.1109/ICDAR.1997.620678
Filename
620678
Link To Document