• DocumentCode
    2199550
  • Title

    Lexicon reduction in an framework based on quantized feature vectors

  • Author

    Kaufmann, G. ; Bunke, H. ; Hadorn, M.

  • Author_Institution
    Inst. fur Inf. und Angewandte Math., Berne Univ., Switzerland
  • Volume
    2
  • fYear
    1997
  • fDate
    18-20 Aug 1997
  • Firstpage
    1097
  • Abstract
    For many applications in cursive script recognition the vocabulary is restricted to a small and fixed set of words. In a recognition approach such as Hidden Markov models, for each of these words a model is constructed and trained. In the recognition task an unknown pattern is matched with all of these models to find the most likely class. In this paper, we describe a method for reducing the size of vocabulary depending on the actual input. In contrast to many other techniques, our approach does not use any topological features. The reduction system is directly based on the quantized feature vectors which are used as input for the HMMs. Thus very little additional work is required for lexicon reduction. The proposed approach was successfully tested on two different systems with small lexicons. In both cases the lexicon could be reduced to 25% of its original size without increasing the error rate
  • Keywords
    feature extraction; hidden Markov models; optical character recognition; cursive script recognition; framework; hidden Markov models; lexicon reduction; quantized feature vectors; reduction system; vocabulary; vocabulary depending approach; Error analysis; Feature extraction; Hidden Markov models; Pattern matching; Pattern recognition; System testing; Text recognition; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 1997., Proceedings of the Fourth International Conference on
  • Conference_Location
    Ulm
  • Print_ISBN
    0-8186-7898-4
  • Type

    conf

  • DOI
    10.1109/ICDAR.1997.620678
  • Filename
    620678