• DocumentCode
    2021768
  • Title

    A High Performance European OCR System

  • Author

    Wang, Kai ; Wang, Qingren

  • Author_Institution
    Nankai Univ., Tianjin
  • Volume
    1
  • fYear
    2007
  • fDate
    23-26 Sept. 2007
  • Firstpage
    232
  • Lastpage
    236
  • Abstract
    The construction of Latin based European OCR system is studied in this paper. Compared with English, other Latin based European languages use more characters, which is called European special characters in this paper to be distinct from English letters. To construct a European system with high performance, the key is the recognition of the European special characters. In this paper, the European special characters are automatically divided into three subsets by the different handwritten position. And two solutions are proposed, one solution in which is used to recognize "i", "j " and the European special characters in subset 1, while another solution is used to recognize other English characters, digits and the European special character in other subsets. Experiment shows, the new system is more effective than the old one, which provides an experimental support for our research work.
  • Keywords
    handwritten character recognition; natural language processing; optical character recognition; European OCR system; European special character; handwritten position; optical character recognition; Character recognition; Entropy; Machine intelligence; Natural languages; Optical character recognition software; Text analysis; Typesetting; Uncertainty;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
  • Conference_Location
    Parana
  • ISSN
    1520-5363
  • Print_ISBN
    978-0-7695-2822-9
  • Type

    conf

  • DOI
    10.1109/ICDAR.2007.4378710
  • Filename
    4378710