DocumentCode
2021768
Title
A High Performance European OCR System
Author
Wang, Kai ; Wang, Qingren
Author_Institution
Nankai Univ., Tianjin
Volume
1
fYear
2007
fDate
23-26 Sept. 2007
Firstpage
232
Lastpage
236
Abstract
The construction of Latin based European OCR system is studied in this paper. Compared with English, other Latin based European languages use more characters, which is called European special characters in this paper to be distinct from English letters. To construct a European system with high performance, the key is the recognition of the European special characters. In this paper, the European special characters are automatically divided into three subsets by the different handwritten position. And two solutions are proposed, one solution in which is used to recognize "i", "j " and the European special characters in subset 1, while another solution is used to recognize other English characters, digits and the European special character in other subsets. Experiment shows, the new system is more effective than the old one, which provides an experimental support for our research work.
Keywords
handwritten character recognition; natural language processing; optical character recognition; European OCR system; European special character; handwritten position; optical character recognition; Character recognition; Entropy; Machine intelligence; Natural languages; Optical character recognition software; Text analysis; Typesetting; Uncertainty;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
Conference_Location
Parana
ISSN
1520-5363
Print_ISBN
978-0-7695-2822-9
Type
conf
DOI
10.1109/ICDAR.2007.4378710
Filename
4378710
Link To Document