Title :
High-precision two-kernel Chinese character recognition in general document processing systems
Author :
Zhao, San-Lung ; Lee, Hsi-Jian
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
fDate :
6/23/1905 12:00:00 AM
Abstract :
This paper proposes a general Chinese document recognition system with high recognition rate, including preprocessing, recognition kernel, and post-processing, especially for low quality images. In the preprocessing module, fast rotation transformation algorithm is proposed. Since characters are extracted for recognition engines, document images must be segmented into text blocks, text lines, and then character images. In the recognition module, two recognition engines are used to recognize the character images. The weights of these kernels and features are calculated from the relative stroke widths of character images. In the post-processing module, we calculate confidence values for different candidates and then select the most confident candidate as the OCR result. The experiments show the system we propose is very effective and efficient
Keywords :
document image processing; feature extraction; optical character recognition; Chinese character recognition; candidate selection; character segmentation; document recognition system; feature extraction; optical character recognition; preprocessing module; recognition kernel; text blocks; two-kernel recognition engine; Character recognition; Computer science; Data mining; Engines; Image quality; Image recognition; Image segmentation; Kernel; Optical character recognition software; Text recognition;
Conference_Titel :
Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7695-1263-1
DOI :
10.1109/ICDAR.2001.953863