Title :
Segmentation of touching characters in printed Korean/English document recognition
Author :
Kim, Jh-Ho ; Kim, Kye-Kyung ; Chien, Sung-Il ; Choi, Heung-Moon
Author_Institution :
Dept. of Electron. Eng., Kyungpook Sanup Univ., Kyungsan, South Korea
Abstract :
We present an algorithm for efficient segmentation of the touching characters in printed Korean and English document recognition. We derived two rules to segment touching characters in the bilingual document, one from the shape differences in writing blocks defined in this paper between Korean and English characters, and the other from the reliability factor values generated by the classifiers. The proposed method significantly improves the ability of segmentation and recognition of the actual mixed Korean and English documents
Keywords :
document image processing; image classification; image segmentation; optical character recognition; reliability; OCR; bilingual document; printed English document; printed Korean document; reliability factor values; segmentation; shape differences; touching character recognition; Character recognition; Head; Histograms; Image segmentation; Merging; Natural languages; Shape; Writing;
Conference_Titel :
Systems, Man, and Cybernetics, 1996., IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
0-7803-3280-6
DOI :
10.1109/ICSMC.1996.569813