DocumentCode :
3479542
Title :
The utilization of closing algorithm and heuristic information for broken character segmentation
Author :
Peerawit, P. ; Yingsaeree, W. ; Kawtrakul, A.
Author_Institution :
Dept. of Comput. Eng., Kasetsart Univ., Bangkok
Volume :
2
fYear :
2004
fDate :
1-3 Dec. 2004
Firstpage :
775
Lastpage :
779
Abstract :
In Thai printed character recognition systems, broken characters are an important problem that decreases accuracy, because they can cause errors in the segmentation process. To solve this problem, a method for broken character segmentation in Thai printed documents is presented. It consists of two main steps: text line detection, which extracts text lines from an image, and character segmentation, which extracts broken characters from a text line. Character segmentation consists of four steps: gap reduction using a closing algorithm, character segmentation using space, large character splitting and small character merging using heuristic information. The advantage of this approach is its ability to segment a broken character even when it is split into a large number of segments. The experimental results show that our method achieves 91.09%
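As a rough illustration of the first two character segmentation steps named in the abstract (gap reduction by closing, then segmentation using space), the following is a minimal sketch and not the authors' implementation. It assumes a binary text-line image stored as a list of rows (1 = ink, 0 = background), a horizontal 1 x k structuring element, and clipped borders; the function names and the value of k are hypothetical. The heuristic splitting and merging steps are not shown because the abstract does not specify them.

```python
def dilate_h(img, k):
    """Horizontal dilation with a 1 x k structuring element (k odd)."""
    r = k // 2
    w = len(img[0])
    return [[1 if any(row[max(0, x - r):min(w, x + r + 1)]) else 0
             for x in range(w)] for row in img]


def erode_h(img, k):
    """Horizontal erosion with a 1 x k structuring element (k odd).

    The window is clipped at the image border, so pixels outside the
    image are ignored rather than treated as background.
    """
    r = k // 2
    w = len(img[0])
    return [[1 if all(row[max(0, x - r):min(w, x + r + 1)]) else 0
             for x in range(w)] for row in img]


def close_h(img, k):
    """Morphological closing: dilation followed by erosion."""
    return erode_h(dilate_h(img, k), k)


def segment_by_space(img):
    """Split a text line at blank columns.

    Returns (start, end) column ranges, end exclusive, one per run of
    columns that contain at least one ink pixel.
    """
    w = len(img[0])
    has_ink = [any(row[x] for row in img) for x in range(w)]
    segments, start = [], None
    for x, ink in enumerate(has_ink):
        if ink and start is None:
            start = x
        elif not ink and start is not None:
            segments.append((start, x))
            start = None
    if start is not None:
        segments.append((start, w))
    return segments


# Hypothetical text line: a character broken by a 1-pixel gap (column 2),
# followed by a 4-pixel inter-character space (columns 5-8).
line = [
    [1, 1, 0, 1, 1, 0, 0, 0, 0, 1, 1],
    [1, 0, 0, 0, 1, 0, 0, 0, 0, 1, 0],
]

before = segment_by_space(line)             # broken character yields two pieces
after = segment_by_space(close_h(line, 3))  # closing bridges the 1-pixel gap
```

In this toy input, closing with k = 3 bridges the one-pixel break while leaving the wider space between characters intact, so the broken character comes out as a single segment. A larger k would close wider breaks but risks joining neighbouring characters, which is presumably where the splitting and merging heuristics come in.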
Keywords :
character recognition; document image processing; feature extraction; image segmentation; Thai printed character recognition system; broken character segmentation; gap reduction; heuristic information; large character splitting; morphological closing algorithm; small character merging; text extraction; text line detection; Character recognition; Computer errors; Data mining; Heuristic algorithms; Humans; Image segmentation; Merging; Peer to peer computing; Performance analysis; Printing;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Cybernetics and Intelligent Systems, 2004 IEEE Conference on
Conference_Location :
Singapore
Print_ISBN :
0-7803-8643-4
Type :
conf
DOI :
10.1109/ICCIS.2004.1460686
Filename :
1460686