DocumentCode :
1580539
Title :
Word segmentation in handwritten Korean text lines based on gap clustering techniques
Author :
Kim, Soo H. ; Jeong, S. ; Lee, Guee-Sang ; Suen, Ching Y.
Author_Institution :
Centre for Pattern Recognition & Machine Intelligence, Concordia Univ., Montreal, Que., Canada
fYear :
2001
fDate :
6/23/1905 12:00:00 AM
Firstpage :
189
Lastpage :
193
Abstract :
We propose a word segmentation method for handwritten Korean text lines. It uses gap information to separate a text line into word units, where the gap is defined as a white-run obtained after a vertical projection of the line image. Each gap is classified into a between-word gap or a within-word gap using a clustering technique. We take up three gap metrics - the bounding box (BB), run-length/Euclidean (RLE) and convex hull (CH) distances - which are known to have superior performance in Roman-style word segmentation, and three clustering techniques - the average linkage method, the modified MAX method and sequential clustering. An experiment with 498 text-line images extracted from live mail pieces has shown that the best performance is obtained by the sequential clustering technique using all three gap metrics
Keywords :
document image processing; handwritten character recognition; image segmentation; mailing systems; pattern clustering; runlength codes; software performance evaluation; average linkage method; between-word gap; bounding box distance; convex hull distance; gap clustering techniques; handwritten Korean text lines; line image vertical projection; mail pieces; modified MAX method; performance; run-length/Euclidean distance; sequential clustering technique; white-run; within-word gap; word segmentation; word units; Character recognition; Computer science; Couplings; Handwriting recognition; Humans; Image segmentation; Machine intelligence; Pattern recognition; Postal services; Text recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7695-1263-1
Type :
conf
DOI :
10.1109/ICDAR.2001.953781
Filename :
953781
Link To Document :
بازگشت