Title :
Knowledge-Based Baseline Detection and Optimal Thresholding for Words Segmentation in Efficient Pre-Processing of Handwritten Arabic Text
Author :
AlKhateeb, Jawad H. ; Ren, Jinchang ; Ipson, Stan S. ; Jiang, Jianmin
Author_Institution :
Univ. of Bradford, Bradford
Abstract :
Techniques on detecting baseline and segmenting words in handwritten Arabic text are presented in this paper. Instead of using pure projection, knowledge of the location of the baseline is utilized for accurate baseline detection. Then, distances between words and subwords are respectively analyzed, and their statistical distributions are obtained to decide an optimal threshold in segmenting words. Results on IFN/ENIT database have validated our methods in terms of improved baseline detection and words segmentation for further recognition.
Keywords :
handwritten character recognition; knowledge based systems; natural language processing; text analysis; IFN/ENIT database; handwritten Arabic text; knowledge based baseline detection; optimal thresholding; words segmentation; Character recognition; Handwriting recognition; Image databases; Image edge detection; Image segmentation; Informatics; Information technology; Pixel; Statistical distributions; Text recognition; Arabic Handwriting; Baseline; Horizontal and Vertical Projection; Segmentation;
Conference_Titel :
Information Technology: New Generations, 2008. ITNG 2008. Fifth International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
0-7695-3099-0
DOI :
10.1109/ITNG.2008.71