DocumentCode :
3289787
Title :
Knowledge-Based Baseline Detection and Optimal Thresholding for Words Segmentation in Efficient Pre-Processing of Handwritten Arabic Text
Author :
AlKhateeb, Jawad H. ; Ren, Jinchang ; Ipson, Stan S. ; Jiang, Jianmin
Author_Institution :
Univ. of Bradford, Bradford
fYear :
2008
fDate :
7-9 April 2008
Firstpage :
1158
Lastpage :
1159
Abstract :
This paper presents techniques for detecting the baseline and segmenting words in handwritten Arabic text. Instead of relying on pure projection, knowledge of the location of the baseline is used for accurate baseline detection. The distances between words and between subwords are then analyzed separately, and their statistical distributions are used to determine an optimal threshold for segmenting words. Results on the IFN/ENIT database validate our methods in terms of improved baseline detection and word segmentation for subsequent recognition.
Keywords :
handwritten character recognition; knowledge based systems; natural language processing; text analysis; IFN/ENIT database; handwritten Arabic text; knowledge based baseline detection; optimal thresholding; words segmentation; Character recognition; Handwriting recognition; Image databases; Image edge detection; Image segmentation; Informatics; Information technology; Pixel; Statistical distributions; Text recognition; Arabic Handwriting; Baseline; Horizontal and Vertical Projection; Segmentation;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Information Technology: New Generations, 2008. ITNG 2008. Fifth International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
0-7695-3099-0
Type :
conf
DOI :
10.1109/ITNG.2008.71
Filename :
4492647