DocumentCode :
2799274
Title :
Segmentation of Printed Farsi/Arabic Words
Author :
Broumandnia, A. ; Shanbehzadeh, J. ; Nourani, M.
Author_Institution :
Islamic Azad Univ.-Tehran South Branch, Tehran
fYear :
2007
fDate :
13-16 May 2007
Firstpage :
761
Lastpage :
766
Abstract :
Character connectivity is a problem in automated recognition of printed Farsi/Arabic script. This paper introduces a novel wavelet-transform-based scheme for segmenting printed Farsi/Arabic words into characters. The algorithm employs a new wavelet transform whose extracted coefficients are used to detect the underlying horizontal edges and the baseline. The projection of the horizontal edges and their location on the baseline provide the segmentation points. A classification method then distinguishes the true segmentation points. The new algorithm is robust to noise and to variations in gray level, font, and character size. Simulation results compare the new algorithm with three schemes (closed contour, structural, and holistic) in terms of precision, speed, and robustness against Gaussian noise. Experimental results indicate the superiority of our scheme in terms of precision and show that the new algorithm improves recognition speed by a factor of at least 2.5.
Keywords :
edge detection; image classification; image segmentation; natural language processing; text analysis; wavelet transforms; classification method; printed Arabic script recognition; printed Arabic word segmentation; printed Farsi script recognition; printed Farsi word segmentation; wavelet coefficient extraction; wavelet transform; Background noise; Character recognition; Discrete wavelet transforms; Image edge detection; Image segmentation; Noise level; Noise robustness; Wavelet coefficients; Wavelet domain; Wavelet transforms; Image Processing; Machine Vision; OCR; Pattern Recognition; Wavelet Transform;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computer Systems and Applications, 2007. AICCSA '07. IEEE/ACS International Conference on
Conference_Location :
Amman
Print_ISBN :
1-4244-1030-4
Electronic_ISBN :
1-4244-1031-2
Type :
conf
DOI :
10.1109/AICCSA.2007.370718
Filename :
4231046