DocumentCode :
3020067
Title :
Pre-processing methods for handwritten Arabic documents
Author :
Farooq, Faisal ; Govindaraju, Venu ; Perrone, Michael
Author_Institution :
Univ. at Buffalo, Amherst, NY, USA
fYear :
2005
fDate :
29 Aug.-1 Sept. 2005
Firstpage :
267
Abstract :
In order to improve the readability and the automatic recognition of handwritten document images, preprocessing steps are imperative. These steps in addition to conventional steps of noise removal and filtering include text normalization such as baseline correction, slant normalization and skew correction. These steps make the feature extraction process more reliable and effective. Recently Arabic handwriting recognition has received some attention from the research community. Due to the unique nature of the script, the conventional methods do not prove to be effective. In our work, we describe an orientation independent technique for baseline detection of Arabic words. In addition to that we describe, in the rest of the paper, our techniques for slant normalization, slope correction, line and word separation in handwritten Arabic documents. We show how the baseline can be exploited for slope and skew correction before proceeding with the steps of line and word separation.
Keywords :
document image processing; feature extraction; handwriting recognition; handwritten character recognition; natural languages; text analysis; automatic handwritten document image recognition; baseline correction; feature extraction; handwritten Arabic documents; skew correction; slant normalization; text normalization; Feature extraction; Filtering; Focusing; Handwriting recognition; Image recognition; Large-scale systems; Shape; Testing; Venus; Writing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
ISSN :
1520-5263
Print_ISBN :
0-7695-2420-6
Type :
conf
DOI :
10.1109/ICDAR.2005.191
Filename :
1575551
Link To Document :
بازگشت