DocumentCode :
3488170
Title :
Identification of Machine-Printed and Handwritten Words in Arabic and Latin Scripts
Author :
Saidani, A. ; Echi, Afef Kacem ; Belaid, Abdel
Author_Institution :
LaTICE-ESSTT, Univ. of Tunisia, Tunis, Tunisia
fYear :
2013
fDate :
25-28 Aug. 2013
Firstpage :
798
Lastpage :
802
Abstract :
Our ultimate objective is to contribute to the field of script and nature identification to be able to differentiate, at word level, handwritten or machine-printed, Arabic and Latin scripts. Different sets of features have been employed successfully for discriminating between Arabic and Latin words. They include few well-established features previously used and adapted in our case and new structural features which are intrinsic features of Arabic and Latin scripts. We select features that maximize the distinction between Arabic and Latin words. Experiments have been conducted with 1320 handwritten and printed words, covering a wide range of fonts, and encouraging results have been obtained. We achieved a correct classification of 98.4 percent for word level script and nature identification using Bayes classifier.
Keywords :
Bayes methods; feature extraction; handwritten character recognition; image classification; natural language processing; word processing; Arabic script features; Arabic script identification; Arabic words; Bayes classifier; Latin script features; Latin script identification; Latin words; handwritten word identification; machine-printed word identification; nature identification; word level script identification; Accuracy; Databases; Feature extraction; Histograms; Particle separators; Shape; Writing; classification; feature extraction; script and nature identification; word level;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
ISSN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2013.163
Filename :
6628728
Link To Document :
بازگشت