DocumentCode :
2220369
Title :
Script and nature differentiation for Arabic and Latin text images
Author :
Kanoun, Slim ; Ennaji, Abdellatif ; Lecourtier, Yves ; Alimi, Adel M.
Author_Institution :
Perception Syst. Inf. Lab., Rouen Univ., Mont Saint Aignan, France
fYear :
2002
fDate :
2002
Firstpage :
309
Lastpage :
313
Abstract :
A method for differentiating Arabic and Latin text blocks in printed and handwritten scripts is proposed. The method is based on a morphological analysis of each script at the text block level and a geometrical analysis at the line and connected component levels. In this paper, we present a brief survey of existing methods for script differentiation, as well as the general characteristics of Arabic and Latin scripts. We then describe our method for differentiating these two scripts. Finally, we report experimental results on two different data sets: the first consists of 400 text blocks and the second of 335 text blocks.
Keywords :
handwritten character recognition; mathematical morphology; Arabic text images; Latin text images; connected components; geometrical analysis; handwritten scripts; lines; morphological analysis; nature differentiation; printed scripts; script differentiation; text block differentiation; Conferences; Feature extraction; Handwriting recognition; Laboratories; Machine intelligence; Natural languages; Optical character recognition software; Optical devices; Optical sensors; Text analysis;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Frontiers in Handwriting Recognition, 2002. Proceedings. Eighth International Workshop on
Print_ISBN :
0-7695-1692-0
Type :
conf
DOI :
10.1109/IWFHR.2002.1030928
Filename :
1030928