Title :
Script and nature differentiation for Arabic and Latin text images
Author :
Kanoun, Slim ; Ennaji, Adellatif ; Lecourtier, Yves ; Alimi, Adel M.
Author_Institution :
Perception Syst. Inf. Lab., Rouen Univ., Mont Saint Aignan, France
Abstract :
A method for differentiating Arabic and Latin text blocks in both printed and handwritten scripts is proposed. The method is based on a morphological analysis of each script at the text block level and a geometrical analysis at the line and connected component levels. In this paper, we present a brief survey of existing methods for script differentiation, as well as the general characteristics of Arabic and Latin scripts. We then describe our method for differentiating these two scripts. Finally, we report experimental results on two different data sets: the first comprises 400 text blocks and the second 335 text blocks.
Keywords :
handwritten character recognition; mathematical morphology; Arabic text images; Latin text images; connected components; geometrical analysis; handwritten scripts; lines; morphological analysis; nature differentiation; printed scripts; script differentiation; text block differentiation; Conferences; Feature extraction; Handwriting recognition; Laboratories; Machine intelligence; Natural languages; Optical character recognition software; Optical devices; Optical sensors; Text analysis;
Conference_Title :
Frontiers in Handwriting Recognition, 2002. Proceedings. Eighth International Workshop on
Print_ISBN :
0-7695-1692-0
DOI :
10.1109/IWFHR.2002.1030928