DocumentCode :
3570892
Title :
A method for text-line segmentation for unconstrained Arabic and Persian handwritten text image
Author :
Shakoori, Reza
Author_Institution :
Univ. of Mumbai, Mumbai, India
fYear :
2014
Firstpage :
338
Lastpage :
344
Abstract :
One of the challenging parts of freestyle handwritten text documents recognition area is text line segmentation problem. Curvilinear text lines and small gaps between neighboring text lines present a challenge to algorithms developed for machine printed or hand-printed documents. In this paper, we propose a novel approach based on painting algorithm by dividing of a text image into number of vertical segments which is called striping. As Arabic and Persian scripts present a lot of dots, we considered historical available nastaliq scanned pages for experiments. Results show the proposed algorithm is robust to scale change, rotation, and noise. The proposed method may contribute significantly for the development of applications related to OCR.
Keywords :
document image processing; handwritten character recognition; image segmentation; text detection; Arabic scripts; OCR; Persian handwritten text image; Persian scripts; curvilinear text lines; freestyle handwritten text document recognition; hand-printed documents; machine printed documents; nastaliq scanned pages; neighboring text lines; painting algorithm; striping; text line segmentation problem; unconstrained Arabic handwritten text image; vertical segments; Accuracy; Histograms; Image segmentation; Noise; Painting; Shape; Transforms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Reuse and Integration (IRI), 2014 IEEE 15th International Conference on
Type :
conf
DOI :
10.1109/IRI.2014.7051909
Filename :
7051909
Link To Document :
بازگشت