DocumentCode
3570892
Title
A method for text-line segmentation for unconstrained Arabic and Persian handwritten text image
Author
Shakoori, Reza
Author_Institution
Univ. of Mumbai, Mumbai, India
fYear
2014
Firstpage
338
Lastpage
344
Abstract
One of the challenging parts of freestyle handwritten text documents recognition area is text line segmentation problem. Curvilinear text lines and small gaps between neighboring text lines present a challenge to algorithms developed for machine printed or hand-printed documents. In this paper, we propose a novel approach based on painting algorithm by dividing of a text image into number of vertical segments which is called striping. As Arabic and Persian scripts present a lot of dots, we considered historical available nastaliq scanned pages for experiments. Results show the proposed algorithm is robust to scale change, rotation, and noise. The proposed method may contribute significantly for the development of applications related to OCR.
Keywords
document image processing; handwritten character recognition; image segmentation; text detection; Arabic scripts; OCR; Persian handwritten text image; Persian scripts; curvilinear text lines; freestyle handwritten text document recognition; hand-printed documents; machine printed documents; nastaliq scanned pages; neighboring text lines; painting algorithm; striping; text line segmentation problem; unconstrained Arabic handwritten text image; vertical segments; Accuracy; Histograms; Image segmentation; Noise; Painting; Shape; Transforms;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Reuse and Integration (IRI), 2014 IEEE 15th International Conference on
Type
conf
DOI
10.1109/IRI.2014.7051909
Filename
7051909
Link To Document