DocumentCode :
2142581
Title :
Segmentation of Handwritten Textlines in Presence of Touching Components
Author :
Kumar, Jayant ; Kang, Le ; Doermann, David ; Abd-Almageed, Wael
Author_Institution :
Inst. of Adv. Comput. Studies, Univ. of Maryland, College Park, MD, USA
fYear :
2011
fDate :
18-21 Sept. 2011
Firstpage :
109
Lastpage :
113
Abstract :
This paper presents an approach to text line extraction in handwritten document images which combines local and global techniques. We propose a graph-based technique to detect touching and proximity errors that are common with handwritten text lines. In a refinement step, we use Expectation-Maximization (EM) to iteratively split the error segments to obtain correct text-lines. We show improvement in accuracies using our correction method on datasets of Arabic document images. Results on a set of artificially generated proximity images show that the method is effective for handling touching errors in handwritten document images.
Keywords :
document image processing; expectation-maximisation algorithm; feature extraction; graph theory; handwritten character recognition; image segmentation; natural language processing; Arabic document images; error segments; expectation maximization; graph based technique; handwritten document images; handwritten textlines segmentation; proximity errors; textline extraction; touching components; Accuracy; Clustering algorithms; Educational institutions; Estimation; Image segmentation; Least squares approximation; Text analysis; Arabic; Handwritten Documents; Text-lines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2011 International Conference on
Conference_Location :
Beijing
ISSN :
1520-5363
Print_ISBN :
978-1-4577-1350-7
Electronic_ISBN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2011.31
Filename :
6065286
Link To Document :
بازگشت