DocumentCode :
2196677
Title :
Efficient Transcript Mapping to Ease the Creation of Document Image Segmentation Ground Truth with Text-Image Alignment
Author :
Stamatopoulos, Nikolaos ; Louloudis, Georgios ; Gatos, Basilis
Author_Institution :
Comput. Intell. Lab., Nat. Center for Sci. Res. Demokritos, Athens, Greece
fYear :
2010
fDate :
16-18 Nov. 2010
Firstpage :
226
Lastpage :
231
Abstract :
One of the major issues in document image processing is the efficient creation of ground truth in order to be used for training and evaluation purposes. Since a large number of tools have to be trained and evaluated in realistic circumstances, we need to have a quick and low cost way to create the corresponding ground truth. Moreover, the specific need for having the correct text correlated with the corresponding image area in text line and word level makes the process of ground truth creation a difficult, tedious and costly task. In this paper, we introduce an efficient transcript mapping technique to ease the construction of document image segmentation ground truth that includes text-image alignment. The proposed text line transcript mapping technique is based on Hough transform that is guided by the number of the text lines. Concerning the word segmentation ground truth, a gap classification technique constrained by the number of the words is used. Experimental results prove that using the proposed technique for handwritten documents, the percentage of time saved for ground truth creation and text-image alignment is more than 90%.
Keywords :
Hough transforms; document image processing; image segmentation; text analysis; word processing; Hough transform; document image Segmentation; document image processing; gap classification technique; ground truth; handwritten documents; text line transcript mapping technique; text-image alignment; document image segmentation; ground truth creation; transcript mapping;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Frontiers in Handwriting Recognition (ICFHR), 2010 International Conference on
Conference_Location :
Kolkata
Print_ISBN :
978-1-4244-8353-2
Type :
conf
DOI :
10.1109/ICFHR.2010.43
Filename :
5693528
Link To Document :
بازگشت