Title :
Word-level segmentation in printed and handwritten documents
Author :
Silva, Lincoln Faria da ; Conci, Aura ; Sanchez, Angel
Author_Institution :
Inst. de Comput., Univ. Fed. Fluminense - UFF, Niterói, Brazil
Abstract :
Optical character recognition (OCR) techniques for printed and handwritten text are quite different. Therefore, before any further document preprocessing, it is necessary to separate these text types. A fundamental step in this separation is segmentation. In this paper we address the problem of segmenting such documents into words. The proposed system was tested on two public image databases. Many measures of efficiency were computed, with correct separation results above 96% for mean precision and above 97% for average accuracy. Although it would be very important to compare our results with another algorithm designed for the same purpose, at this moment that is impossible because there is no other work with the same purpose against which such a comparison could be made.
Keywords :
document image processing; handwritten character recognition; optical character recognition; text analysis; visual databases; word processing; handwritten document segmentation; optical character recognition technique; printed document preprocessing; public image database; word-level segmentation; Accuracy; Databases; Feature extraction; Image edge detection; Image segmentation; Object segmentation; Text analysis; Machine Vision; document analysis; handwriting; printed text; segmentation; text identification;
Conference_Title :
Systems, Signals and Image Processing (IWSSIP), 2011 18th International Conference on
Conference_Location :
Sarajevo
Print_ISBN :
978-1-4577-0074-3