DocumentCode
548995
Title
Word-level segmentation in printed and handwritten documents
Author
Silva, Lincoln Faria da ; Conci, Aura ; Sanchez, Angel
Author_Institution
Inst. de Comput., Univ. Fed. Fluminense - UFF, Niterói, Brazil
fYear
2011
fDate
16-18 June 2011
Firstpage
1
Lastpage
4
Abstract
Optical Character Recognition techniques for printed and handwritten text are quite different. Therefore, before any further document preprocessing, it is necessary to separate these text types. A fundamental step for this separation is the segmentation. In this paper we address the problem of segmentation these documents into words. The proposed system was tested in two public image databases. Many measures of efficiency were computed achieving correct separation results above 96% with respect to mean precisions and 97% for average of the accuracies. Although it would be very important compare our results with some other algorithm for the same purpose, on this moment it is impossible because there is no work in the same purpose were such comparison could be done.
Keywords
document image processing; handwritten character recognition; optical character recognition; text analysis; visual databases; word processing; handwritten document segmentation; optical character recognition technique; printed document preprocessing; public image database; word-level segmentation; Accuracy; Databases; Feature extraction; Image edge detection; Image segmentation; Object segmentation; Text analysis; Machine Vision; document analysis; handwriting; printed text; segmentation; text identification;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Signals and Image Processing (IWSSIP), 2011 18th International Conference on
Conference_Location
Sarajevo
ISSN
2157-8672
Print_ISBN
978-1-4577-0074-3
Type
conf
Filename
5977413
Link To Document