DocumentCode :
3242554
Title :
Automatic Discrimination between Printed and Handwritten Text in Documents
Author :
da Silva, Leonardo F. ; Conci, Aura ; Sanchez, Angel
Author_Institution :
Inst. de Comput., Univ. Fed. Fluminense - UFF, Niteroi, Brazil
fYear :
2009
fDate :
11-15 Oct. 2009
Firstpage :
261
Lastpage :
267
Abstract :
Recognition techniques for printed and handwritten text in scanned documents are significantly different. In this paper we address the problem of identifying each type. We can list at least four steps: digitalization, preprocessing, feature extraction and decision or classification. A new aspect of our approach is the use of data mining techniques on the decision step. A new set of features extracted of each word is proposed as well. Classification rules are mining and used to discern printed text from handwritten. The proposed system was tested in two public image databases. All possible measures of efficiency were computed achieving on every occasion quantities above 80%.
Keywords :
data mining; document image processing; feature extraction; handwritten character recognition; image classification; image segmentation; optical character recognition; text analysis; classification rule mining; data mining; document automatic text discrimination; feature extraction; handwritten text; image classification; printed text; public image databases; scanned documents; text recognition; Character recognition; Classification tree analysis; Computer graphics; Data mining; Feature extraction; Hidden Markov models; Image databases; Image processing; Image segmentation; Optical character recognition software; Data Mining; Machine Vision; document analysis; optical characters recognition; text identification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Graphics and Image Processing (SIBGRAPI), 2009 XXII Brazilian Symposium on
Conference_Location :
Rio de Janiero
ISSN :
1550-1834
Print_ISBN :
978-1-4244-4978-1
Electronic_ISBN :
1550-1834
Type :
conf
DOI :
10.1109/SIBGRAPI.2009.40
Filename :
5395199
Link To Document :
بازگشت