DocumentCode
120834
Title
Automatic classification of handwritten and printed text in ICR boxes
Author
Jindal, Abhishek ; Amir, Mohammad
Author_Institution
Newgen Software Technol. Ltd., New Delhi, India
fYear
2014
fDate
21-22 Feb. 2014
Firstpage
1028
Lastpage
1032
Abstract
Machine printed and handwritten texts intermixed appear in the ICR cells of variety of documents. Recognition techniques for machine printed and handwritten text in these document images are significantly different. It is necessary to separate these two types of texts and feed them to the respective engine - OCR (Optical Character Recognition) and ICR (Intelligent Character Recognition) engine to achieve optimal performance. This paper addresses the problem of classification of machine printed and handwritten text from acquired document images. Document processors can increase their productivity and classify handwritten and printed characters inside the ICR cells and feed their images to the appropriate OCR or ICR engine for better accuracy. The algorithm is tested on variety of forms and the recognition rate is calculated to be over 91%.
Keywords
document image processing; image classification; optical character recognition; ICR boxes; ICR engine; OCR engine; document images; document processors; handwritten text classification; intelligent character recognition; machine printed text classification; optical character recognition; recognition rate; recognition techniques; Accuracy; Conferences; Feature extraction; Image recognition; Optical character recognition software; Text analysis; Text recognition; Form Processing; Handwritten Text; ICR; ICR cells; Machine Printed Text; OCR;
fLanguage
English
Publisher
ieee
Conference_Titel
Advance Computing Conference (IACC), 2014 IEEE International
Conference_Location
Gurgaon
Print_ISBN
978-1-4799-2571-1
Type
conf
DOI
10.1109/IAdCC.2014.6779466
Filename
6779466
Link To Document