Title :
Automatic classification of handwritten and printed text in ICR boxes
Author :
Jindal, Abhishek ; Amir, Mohammad
Author_Institution :
Newgen Software Technol. Ltd., New Delhi, India
Abstract :
Machine printed and handwritten texts intermixed appear in the ICR cells of variety of documents. Recognition techniques for machine printed and handwritten text in these document images are significantly different. It is necessary to separate these two types of texts and feed them to the respective engine - OCR (Optical Character Recognition) and ICR (Intelligent Character Recognition) engine to achieve optimal performance. This paper addresses the problem of classification of machine printed and handwritten text from acquired document images. Document processors can increase their productivity and classify handwritten and printed characters inside the ICR cells and feed their images to the appropriate OCR or ICR engine for better accuracy. The algorithm is tested on variety of forms and the recognition rate is calculated to be over 91%.
Keywords :
document image processing; image classification; optical character recognition; ICR boxes; ICR engine; OCR engine; document images; document processors; handwritten text classification; intelligent character recognition; machine printed text classification; optical character recognition; recognition rate; recognition techniques; Accuracy; Conferences; Feature extraction; Image recognition; Optical character recognition software; Text analysis; Text recognition; Form Processing; Handwritten Text; ICR; ICR cells; Machine Printed Text; OCR;
Conference_Titel :
Advance Computing Conference (IACC), 2014 IEEE International
Conference_Location :
Gurgaon
Print_ISBN :
978-1-4799-2571-1
DOI :
10.1109/IAdCC.2014.6779466