DocumentCode :
146488
Title :
Recognition of off-line hand printed English Characters, Numerals and Special Symbols
Author :
Sharma, Neelam ; Kumar, Bijendra ; Singh, V.
Author_Institution :
CDAC, Noida, India
fYear :
2014
fDate :
25-26 Sept. 2014
Firstpage :
640
Lastpage :
645
Abstract :
The generic process of Optical Character Recognition (OCR), an area of intensive research in the field of Artificial Intelligence, Pattern Recognition and Computer Vision, aims to recognize text from scanned document images, where data can be in machine printed or hand written format. Optical Character Recognition can improve the interaction between man and machine in various applications including data entry, office automation, digital library, banking applications, health insurance and tax forms etc. Much of work has been done in the recognition of machine printed characters in various languages with considerably good efficiencies, however making robust recognition engines that can be put to recognize hand written and hand printed data with commendable recognition rates still remains as an active area of research owing to the challenges like diverse human handwriting style, variation in shape, angle and style of characters. Taking into account the challenges and scope for improvement in this domain, the work of off-line character recognition of hand printed document images containing English Characters-Uppercase and Lowercase, Numerals and Special Characters has been presented. Statistical, Geometric and Directional Feature Extraction techniques have been applied over segmented character image. Classification was done using Multilayer perception neural network (NN) with back propagation and Support vector machine (SVM) classifier. The recognition rates achieved were up to 98% for Numerals, 96.5% for Special Characters, 95.35% for Uppercase Characters, 92% for Lowercase Characters. The system for combined data set-Characters, Numerals and Special Symbols resulted out to be 92.167% accurate, using SVM as classifier.
Keywords :
backpropagation; feature extraction; image classification; image segmentation; multilayer perceptrons; optical character recognition; statistical analysis; support vector machines; English characters; English numerals; English special symbols; OCR; SVM classifier; artificial intelligence; backpropagation; character image segmentation; computer vision; directional feature extraction techniques; geometric feature extraction techniques; image classification; multilayer perception neural network; optical character recognition; pattern recognition; statistical techniques; support vector machine; text recognition; Accuracy; Character recognition; Feature extraction; Image segmentation; Neural networks; Optical character recognition software; Support vector machines; Hand printed character recognition; SVM classifier; Statistical; geometric and topological features; hand printed numeral recognition; neural network classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Confluence The Next Generation Information Technology Summit (Confluence), 2014 5th International Conference -
Conference_Location :
Noida
Print_ISBN :
978-1-4799-4237-4
Type :
conf
DOI :
10.1109/CONFLUENCE.2014.6949270
Filename :
6949270
Link To Document :
بازگشت