DocumentCode :
3022097
Title :
Fast convolutional OCR with the scanning N-tuple grid
Author :
Lucas, Simon M. ; Cho, Kyu Tae
Author_Institution :
Dept. of Comput. Sci., Essex Univ., UK
fYear :
2005
fDate :
29 Aug.-1 Sept. 2005
Firstpage :
799
Abstract :
This paper introduces a novel high speed convolutional character recognition system. Convolutional mode operation means that no prior localization or segmentation of characters is required, making this mode extremely robust. The method uses a 2-d n-tuple grid to sample the image, but decomposes the address calculations into two one-dimensional scans. This simple innovation leads to a very fast system, and speeds in excess of 100,000 recognitions per second have been achieved for a 10-class character recognition problem, when operated in convolutional mode. Quantitative performance results show an error rate of 4.3% on the MNist dataset of isolated hand-written characters. Qualitative results are presented on museum archive card images, indicating that the method has great potential for the character recognition component in a document image analysis system for images of this type.
Keywords :
document image processing; handwritten character recognition; image segmentation; optical character recognition; MNist dataset; N-tuple grid; character segmentation; convolutional mode operation; document image analysis; handwritten character; image sampling; museum archive card image; optical character recognition; Artificial intelligence; Character recognition; Computer science; Image recognition; Image segmentation; Optical character recognition software; Pattern recognition; Robustness; Table lookup; Text analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
ISSN :
1520-5263
Print_ISBN :
0-7695-2420-6
Type :
conf
DOI :
10.1109/ICDAR.2005.109
Filename :
1575655
Link To Document :
بازگشت