DocumentCode
3022097
Title
Fast convolutional OCR with the scanning N-tuple grid
Author
Lucas, Simon M. ; Cho, Kyu Tae
Author_Institution
Dept. of Comput. Sci., Essex Univ., UK
fYear
2005
fDate
29 Aug.-1 Sept. 2005
Firstpage
799
Abstract
This paper introduces a novel high speed convolutional character recognition system. Convolutional mode operation means that no prior localization or segmentation of characters is required, making this mode extremely robust. The method uses a 2-d n-tuple grid to sample the image, but decomposes the address calculations into two one-dimensional scans. This simple innovation leads to a very fast system, and speeds in excess of 100,000 recognitions per second have been achieved for a 10-class character recognition problem, when operated in convolutional mode. Quantitative performance results show an error rate of 4.3% on the MNist dataset of isolated hand-written characters. Qualitative results are presented on museum archive card images, indicating that the method has great potential for the character recognition component in a document image analysis system for images of this type.
Keywords
document image processing; handwritten character recognition; image segmentation; optical character recognition; MNist dataset; N-tuple grid; character segmentation; convolutional mode operation; document image analysis; handwritten character; image sampling; museum archive card image; optical character recognition; Artificial intelligence; Character recognition; Computer science; Image recognition; Image segmentation; Optical character recognition software; Pattern recognition; Robustness; Table lookup; Text analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
ISSN
1520-5263
Print_ISBN
0-7695-2420-6
Type
conf
DOI
10.1109/ICDAR.2005.109
Filename
1575655
Link To Document