Title :
Recognition enhancement by linear tournament verification
Author :
Tokahashi, H. ; Griffin, Thomas D.
Author_Institution :
IBM Almaden Res. Center, San Jose, CA, USA
Abstract :
The recent significant enhancement of OCR systems recognition rates has been driven mainly by combining different feature sets or by adopting a voting scheme using multiple independent algorithms. Voting is effective but computationally expensive. A general framework for economical enhancement of recognition rate that focuses on a critical reordering of a few top candidates is described. After the execution of the base OCR algorithm (a three-layer neural network), a linear tournament verification is executed using one-to-one small network verifiers to improve the ordering of the top candidates. Thirty-four one-to-one verifiers were developed for the uppercase English alphabet. Fourteen of these use special features; however, the rest use the same features as those in the base algorithm. On the NIST uppercase data set, the recognition rate for the new system is 95.8%, showing a 1.2% improvement over the system without verification. Although the improvement is modest, the costs in both efficiency and development effort are small
Keywords :
document image processing; feedforward neural nets; optical character recognition; NIST uppercase data set; OCR systems recognition rates; computationally expensive; costs; feature sets; linear tournament verification; multiple independent algorithms; network verifiers; recognition enhancement; recognition rate; three-layer neural network; uppercase English alphabet; voting scheme; Costs; Feature extraction; Large-scale systems; NIST; Neural networks; Optical character recognition software; Ores; Robustness; Topology; Voting;
Conference_Titel :
Document Analysis and Recognition, 1993., Proceedings of the Second International Conference on
Conference_Location :
Tsukuba Science City
Print_ISBN :
0-8186-4960-7
DOI :
10.1109/ICDAR.1993.395667