Title :
Improved neural network OCR based on preprocessed blob classes
Author :
Fedorovici, Lucian-Ovidiu ; Voisan, Emil ; Dragan, Florin ; Iercan, Daniel
Author_Institution :
Dept. of Automatics & Appl. Inf., Univ. of Timisoara, Timisoara, Romania
Abstract :
Optical character recognition (OCR) technologies have developed rapidly over the last decade, driven by advances in hardware, image processing, and classification algorithms. Multiple OCR technologies are available, each based on a different approach, e.g., geometric processing or cognitive learning based on neural networks. Execution time is a critical parameter for each of these approaches. In our opinion, a large share of the text submitted to OCR comes from preprinted documents and forms, which are bound by various regulations on layout and/or content. Based on this observation, we argue that most of the characters to be recognized have a similar layout, so processing performance can be improved by grouping similar characters (blobs) into classes based on geometric similarity and performing OCR only on the representative blob of each class. In this paper we present the architecture of an OCR technology based on a multilayer neural network. The performance improvement is obtained with a blob classifier that groups characters into classes and then performs OCR only on the representative blob of each class.
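The grouping idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the bitmap representation, the pixel-difference distance, the greedy grouping strategy, the 0.1 threshold, and the names `blob_distance`, `group_blobs`, and `recognize_all` are all assumptions made for illustration. The `recognize` callback stands in for the neural network classifier, which the abstract does not specify in detail.

```python
from typing import Callable, List, Sequence, Tuple

# Assumed representation: a blob is a binary bitmap, given as rows of 0/1 pixels.
Blob = Tuple[Tuple[int, ...], ...]


def blob_distance(a: Blob, b: Blob) -> float:
    """Fraction of differing pixels; 1.0 when the two bitmaps differ in shape."""
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        return 1.0
    total = sum(len(row) for row in a)
    if total == 0:
        return 0.0
    diff = sum(pa != pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))
    return diff / total


def group_blobs(blobs: Sequence[Blob], threshold: float = 0.1) -> List[List[int]]:
    """Greedy grouping: a blob joins the first class whose representative
    (the first member of that class) is within the threshold."""
    classes: List[List[int]] = []
    for i, blob in enumerate(blobs):
        for members in classes:
            if blob_distance(blobs[members[0]], blob) <= threshold:
                members.append(i)
                break
        else:
            # No existing class is close enough, so this blob starts a new one.
            classes.append([i])
    return classes


def recognize_all(
    blobs: Sequence[Blob],
    recognize: Callable[[Blob], str],
    threshold: float = 0.1,
) -> List[str]:
    """Run the (expensive) recognizer once per class and copy the label
    to every member of that class."""
    labels = [""] * len(blobs)
    for members in group_blobs(blobs, threshold):
        label = recognize(blobs[members[0]])
        for i in members:
            labels[i] = label
    return labels
```

With this structure, the recognizer is called once per class rather than once per character, so the saving grows with the number of repeated glyphs on a page. The trade-off is that a misclassified representative propagates its label to every member of its class.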
Keywords :
Biological neural networks; Character recognition; Engines; Feature extraction; Hardware; Humans; Image processing; Neural networks; Optical character recognition software; Optical computing; OCR engine; classification; neural network; performance improvements;
Conference_Title :
Computational Cybernetics and Technical Informatics (ICCC-CONTI), 2010 International Joint Conference on
Conference_Location :
Timisoara, Romania
Print_ISBN :
978-1-4244-7432-5
DOI :
10.1109/ICCCYB.2010.5491211