DocumentCode :
3719766
Title :
Efficient multiscale and multifont optical character recognition system based on robust feature description
Author :
Mahmoud Soua;Rostom Kachouri;Mohamed Akil
Author_Institution :
Universit? Paris-Est, Laboratoire d´Informatique Gaspard-Monge, Equipe A3SI, ESIEE Paris, France
fYear :
2015
Firstpage :
575
Lastpage :
580
Abstract :
Optical Character Recognition (OCR) is the process of translating images of text into a comprehensible machine format. Generally, an OCR system is composed of binarization, segmentation and recognition stages. Given an extracted binary character, the recognition stage ensures its description and decides its corresponding ASCII code. In this paper, we propose a new OCR system that aims to high speed, Multiscale and Multifont character recognition. Our proposal is based essentially on robust description using a new Unified Character Descriptor (UCD). In addition, a character type-face and font-size recognition is performed to choose the adequate template for faster matching process. Obtained OCR Accuracy of our proposed System is 1.5x higher then that reached by Tesseract on the LRDE dataset.
Keywords :
"Character recognition","Optical character recognition software","Feature extraction","Image segmentation","Image edge detection","High-speed optical techniques","Optical imaging"
Publisher :
ieee
Conference_Titel :
Image Processing Theory, Tools and Applications (IPTA), 2015 International Conference on
Print_ISBN :
978-1-4799-8636-1
Electronic_ISBN :
2154-512X
Type :
conf
DOI :
10.1109/IPTA.2015.7367214
Filename :
7367214
Link To Document :
بازگشت