DocumentCode :
3317049
Title :
Binarization of natural scene text based on L1-Norm PCA
Author :
Jinfeng Bai ; Bailan Feng ; Bo Xu
Author_Institution :
Interactive Digital Media Technol. Res. Center, Inst. of Autom., Beijing, China
fYear :
2013
fDate :
15-19 July 2013
Firstpage :
1
Lastpage :
4
Abstract :
In this paper, a novel binarization technique is introduced for natural scene text, which can be applied after the text location step in order to improve OCR recognition. At the first step, an “optimum” conversion from color image to grayscale image is performed by minimizing L1 - Norm distance between original color image and reconstructed image on corresponding optimum projection vector. Based on it, at the second step, an approach is developed to classify scene text into two categories: “simple” and “complex”, for the purpose of optimizing the processing speed and preserving performance. At the last step, binarization is performed with different methods for “simple” and “complex” scene text. Results on word images from the challenging ICDAR 2003 dataset show that our scheme can gain higher performance compared with state-of-the-art methods in OCR accuracy.
Keywords :
image colour analysis; image reconstruction; natural scenes; optical character recognition; principal component analysis; text analysis; ICDAR 2003 dataset; L1-Norm PCA; OCR recognition; color image; grayscale image; image reconstruction; natural scene text binarization; optimum conversion; optimum projection vector; scene text classification; Accuracy; Color; Gray-scale; Image color analysis; Optical character recognition software; Principal component analysis; Vectors; Binarization; L1-Norm; Scene text;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo Workshops (ICMEW), 2013 IEEE International Conference on
Conference_Location :
San Jose, CA
Type :
conf
DOI :
10.1109/ICMEW.2013.6618244
Filename :
6618244
Link To Document :
بازگشت