Title :
Recovery of blurring scanned manuscript image based on wavelets transform algorithm
Author_Institution :
Sci. & Technol. Coll., North China Electr. Power Univ., Baoding, China
Abstract :
During paper manuscript scanning or photographing, which is the basis of automatic data entry for digitized documents by Optical Character Recognition (OCR) technique, the digital image blur is performed for scanning too thin or ink-bleed piece of paper and can usually cause OCR recognition errors. In this article, the combined algorithm of wavelet transform analysis and the median filter as well as the histogram adjustment is proposed for recovering Chinese character signal from the blurred mixtures. Scanning image of a piece of book page printed on thin paper-base is used to examine the algorithm, with the scanning image of the Chinese character mixed with the reverse side chart of the paper as blur signal. Simulation indicates that the combined algorithm can more effectively recover the blurring manuscript image to accuracy rate of 45% than the original image of accuracy rate only 1%. The combined algorithm proposed in this article can be directly integrated in OCR software to obtain higher accuracy for digital character recognition and automatic data entry.
Keywords :
image restoration; median filters; optical character recognition; wavelet transforms; Chinese character signal; automatic data entry; digital character recognition; image blurring; median filter; optical character recognition; wavelets transform; Accuracy; Algorithm design and analysis; Character recognition; Histograms; Optical character recognition software; Pixel; Wavelet transforms; Chinese character; OCR recognition; denosing; image processing; wavelet tansform analysis;
Conference_Titel :
Image and Signal Processing (CISP), 2010 3rd International Congress on
Conference_Location :
Yantai
Print_ISBN :
978-1-4244-6513-2
DOI :
10.1109/CISP.2010.5646863