Title :
Noise removal from binarized text images
Author :
Le, Huy Phat ; Lee, Gueesang
Author_Institution :
Dept. of Electron. & Comput. Eng., Chonnam Nat. Univ., Gwangju, South Korea
Abstract :
Noise is common in binarized images which are the result of extracting text from the embedded text in an image. It degrades the performance of character recognition module. In this paper, a robust algorithm is proposed to eliminate noise from the binarized text image based on the text-stroke width information. First, salt-pepper-like noises are eliminated by a morphological filter, which is to enhance the correctness of estimating the text-stroke width. Finally, a method based on the text-stroke width information is proposed to extract text from noise images. Experiments on a wide variety of binarized images with salt-pepper noise, and cluttered noise reveal the feasibility and effectiveness of our proposed approach in removing the noise.
Keywords :
document image processing; estimation theory; feature extraction; image denoising; optical character recognition; text analysis; binarized text images; character recognition module; cluttered noise; embedded text; morphological filter; noise images; noise removal; robust algorithm; salt-pepper-like noises; text-stroke width information; Algorithm design and analysis; Background noise; Character recognition; Colored noise; Data mining; Morphology; Noise shaping; Optical character recognition software; Optical noise; Shape;
Conference_Titel :
Computer and Automation Engineering (ICCAE), 2010 The 2nd International Conference on
Conference_Location :
Singapore
Print_ISBN :
978-1-4244-5585-0
Electronic_ISBN :
978-1-4244-5586-7
DOI :
10.1109/ICCAE.2010.5451814