Title :
Exploiting Fisher Kernels in Decoding Severely Noisy Document Images
Author :
Chen, Jindong Jd ; Wang, Yizhou
Author_Institution :
Palo Alto Res. Center, Palo Alto
Abstract :
Decoding noisy document images is commonly needed in applications such as enterprise content management. Available OCR solutions are still not satisfactory especially on noisy images, and re-trainable systems require difficult and tedious training example preparation. Motivated by this challenging real application, we propose a novel solution that organically combines generative template models with discriminative classifiers via RBF Fisher kernel derived from a generative model. We show that the new approach is highly accurate in decoding noisy document images, making the system more generalizable to variations in font and degradation, and hence significantly reduces the burden in training example preparation. We also show that as it weights the pixel features by their relevancies, RBF Fisher kernel is more robust, and leads to smaller, faster models by dimensionality reduction.
Keywords :
document image processing; image classification; image denoising; optical character recognition; radial basis function networks; Fisher kernels; OCR solutions; RBF Fisher kernel derived; decoding severely noisy document images; dimensionality reduction; enterprise content management; noisy images; retrainable systems; Content management; Decoding; Degradation; Kernel; Machine learning; Management training; Optical character recognition software; Printing; Robustness; Shape;
Conference_Titel :
Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
Conference_Location :
Parana
Print_ISBN :
978-0-7695-2822-9
DOI :
10.1109/ICDAR.2007.4378743