Title :
Influence of Color-to-Gray Conversion on the Performance of Document Image Binarization: Toward a Novel Optimization Problem
Author :
Hedjam, Rachid ; Nafchi, Hossein Ziaei ; Kalacska, Margaret ; Cheriet, Mohamed
Author_Institution :
Dept. of GeographyRemote Sensing Lab., McGill Univ., Montreal, QC, Canada
Abstract :
This paper presents a novel preprocessing method of color-to-gray document image conversion. In contrast to the conventional methods designed for natural images that aim to preserve the contrast between different classes in the converted gray image, the proposed conversion method reduces as much as possible the contrast (i.e., intensity variance) within the text class. It is based on learning a linear filter from a predefined data set of text and background pixels that: 1) when applied to background pixels, minimizes the output response and 2) when applied to text pixels, maximizes the output response, while minimizing the intensity variance within the text class. Our proposed method (called learning-based color-to-gray) is conceived to be used as preprocessing for document image binarization. A data set of 46 historical document images is created and used to evaluate subjectively and objectively the proposed method. The method demonstrates drastically its effectiveness and impact on the performance of state-of-the-art binarization methods. Four other Web-based image data sets are created to evaluate the scalability of the proposed method.
Keywords :
document image processing; image colour analysis; linear phase filters; optimisation; Web-based image data sets; background pixels; color-to-gray document image conversion; document image binarization performance; intensity variance minimization; linear filter; optimization problem; text pixels; Color; Degradation; Gray-scale; Image color analysis; Image edge detection; Optimization; Training; Color-to-gray image conversion; Document image binarization; Historical document analysis; Linear filter optimization; color-to-gray image conversion; historical document analysis; linear filter optimization;
Journal_Title :
Image Processing, IEEE Transactions on
DOI :
10.1109/TIP.2015.2442923