DocumentCode
2142041
Title
An MRF Model for Binarization of Natural Scene Text
Author
Mishra, Anand ; Alahari, Karteek ; Jawahar, C.V.
Author_Institution
Int. Inst. of Inf. Technol. Hyderabad, Hyderabad, India
fYear
2011
fDate
18-21 Sept. 2011
Firstpage
11
Lastpage
16
Abstract
Inspired by the success of MRF models for solving object segmentation problems, we formulate the binarization problem in this framework. We represent the pixels in a document image as random variables in an MRF, and introduce a new energy (or cost) function on these variables. Each variable takes a foreground or background label, and the quality of the binarization (or labelling) is determined by the value of the energy function. We minimize the energy function, i.e. find the optimal binarization, using an iterative graph cut scheme. Our model is robust to variations in foreground and background colours as we use a Gaussian Mixture Model in the energy function. In addition, our algorithm is efficient to compute, and adapts to a variety of document images. We show results on word images from the challenging ICDAR 2003 dataset, and compare our performance with previously reported methods. Our approach shows significant improvement in pixel level accuracy as well as OCR accuracy.
Keywords
Gaussian processes; Markov processes; document image processing; graph theory; image segmentation; iterative methods; natural scenes; Gaussian mixture model; MRF model; Markov random field; document image; energy function; iterative graph cut scheme; natural scene text binarization; object segmentation; optimal binarization; random variable; Accuracy; Image color analysis; Image edge detection; Image segmentation; Noise measurement; Optical character recognition software; Robustness; Binarization; GMM; Graph Cut; MRF;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition (ICDAR), 2011 International Conference on
Conference_Location
Beijing
ISSN
1520-5363
Print_ISBN
978-1-4577-1350-7
Electronic_ISBN
1520-5363
Type
conf
DOI
10.1109/ICDAR.2011.12
Filename
6065267
Link To Document