DocumentCode :
1230096
Title :
Preprocessing of Low-Quality Handwritten Documents Using Markov Random Fields
Author :
Cao, Huaigu ; Govindaraju, Venu
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. at Buffalo, Amherst, NY
Volume :
31
Issue :
7
fYear :
2009
fDate :
7/1/2009 12:00:00 AM
Firstpage :
1184
Lastpage :
1194
Abstract :
This paper presents a statistical approach to the preprocessing of degraded handwritten forms including the steps of binarization and form line removal. The degraded image is modeled by a Markov random field (MRF) where the hidden-layer prior probability is learned from a training set of high-quality binarized images and the observation probability density is learned on-the-fly from the gray-level histogram of the input image. We have modified the MRF model to drop the preprinted ruling lines from the image. We use the patch-based topology of the MRF and belief propagation (BP) for efficiency in processing. To further improve the processing speed, we prune unlikely solutions from the search space while solving the MRF. Experimental results show higher accuracy on two data sets of degraded handwritten images than previously used methods.
Keywords :
Markov processes; document image processing; handwriting recognition; image segmentation; probability; Markov random fields; belief propagation; binarized images; document analysis; hidden-layer prior probability; image segmentation; low-quality handwritten documents; observation probability density; Handwriting analysis; Markov random field; Markov random fields; document analysis; handwriting recognition.; image segmentation; Algorithms; Artificial Intelligence; Automatic Data Processing; Handwriting; Image Enhancement; Image Interpretation, Computer-Assisted; Information Storage and Retrieval; Markov Chains; Models, Statistical; Pattern Recognition, Automated; Reading; Reproducibility of Results; Sensitivity and Specificity; Subtraction Technique;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2008.126
Filename :
4527250
Link To Document :
بازگشت