DocumentCode :
3571086
Title :
A Thresholding Approach for Text Extraction in Handwritten Historical Documents Using Adaptive Morphology
Author :
Roy, Bishakha ; Chatterjee, Rohit Kamal
Author_Institution :
Dept. of Comput. Sci. & Eng., BIT, Kolkata, India
fYear :
2014
Firstpage :
198
Lastpage :
203
Abstract :
The aim of preserving historical handwritten documents is to restore the degraded text containing information. But generally global threshold fails to restore the text adequately. Adaptive (local) thresholding is required for preserving the text in these documents. In recent past many standard adaptive thresholding methods have been proposed for binarization of handwritten text document images. We propose a new adaptive thresholding method using locally adaptive mathematical morphology. Formulation of an adaptive structural element is a challenging work and addressed recently by some researchers. Our method at initial step binarizes the image applying global threshold. The residual background image below threshold containing low intensity texts mixed with noise is further processed. A new approach for constructing spatially variant operator corresponding to local variances is proposed. Gaussian surface is selected as an adaptive gray-scale structuring element for mathematical morphological operations (opening and closing), whose parameters base and height depends on local variance. The proposed method successfully demises various kinds of degraded documents enhancing textures with clear background. Experimental result on real historical handwritten document and artificial images show that our method outperforms several other existing methods both visually and using some evaluation metrics.
Keywords :
Gaussian processes; document image processing; feature extraction; handwritten character recognition; history; image segmentation; mathematical morphology; Gaussian surface; adaptive gray-scale structuring element; adaptive structural element; artificial images; degraded documents; degraded text; handwritten historical documents; handwritten text document images; locally adaptive mathematical morphology; low intensity texts; mathematical morphological operations; spatially variant operator; standard adaptive thresholding methods; text extraction; thresholding approach; Measurement; Morphological operations; Morphology; Noise; Standards; Surface fitting; Surface morphology; Gaussian surface; adaptive thresholding; historical handwritten document; morphology; segmentation; structurally adaptive operator;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Emerging Applications of Information Technology (EAIT), 2014 Fourth International Conference of
Type :
conf
DOI :
10.1109/EAIT.2014.65
Filename :
7052045
Link To Document :
بازگشت