• DocumentCode
    3571086
  • Title

    A Thresholding Approach for Text Extraction in Handwritten Historical Documents Using Adaptive Morphology

  • Author

    Roy, Bishakha ; Chatterjee, Rohit Kamal

  • Author_Institution
    Dept. of Comput. Sci. & Eng., BIT, Kolkata, India
  • fYear
    2014
  • Firstpage
    198
  • Lastpage
    203
  • Abstract
    The aim of preserving historical handwritten documents is to restore the degraded text containing information. But generally global threshold fails to restore the text adequately. Adaptive (local) thresholding is required for preserving the text in these documents. In recent past many standard adaptive thresholding methods have been proposed for binarization of handwritten text document images. We propose a new adaptive thresholding method using locally adaptive mathematical morphology. Formulation of an adaptive structural element is a challenging work and addressed recently by some researchers. Our method at initial step binarizes the image applying global threshold. The residual background image below threshold containing low intensity texts mixed with noise is further processed. A new approach for constructing spatially variant operator corresponding to local variances is proposed. Gaussian surface is selected as an adaptive gray-scale structuring element for mathematical morphological operations (opening and closing), whose parameters base and height depends on local variance. The proposed method successfully demises various kinds of degraded documents enhancing textures with clear background. Experimental result on real historical handwritten document and artificial images show that our method outperforms several other existing methods both visually and using some evaluation metrics.
  • Keywords
    Gaussian processes; document image processing; feature extraction; handwritten character recognition; history; image segmentation; mathematical morphology; Gaussian surface; adaptive gray-scale structuring element; adaptive structural element; artificial images; degraded documents; degraded text; handwritten historical documents; handwritten text document images; locally adaptive mathematical morphology; low intensity texts; mathematical morphological operations; spatially variant operator; standard adaptive thresholding methods; text extraction; thresholding approach; Measurement; Morphological operations; Morphology; Noise; Standards; Surface fitting; Surface morphology; Gaussian surface; adaptive thresholding; historical handwritten document; morphology; segmentation; structurally adaptive operator;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Emerging Applications of Information Technology (EAIT), 2014 Fourth International Conference of
  • Type

    conf

  • DOI
    10.1109/EAIT.2014.65
  • Filename
    7052045