• DocumentCode
    178448
  • Title

    Improving OCR Accuracy by Applying Enhancement Techniques on Multispectral Images

  • Author

    Hollaus, F. ; Diem, M. ; Sablatnig, R.

  • Author_Institution
    Comput. Vision Lab., Vienna Univ. of Technol., Vienna, Austria
  • fYear
    2014
  • fDate
    24-28 Aug. 2014
  • Firstpage
    3080
  • Lastpage
    3085
  • Abstract
    This work is concerned with the legibility enhancement of ancient and degraded handwritings. The writings are partially barely visible under normal white light and hence they have been imaged with a MultiSpectral Imaging (MSI) system in order to increase their legibility. Dimension reduction techniques - like Principal Component Analysis (PCA) - can be used to further enhance the contrast of the faded-out characters. In this work the dimensionality of the multispectral scan is lowered, by applying Linear Discriminant Analysis (LDA). Since LDA is a supervised dimension reduction method, it is necessary to label a subset of the multispectral samples as belonging to the fore-or background. For this purpose, an approach is suggested that uses spatial information. The enhancement method is evaluated by Optical Character Recognition (OCR). By applying the enhancement method the OCR performance is increased in the case of degraded writings, compared to OCR results gained on unprocessed multispectral images and to OCR results achieved on images, which have been produced by applying unsupervised dimension reductions.
  • Keywords
    image enhancement; optical character recognition; principal component analysis; LDA; OCR accuracy; OCR performance; PCA; degraded handwritings; dimension reduction techniques; enhancement techniques; faded-out characters; linear discriminant analysis; multispectral images; multispectral imaging system; multispectral samples; optical character recognition; principal component analysis; unprocessed multispectral images; unsupervised dimension reductions; Accuracy; Degradation; Image restoration; Image segmentation; Optical character recognition software; Principal component analysis; Writing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2014 22nd International Conference on
  • Conference_Location
    Stockholm
  • ISSN
    1051-4651
  • Type

    conf

  • DOI
    10.1109/ICPR.2014.531
  • Filename
    6977243