• DocumentCode
    2015543
  • Title

    OCR Accuracy Improvement through a PDE-Based Approach

  • Author

    Drira, Fadoua ; Lebourgeois, Frank ; Emptoz, H.

  • Author_Institution
    LIRIS - INSA Lyon, Villeurbanne
  • Volume
    2
  • fYear
    2007
  • fDate
    23-26 Sept. 2007
  • Firstpage
    1068
  • Lastpage
    1072
  • Abstract
    This paper focuses on improving the optical character recognition (OCR) system ´s accuracy by restoring damaged character through a PDE (Partial Differential Equation)-based approach. This approach, proposed by D. Tschumperle, is an anisotropic diffusion approach driven by local tensors fields. Actually, such approach has many useful properties that are relevant for use in character restoration. For instance, this approach is very appropriate for the processing of oriented patterns which are major characteristics of textual documents. It incorporates both edge enhancing diffusion that tends to preserve local structures during smoothing and coherence-enhancing diffusion that processes oriented structures by smoothing along the flow direction. Furthermore, this tensor diffusion-based approach compared to the existing sate of the art requires neither segmentation nor training steps. Some experiments, done on degraded document images, illustrate the performance of this PDE-based approach in improving both of the visual quality and the OCR accuracy rates for degraded document images.
  • Keywords
    document image processing; edge detection; image restoration; optical character recognition; partial differential equations; OCR system accuracy improvement; PDE-based approach; coherence-enhancing diffusion; edge enhancing diffusion; optical character recognition; partial differential equation; tensor diffusion-based approach; textual document image restoration; Anisotropic magnetoresistance; Character recognition; Degradation; Differential equations; Geometrical optics; Image restoration; Optical character recognition software; Partial differential equations; Smoothing methods; Tensile stress;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
  • Conference_Location
    Parana
  • ISSN
    1520-5363
  • Print_ISBN
    978-0-7695-2822-9
  • Type

    conf

  • DOI
    10.1109/ICDAR.2007.4377079
  • Filename
    4377079