• DocumentCode
    2148083
  • Title

    A Novel Preprocessing Method for Hectography Prints Based on Independent Component Analysis

  • Author

    Kurbiel, Thomas ; Konya, Iuliu ; Eickeler, Stefan

  • Author_Institution
    Fraunhofer Inst. for Intell. Anal. & Inf. Syst. (IAIS), St. Augustin, Germany
  • fYear
    2011
  • fDate
    18-21 Sept. 2011
  • Firstpage
    1145
  • Lastpage
    1149
  • Abstract
    Archives and cultural facilities consists of vast spectra of different document classes, many of which are not encountered today anymore. The digitization therefore calls for image enhancement and preprocessing solutions so far not required and hence unsolved. A prominent document class in this context is hectography, which was an inexpensive printing and duplication method, widely used throughout the 20th century. The major challenge with hectography is poor contrast on the one side and multiple degradation effects on the other side. In this paper a novel preprocessing method for hectography duplicates is proposed which leads to better Optical Character Recognition (OCR) results compared to traditional methods that operate on grayscale images. The proposed method is based on a linear mixing model used in independent component analysis. The problem of unwanted Gaussian noise components is considered as well.
  • Keywords
    Gaussian noise; document image processing; image enhancement; independent component analysis; optical character recognition; document class digitization; grayscale images; hectography duplicates; hectography print preprocessing method; image enhancement; independent component analysis; linear mixing model; optical character recognition; unwanted Gaussian noise component; Degradation; Image color analysis; Independent component analysis; Noise; Optical character recognition software; Principal component analysis; Random variables; OCR; extraction; hectography; image enhancement; independent component analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2011 International Conference on
  • Conference_Location
    Beijing
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4577-1350-7
  • Electronic_ISBN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2011.231
  • Filename
    6065489