• DocumentCode
    304457
  • Title

    Structure-preserving document image compression

  • Author

    Kia, Omid E. ; Doermann, David S.

  • Author_Institution
    Center for Autom. Res., Maryland Univ., College Park, MD, USA
  • Volume
    1
  • fYear
    1996
  • fDate
    16-19 Sep 1996
  • Firstpage
    193
  • Abstract
    Maintaining a document in image form is often preferable in order to avoid the high cost of manual conversion or the introduction of large numbers of errors by automatic OCR and/or graphics interpretation. The large volume of data in the image can be greatly reduced by using compression techniques. Text-intensive document images typically have a great deal of redundancy in the bitmap representations of symbols, and we make use of that redundancy for compression by clustering components, representing each cluster by a template and encoding the error. Our method is novel in modeling the error associated with each cluster and in preserving structure, an important component for readability and processing
  • Keywords
    coding errors; data compression; document image processing; image coding; image representation; automatic OCR; bitmap representations; compression techniques; data reduction; error encoding; error modeling; graphics interpretation; redundancy; structure-preserving document image compression; symbols; template; text components clustering; text-intensive document images; Automation; Costs; Degradation; Educational institutions; Image coding; Image converters; Image storage; Optical character recognition software; Propagation losses; Redundancy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image Processing, 1996. Proceedings., International Conference on
  • Conference_Location
    Lausanne
  • Print_ISBN
    0-7803-3259-8
  • Type

    conf

  • DOI
    10.1109/ICIP.1996.559466
  • Filename
    559466