  • DocumentCode
    329516
  • Title
    Document image compression using straight line extraction and block context model
  • Author
    Joung, Hwayong ; Wong, Edward K. ; Chen, Yu ; Kim, Seung P.
  • Author_Institution
    Dept. of Comput. Sci., Polytech. Univ., Brooklyn, NY, USA
  • Volume
    1
  • fYear
    1998
  • fDate
    4-7 Oct 1998
  • Firstpage
    530
  • Abstract
    We present a new lossy technique for document image compression using straight line extraction and a block context model. Straight line segments are extracted from a binary document image and subtracted from the original image. Their endpoint coordinates and widths can then be efficiently coded. The remaining part of the image, which mainly contains text and other symbols, is coded using a high-order block context model (HOBCM) based on vector quantization (VQ). The proposed method is particularly effective for document images containing a large number of straight line segments, such as engineering or architectural drawings. It achieves much higher compression than conventional lossless techniques, such as the JBIG and CCITT G3 and G4 standards, with little loss of visual quality. In experiments on a group of engineering drawings digitized at 200 dpi, compression ratios ranging from 30 to 70 were obtained.
  • Keywords
    document image processing; edge detection; feature extraction; image coding; vector quantisation; HOBCM; VQ; architectural drawings; binary document image; block context model; compression; compression ratios; document image compression; endpoint coordinates; engineering drawings; high-order block context model; lossy technique; straight line extraction; vector quantization; visual quality; width; Context modeling; Data mining; Engineering drawings; Image coding; Image segmentation; Information science; Merging; Morphology; Vector quantization; White spaces;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Image Processing, 1998. ICIP 98. Proceedings. 1998 International Conference on
  • Conference_Location
    Chicago, IL
  • Print_ISBN
    0-8186-8821-1
  • Type
    conf
  • DOI
    10.1109/ICIP.1998.723555
  • Filename
    723555
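
The abstract describes extracting straight line segments from a binary image, subtracting them, and coding their endpoints and widths separately from the remaining image. The sketch below illustrates only that first stage, under simplifying assumptions that are not from the paper: it detects horizontal runs only, treats every run as a segment of width 1, and does not merge adjacent runs into thicker lines. The function names `extract_horizontal_lines` and `compression_ratio` and the parameter `min_len` are hypothetical, and the HOBCM/VQ coder for the residual is not shown.

```python
from typing import List, Tuple

# (x0, y0, x1, y1, width): endpoint coordinates plus line width, as in the abstract.
Segment = Tuple[int, int, int, int, int]


def extract_horizontal_lines(
    img: List[List[int]], min_len: int
) -> Tuple[List[Segment], List[List[int]]]:
    """Extract horizontal runs of 1-pixels that are at least min_len long.

    Returns the extracted segments and a residual image with those
    runs subtracted (set to 0). Assumption: width is always 1 here.
    """
    residual = [row[:] for row in img]  # copy so the input is untouched
    segments: List[Segment] = []
    for y, row in enumerate(img):
        x = 0
        while x < len(row):
            if row[x] == 1:
                start = x
                while x < len(row) and row[x] == 1:
                    x += 1
                if x - start >= min_len:
                    segments.append((start, y, x - 1, y, 1))
                    for i in range(start, x):
                        residual[y][i] = 0  # subtract the line from the image
            else:
                x += 1
    return segments, residual


def compression_ratio(raw_bits: int, coded_bits: int) -> float:
    """Ratio of uncompressed size to coded size, both in bits."""
    return raw_bits / coded_bits


if __name__ == "__main__":
    page = [
        [0, 0, 0, 0, 0, 0],
        [1, 1, 1, 1, 1, 0],  # a long horizontal run: treated as a line
        [0, 1, 0, 0, 1, 1],  # short runs: left in the residual
    ]
    segs, rest = extract_horizontal_lines(page, min_len=4)
    print(segs)
    print(rest)
```

In this sketch, short runs stay in the residual, which corresponds to the text and symbols that the abstract says are passed to the block context model.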