DocumentCode :
3266392
Title :
Bi-level document image compression using layout information
Author :
Inglis, Stuart J. ; Witten, Ian H.
Author_Institution :
Dept. of Comput. Sci., Waikato Univ., Hamilton, New Zealand
fYear :
1996
fDate :
Mar/Apr 1996
Firstpage :
442
Abstract :
Most bi-level images stored on computers today comprise scanned text, and are stored using generic bi-level image technology based either on classical run-length coding, such as the CCITT Group 4 method, or on modern schemes such as JBIG that predict pixels from their local image context. However, image compression methods that are tailored specifically for images known to contain printed text can provide noticeably superior performance because they effectively enlarge the context to the character level, at least for those predictions for which such a context is relevant. To deal effectively with general documents that contain text and pictures, it is necessary to detect layout and structural information from the image, and employ different compression techniques for different parts of the image. The authors extend previous work in document image compression in two ways. First, we include automatic discrimination between text and non-text zones in an image. Second, the system is tested on a large real-world image corpus
Keywords :
data compression; document handling; document image processing; image coding; prediction theory; runlength codes; CCITT Group 4 method; JBIG; automatic discrimination; bilevel document image compression; image compression methods; image storage; layout information; local image context; nontext zones; performance; pictures; pixel prediction; printed text; real-world image corpus; run-length coding; scanned text; structural information; text zones; Bibliographies; Computer science; Image coding; Image databases; Image segmentation; Learning systems; Modems; Pixel; Robustness; System testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Compression Conference, 1996. DCC '96. Proceedings
Conference_Location :
Snowbird, UT
ISSN :
1068-0314
Print_ISBN :
0-8186-7358-3
Type :
conf
DOI :
10.1109/DCC.1996.488374
Filename :
488374
Link To Document :
بازگشت