Title :
Text segmentation using linear transforms
Author :
Chaddha, Navin ; Gupta, Anoop
Author_Institution :
Comput. Syst. Lab., Stanford Univ., CA, USA
Date :
Oct. 30 1995-Nov. 1 1995
Abstract :
Block-based linear transforms have found widespread use in image and video compression. However, popular compression algorithms that use such transforms, such as JPEG, are very effective at compressing continuous-tone images but do not perform well on mixed-mode images that have a substantial text component. With a growing number of applications in which such images occur, e.g., color facsimile, digital libraries, and educational videos, there are advantages to being able to classify each block as either text or continuous tone. With such a classification, different compression parameters or even different algorithms may be employed for the two kinds of data to obtain high compression with minimal loss in visual quality. In this paper we propose algorithms for text segmentation based on a variety of linear transforms. We analyze the algorithms in terms of the accuracy and robustness of segmentation. Our results show that any of the popular linear transforms (DCT, DHT, DFT, WHT, DWT) can be used for accurate and robust text segmentation. An important practical implication of our results is that system designers can now use the same transform for both segmentation and compression, thus obtaining substantial savings in computational cost while improving quality.
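The abstract does not spell out the classification rule, so the following is only a minimal sketch of the general idea of labeling a block from its transform coefficients, not the authors' algorithm. It assumes 8x8 blocks, an orthonormal DCT, and a simple heuristic: text blocks contain sharp edges, so the summed magnitude of their AC (non-DC) coefficients tends to be large. The names `dct2`, `ac_activity`, and `is_text_block`, and the threshold value of 500, are hypothetical choices made for illustration.

```python
import math

N = 8  # assumed block size (JPEG-style 8x8 blocks)


def dct2(block):
    """Orthonormal 2-D DCT-II of an N x N block given as a list of lists."""
    def alpha(k):
        return math.sqrt(1.0 / N) if k == 0 else math.sqrt(2.0 / N)

    out = [[0.0] * N for _ in range(N)]
    for u in range(N):
        for v in range(N):
            s = 0.0
            for x in range(N):
                for y in range(N):
                    s += (block[x][y]
                          * math.cos((2 * x + 1) * u * math.pi / (2 * N))
                          * math.cos((2 * y + 1) * v * math.pi / (2 * N)))
            out[u][v] = alpha(u) * alpha(v) * s
    return out


def ac_activity(block):
    """Sum of absolute DCT coefficients, excluding the DC term at (0, 0)."""
    coeffs = dct2(block)
    total = sum(abs(c) for row in coeffs for c in row)
    return total - abs(coeffs[0][0])


def is_text_block(block, threshold=500.0):
    """Hypothetical rule: high AC activity suggests text, low suggests continuous tone."""
    return ac_activity(block) > threshold


# Illustrative blocks: a flat gray patch, a gentle horizontal ramp,
# and alternating black/white columns standing in for sharp text strokes.
flat = [[128] * N for _ in range(N)]
ramp = [[100 + 2 * y for y in range(N)] for _ in range(N)]
stripes = [[255 if y % 2 else 0 for y in range(N)] for _ in range(N)]
```

Because the coefficients are already computed during this step, a DCT-based coder could in principle reuse them for compression, which is the cost saving the abstract points to. In practice the threshold would need tuning on representative images.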
Keywords :
data compression; DCT; DFT; DHT; DWT; WHT; block-based linear transforms; classification; image compression; mixed-mode images; segmentation; text segmentation; video compression; visual quality; Algorithm design and analysis; Compression algorithms; Discrete cosine transforms; Facsimile; Image coding; Image segmentation; Robustness; Software libraries; Transform coding; Video compression
Conference_Title :
Signals, Systems and Computers, 1995. 1995 Conference Record of the Twenty-Ninth Asilomar Conference on
Conference_Location :
Pacific Grove, CA, USA
Print_ISBN :
0-8186-7370-2
DOI :
10.1109/ACSSC.1995.540937