Title :
Text segmentation in mixed-mode images
Author :
Chaddha, Navin ; Sharma, Rosen ; Agrawal, Avneesh ; Gupta, Anoop
Author_Institution :
Comput. Syst. Lab., Stanford Univ., CA, USA
fDate :
31 Oct-2 Nov 1994
Abstract :
Block based algorithms have found widespread use in image and video compression. However, popular algorithms such as JPEG, which are very effective in compressing continuous tone images, do not perform well with mixed-mode images which have a substantial text component. With a growing number of applications where such images occur, e.g., color facsimile, digital libraries and educational videos, there are advantages in being able to classify each block as being text or continuous tone. With such a classification, different compression parameters or even algorithms may be employed for the two kinds of data to obtain high compression with minimal loss in visual quality. In this paper we analyze and compare four methods for block classification in mixed mode images, namely variance, absolute-deviation, edge, and DCT based methods. Our evaluation of each scheme is based on the accuracy of segmentation, robustness across different types of images and sensitivity to the threshold used for segmentation. Our results show that DCT based segmentation offers the best accuracy and robustness. Another advantage of DCT is that it is compatible with standards like JPEG, MPEG and H.261
Keywords :
data compression; discrete cosine transforms; edge detection; image classification; image coding; image segmentation; transform coding; visual databases; DCT based methods; H.261; JPEG; MPEG; absolute-deviation; block based algorithms; color facsimile; compression parameters; continuous tone images; digital libraries; edge based methods; edge detection; educational videos; image compression; image database; mixed-mode images; segmentation accuracy; standards compatibility; text segmentation; threshold sensitivity; variance; video compression; Analysis of variance; Discrete cosine transforms; Facsimile; Image analysis; Image coding; Image segmentation; Robustness; Software libraries; Transform coding; Video compression;
Conference_Titel :
Signals, Systems and Computers, 1994. 1994 Conference Record of the Twenty-Eighth Asilomar Conference on
Conference_Location :
Pacific Grove, CA
Print_ISBN :
0-8186-6405-3
DOI :
10.1109/ACSSC.1994.471679