Title :
An entropy based segmentation algorithm for computer-generated document images
Author :
Liu, Lijie ; Dong, Yan ; Song, Xiaomu ; Fan, Guoliang
Author_Institution :
Sch. of Electr. & Comput. Eng., Oklahoma State Univ., Stillwater, OK, USA
Abstract :
This paper presents an efficient compression-oriented segmentation algorithm for computer-generated document images. In this algorithm, a document image is represented in a block-based multiscale pyramid. Then, image blocks will be characterized based on their entropy values of the intensity histogram, and the entropy distribution are assumed to be Gaussian priors in this work. We will discuss two methods, i.e., off-line and online training, to estimate model parameters. We use the multiscale Bayesian estimation to refine the classification results and generate the final segmentation result, where image blocks are classified into four classes, i.e., background, text, graphic and picture. It is expected that the proposed entropy-based segmentation will be suitable for compound document compression and two training approaches apply to different applications.
Keywords :
Gaussian processes; entropy; image representation; image segmentation; parameter estimation; Gaussian priors; block-based multiscale pyramid; computer-generated document images; entropy based segmentation algorithm; image representation; intensity histogram; multiscale Bayesian estimation; off-line training; online training; Bayesian methods; Computer graphics; Density functional theory; Entropy; Histograms; Image coding; Image databases; Image segmentation; Optimization methods; Rate-distortion;
Conference_Titel :
Image Processing, 2003. ICIP 2003. Proceedings. 2003 International Conference on
Print_ISBN :
0-7803-7750-8
DOI :
10.1109/ICIP.2003.1247018