Title :
A two-stage codebook building method using fast WAN
Author :
Balado Pumarino, F. ; Flórez, Óscar W Márquez
Author_Institution :
Dept. of Commun. Technol., Vigo Univ., Spain
Abstract :
Pattern-matching based document compression systems rely on finding a small set of patterns that can represent the whole document. Two factors have to be considered when analyzing and comparing systems of this kind: the compression ratio attained, and the speed and complexity of codebook building. To reduce the computational burden of the pattern-matching operation while keeping a good compression ratio, we propose a new fast algorithm for carrying out a WAN (weighted AND-NOT) matching process. Codebook building is performed in two stages: the first is based on FWAN (fast WAN) with a loose threshold; the second applies a more accurate but slower method (CTM, EPM) to the initial approximate codebook. This screening greatly reduces the search space for the clustering procedure implicit in obtaining the library, without altering the compression ratio. Experimental results show that the new algorithm is at least three times faster than the usual WAN.
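The two-stage idea in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract does not describe how FWAN speeds up the match, so plain WAN is used for the first stage, and every name here (wan_distance, greedy_codebook, two_stage_codebook, loose_threshold, precise_match) is hypothetical. The sketch assumes equal-sized binary bitmaps, assumes each error pixel is weighted by the number of error pixels in its 3x3 neighbourhood (itself included), and takes the slower second-stage matcher (standing in for CTM or EPM) as a caller-supplied function.

```python
from typing import Callable, List

Bitmap = List[List[int]]  # rows of 0/1 pixels; all marks assumed equal-sized


def weighted_error(err: Bitmap) -> int:
    """Sum, over set pixels, of the set pixels in their 3x3 neighbourhood.

    Assumption: the centre pixel counts, so an isolated error weighs 1
    and clustered errors weigh more.
    """
    h, w = len(err), len(err[0])
    total = 0
    for y in range(h):
        for x in range(w):
            if err[y][x]:
                for ny in range(max(0, y - 1), min(h, y + 2)):
                    for nx in range(max(0, x - 1), min(w, x + 2)):
                        total += err[ny][nx]
    return total


def wan_distance(a: Bitmap, b: Bitmap) -> int:
    """Weighted AND-NOT: weigh the two one-sided error maps separately."""
    h, w = len(a), len(a[0])
    a_not_b = [[a[y][x] & (1 - b[y][x]) for x in range(w)] for y in range(h)]
    b_not_a = [[b[y][x] & (1 - a[y][x]) for x in range(w)] for y in range(h)]
    return weighted_error(a_not_b) + weighted_error(b_not_a)


def greedy_codebook(marks: List[Bitmap],
                    matches: Callable[[Bitmap, Bitmap], bool]) -> List[Bitmap]:
    """Keep a mark only if it matches no pattern already in the codebook."""
    codebook: List[Bitmap] = []
    for mark in marks:
        if not any(matches(mark, pattern) for pattern in codebook):
            codebook.append(mark)
    return codebook


def two_stage_codebook(marks: List[Bitmap],
                       loose_threshold: int,
                       precise_match: Callable[[Bitmap, Bitmap], bool]) -> List[Bitmap]:
    """Stage 1: cheap WAN screening. Stage 2: slower matcher on the survivors."""
    coarse = greedy_codebook(
        marks, lambda m, p: wan_distance(m, p) <= loose_threshold)
    return greedy_codebook(coarse, precise_match)
```

In this reading, the expensive matcher only ever compares entries of the smaller first-stage codebook rather than every mark in the document, which is where the reduction in search space would come from.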
Keywords :
computational complexity; data compression; document image processing; feature extraction; image coding; image matching; pattern matching; search problems; table lookup; CTM; EPM; WAN matching process; clustering procedure; compression rate; computational burden; fast WAN; initial approximate codebook; pattern-matching based document compression; search space; textual image compression; two-stage codebook building method; weighted AND-NOT; Clustering algorithms; Communications technology; Image coding; Image reconstruction; Libraries; Pattern matching; Pressing; Propagation losses; Wide area networks
Conference_Title :
Image Analysis and Processing, 1999. Proceedings. International Conference on
Conference_Location :
Venice
Print_ISBN :
0-7695-0040-4
DOI :
10.1109/ICIAP.1999.797735