Title :
Analysis and architecture design of block-coding engine for EBCOT in JPEG 2000
Author :
Lian, Chung-Jr ; Chen, Kuan-Fu ; Chen, Hong-Hui ; Chen, Liang-Gee
Author_Institution :
Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
fDate :
3/1/2003 12:00:00 AM
Abstract :
Embedded block coding with optimized truncation (EBCOT) is the most important technology in the latest image-coding standard, JPEG 2000. The hardware design of the block-coding engine in EBCOT is critical because the operations are bit-level processing and occupy more than half of the computation time of the whole compression process. A general purpose processor (GPP) is, therefore, very inefficient to process these operations. We present detailed analysis and dedicated hardware architecture of the block-coding engine to execute the EBCOT algorithm efficiently. The context formation process in EBCOT is analyzed to get an insight into the characteristics of the operation. A column-based architecture and two speed-up methods, sample skipping (SS) and group-of-column skipping (GOCS), for the context generation are then proposed. As for arithmetic encoder design, the pipeline and look-ahead techniques are used to speed up the processing. It is shown that about 60% of the processing time is reduced compared with sample-based straightforward implementation. A test chip is designed and the simulation results show that it can process 4.6 million pixels image within 1 s, corresponding to 2400 × 1800 image size, or CIF (352 × 288) 4 : 2 : 0 video sequence with 30 frames per second at 50-MHz working frequency.
Keywords :
arithmetic codes; code standards; data compression; discrete wavelet transforms; image coding; image sampling; image sequences; optimisation; pipeline processing; telecommunication standards; 50 MHz; EBCOT algorithm; JPEG 2000 image-coding standard; JPEG image-coding standard; architecture design; arithmetic encoder design; bit-level; block-coding engine; column-based architecture; computation time; context formation; context generation; discrete wavelet transform; embedded block coding with optimized truncation; frequency; general purpose processor; group-of-column skipping; hardware architecture; look-ahead technique; pipeline technique; pixels; processing time; sample skipping; sample-based implementation; simulation results; speed-up methods; test chip; video sequence; Algorithm design and analysis; Arithmetic; Block codes; Computer architecture; Engines; Hardware; Image coding; Pipelines; Testing; Transform coding;
Journal_Title :
Circuits and Systems for Video Technology, IEEE Transactions on
DOI :
10.1109/TCSVT.2003.809833