Title :
An Efficient SIMD Architecture with Parallel Memory for 2D Cosine Transforms of Video Coding
Author :
Peng, Jianying ; Qin, Xing ; Li, Dexian ; Yan, Xiaolang ; Chen, Xiexiong
Author_Institution :
Zhejiang Univ., Hangzhou
Abstract :
This paper proposes an efficient SIMD architecture with parallel memory for 2D cosine transforms of multiple video standards. A novel parallel memory scheme is employed to provide conflict-free parallel access in both horizontal and vertical directions with the successive or even/odd mode, as well as to eliminate data permutation and matrix transposition. Furthermore, application specific instructions are presented to accelerate the transform kernels, such as butterfly and rotate operations with scaling, rounding and clipping. The simulation results show that proposed architecture achieves significant performance improvement with low hardware cost of 3.2 K equivalent gate count for parallel memory subsystem (not including SRAMs) and 19.8 K for arithmetic units@250 MHz in 0.18 mum process.
Keywords :
discrete cosine transforms; parallel architectures; video coding; 2D discrete cosine transform; SIMD architecture; butterfly operation; conflict-free parallel access; multiple video standard; parallel memory subsystem; rotate operation; video coding; Algorithm design and analysis; Arithmetic; Automatic voltage control; Costs; Discrete cosine transforms; Discrete transforms; Hardware; Memory architecture; Very large scale integration; Video coding;
Conference_Titel :
Application-specific Systems, Architectures and Processors, 2007. ASAP. IEEE International Conf. on
Conference_Location :
Montreal, Que.
Print_ISBN :
978-1-4244-1026-2
Electronic_ISBN :
2160-0511
DOI :
10.1109/ASAP.2007.4429990