DocumentCode
2768143
Title
An Efficient SIMD Architecture with Parallel Memory for 2D Cosine Transforms of Video Coding
Author
Peng, Jianying ; Qin, Xing ; Li, Dexian ; Yan, Xiaolang ; Chen, Xiexiong
Author_Institution
Zhejiang Univ., Hangzhou
fYear
2007
fDate
9-11 July 2007
Firstpage
260
Lastpage
265
Abstract
This paper proposes an efficient SIMD architecture with parallel memory for 2D cosine transforms of multiple video standards. A novel parallel memory scheme is employed to provide conflict-free parallel access in both horizontal and vertical directions with the successive or even/odd mode, as well as to eliminate data permutation and matrix transposition. Furthermore, application specific instructions are presented to accelerate the transform kernels, such as butterfly and rotate operations with scaling, rounding and clipping. The simulation results show that proposed architecture achieves significant performance improvement with low hardware cost of 3.2 K equivalent gate count for parallel memory subsystem (not including SRAMs) and 19.8 K for arithmetic units@250 MHz in 0.18 mum process.
Keywords
discrete cosine transforms; parallel architectures; video coding; 2D discrete cosine transform; SIMD architecture; butterfly operation; conflict-free parallel access; multiple video standard; parallel memory subsystem; rotate operation; video coding; Algorithm design and analysis; Arithmetic; Automatic voltage control; Costs; Discrete cosine transforms; Discrete transforms; Hardware; Memory architecture; Very large scale integration; Video coding;
fLanguage
English
Publisher
ieee
Conference_Titel
Application-specific Systems, Architectures and Processors, 2007. ASAP. IEEE International Conf. on
Conference_Location
Montreal, Que.
ISSN
2160-0511
Print_ISBN
978-1-4244-1026-2
Electronic_ISBN
2160-0511
Type
conf
DOI
10.1109/ASAP.2007.4429990
Filename
4429990
Link To Document