• DocumentCode
    2768143
  • Title

    An Efficient SIMD Architecture with Parallel Memory for 2D Cosine Transforms of Video Coding

  • Author

    Peng, Jianying ; Qin, Xing ; Li, Dexian ; Yan, Xiaolang ; Chen, Xiexiong

  • Author_Institution
    Zhejiang Univ., Hangzhou
  • fYear
    2007
  • fDate
    9-11 July 2007
  • Firstpage
    260
  • Lastpage
    265
  • Abstract
    This paper proposes an efficient SIMD architecture with parallel memory for 2D cosine transforms of multiple video standards. A novel parallel memory scheme is employed to provide conflict-free parallel access in both horizontal and vertical directions with the successive or even/odd mode, as well as to eliminate data permutation and matrix transposition. Furthermore, application specific instructions are presented to accelerate the transform kernels, such as butterfly and rotate operations with scaling, rounding and clipping. The simulation results show that proposed architecture achieves significant performance improvement with low hardware cost of 3.2 K equivalent gate count for parallel memory subsystem (not including SRAMs) and 19.8 K for arithmetic units@250 MHz in 0.18 mum process.
  • Keywords
    discrete cosine transforms; parallel architectures; video coding; 2D discrete cosine transform; SIMD architecture; butterfly operation; conflict-free parallel access; multiple video standard; parallel memory subsystem; rotate operation; video coding; Algorithm design and analysis; Arithmetic; Automatic voltage control; Costs; Discrete cosine transforms; Discrete transforms; Hardware; Memory architecture; Very large scale integration; Video coding;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Application-specific Systems, Architectures and Processors, 2007. ASAP. IEEE International Conf. on
  • Conference_Location
    Montreal, Que.
  • ISSN
    2160-0511
  • Print_ISBN
    978-1-4244-1026-2
  • Electronic_ISBN
    2160-0511
  • Type

    conf

  • DOI
    10.1109/ASAP.2007.4429990
  • Filename
    4429990