• DocumentCode
    3594
  • Title

    A Single-Channel Architecture for Algebraic Integer-Based 8 ,\\times, 8 2-D DCT Computation

  • Author

    Edirisuriya, A. ; Madanayake, A. ; Cintra, Renato J. ; Dimitrov, Vassil S. ; Rajapaksha, Nilanka

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of Akron, Akron, OH, USA
  • Volume
    23
  • Issue
    12
  • fYear
    2013
  • fDate
    Dec. 2013
  • Firstpage
    2083
  • Lastpage
    2089
  • Abstract
    An area efficient row-parallel architecture is proposed for the real-time implementation of bivariate algebraic integer (AI) encoded 2-D discrete cosine transform (DCT) for image and video processing. The proposed architecture computes 8 × 8 2-D DCT transform based on the Arai DCT algorithm. An improved fast algorithm for AI-based 1-D DCT computation is proposed along with a single channel 2-D DCT architecture. The design improves on the four-channel AI DCT architecture that was published recently by reducing the number of integer channels to one and the number of eight-point 1-D DCT cores from five down to two. The architecture offers exact computation of 8 × 8 blocks of the 2-D DCT coefficients up to the FRS, which converts the coefficients from the AI representation to fixed-point format using the method of expansion factors. Prototype circuits corresponding to FRS blocks based on two expansion factors are realized, tested, and verified on FPGA-chip, using a Xilinx Virtex-6 XC6VLX240T device. Post place-and-route results show a 20% reduction in terms of area compared to the 2-D DCT architecture requiring five 1-D AI cores. The area-time and area-time2 complexity metrics are also reduced by 23% and 22% respectively for designs with eight-bit input word length. The digital realizations are simulated up to place and route for ASICs using 45 nm CMOS standard cells. The maximum estimated clock rate is 951 MHz for the CMOS realizations indicating 7.608·109 pixels/s and a 8 × 8 block rate of 118.875 MHz.
  • Keywords
    CMOS integrated circuits; application specific integrated circuits; discrete cosine transforms; field programmable gate arrays; parallel architectures; video signal processing; 2D DCT computation; ASIC; Arai DCT algorithm; CMOS realizations; CMOS standard cells; FPGA-chip; Xilinx Virtex-6 XC6VLX240T device; bivariate algebraic integer; clock rate; discrete cosine transform; fixed-point format; four-channel AI DCT architecture; image processing; place-and-route; real-time implementation; row-parallel architecture; single channel 2D DCT architecture; single-channel architecture; size 45 nm; video processing; Application specific integrated circuits; Discrete cosine transforms; Image reconstruction; Parallel architectures; Prototypes; Real-time systems; Algebraic integers; discrete cosine transform (DCT); expansion factors;
  • fLanguage
    English
  • Journal_Title
    Circuits and Systems for Video Technology, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1051-8215
  • Type

    jour

  • DOI
    10.1109/TCSVT.2013.2270397
  • Filename
    6544611