• DocumentCode
    2155303
  • Title

    SURFEX: A 57fps 1080P resolution 220mW silicon implementation for simplified speeded-up robust feature with 65nm process

  • Author

    Leibo Liu ; Weilong Zhang ; Chenchen Deng ; Shouyi Yin ; Shanshan Cai ; Shaojun Wei

  • Author_Institution
    Nat. Lab. for Inf. Sci. & Technol., Tsinghua Univ., Beijing, China
  • fYear
    2013
  • fDate
    22-25 Sept. 2013
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Speeded Up Robust Feature(SURF) is widely used in computer vision applications. In many recent applications like mobile devices and vision sensor network, it is extremely difficult to meet both the performance and power consumption requirements of SURF implementations, especially for CPU, GPU, DSP or FPGA based solutions. In this paper, the SURF algorithm is simplified and optimized for hardware implementation. To increase the throughput, procedures like orientation assignment and descriptor extraction are re-organized while maintaining enough accuracy; the memory accesses have also been improved to increase the bandwidth and reduce repeated data accesses; the workload of each stage in the pipeline is analyzed and balanced to reduce the pipeline bubble. Furthermore, a method called Word Length Reduction (WLR) is adopted to compress the integral image, which reduces the on-chip memory by 40%. In addition to that, the corresponding power consumptions are reduced significantly. The Simplified SURF is implemented onto a 3.4×4.0 mm2 chip called SURFEX using TSMC 65nm process. The chip is able to process 57 frames of 1080p(1920×1080) video per second with a 200MHz working frequency while dissipating 220mW. This throughput is 6 times of the ones reported in the latest literatures and the power consumption is less than half of the most outstanding implementations.
  • Keywords
    computer vision; elemental semiconductors; image resolution; power consumption; silicon; system-on-chip; CPU; DSP; FPGA; GPU; SURF algorithm; SURF implementations; SURFEX; Si; TSMC process; WLR; computer vision applications; descriptor extraction; hardware implementation; integral image; memory accesses; mobile devices; on-chip memory; orientation assignment; pipeline bubble; power 220 mW; power consumption requirements; silicon implementation; simplified SURF; size 65 nm; speeded-up robust feature; vision sensor network; word length reduction; Clocks; Computer architecture; Feature extraction; Hardware; Power demand; Robustness; Throughput;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Custom Integrated Circuits Conference (CICC), 2013 IEEE
  • Conference_Location
    San Jose, CA
  • Type

    conf

  • DOI
    10.1109/CICC.2013.6658435
  • Filename
    6658435