• DocumentCode
    688301
  • Title

    Parallelization and Optimization of SIFT on GPU Using CUDA

  • Author

    Zhou Yonglong ; Mei Kuizhi ; Ji Xiang ; Dong Peixiang

  • Author_Institution
    Xi'an Jiaotong Univ., Xi'an, China
  • fYear
    2013
  • fDate
    13-15 Nov. 2013
  • Firstpage
    1351
  • Lastpage
    1358
  • Abstract
    Feature extraction based on the scale-invariant feature transform (SIFT) is widely used to extract features from images, which makes accelerating SIFT-based algorithms on the GPU very attractive. In this paper, we present several parallel computing strategies, and we implement and optimize the SIFT algorithm on the GPU using the CUDA programming model. Each stage of SIFT is analyzed in detail to choose its parallel strategy. Building on the elementary CUDA-SIFT and the CUDA architecture, we optimize the implementation in several respects to speed up CUDA-SIFT. Experimental results demonstrate that our implementation after optimization is 2.5 times faster than the previous optimization, and that our CUDA-based SIFT runs at 20 frames per second on most of the 1280 × 960 images in the test. Testing at 1920 × 1440 resolution, we obtain an average speed of 11 frames per second, which is about 60 times faster than the CPU implementation of SIFT. In short, our implementation achieves appropriate accuracy and higher efficiency than CPU implementations and other GPU implementations, which we attribute to our dedicated optimization strategies.
  • Keywords
    graphics processing units; parallel architectures; parallel processing; transforms; CPU implementation; CUDA architecture; CUDA programming model; CUDA-SIFT; GPU; SIFT optimization; SIFT parallelization; feature extraction algorithm; parallel computing strategies; parallel strategy; scale invariant feature transform; Convolution; Feature extraction; Graphics processing units; Histograms; Instruction sets; Optimization; Vectors;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (HPCC_EUC), 2013 IEEE 10th International Conference on
  • Conference_Location
    Zhangjiajie
  • Type

    conf

  • DOI
    10.1109/HPCC.and.EUC.2013.192
  • Filename
    6832074