• DocumentCode
    1680451
  • Title

    SIFT implementation and optimization for multi-core systems

  • Author

    Zhang, Qi ; Chen, Yurong ; Zhang, Yimin ; Xu, Yinlong

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Sci. & Technol. of China, Hefei
  • fYear
    2008
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Scale invariant feature transform (SIFT) is an approach for extracting distinctive invariant features from images, and it has been successfully applied to many computer vision problems (e.g. face recognition and object detection). However, the SIFT feature extraction is compute-intensive, and a real-time or even super-real-time processing capability is required in many emerging scenarios. Nowadays, with the multi- core processor becoming mainstream, SIFT can be accelerated by fully utilizing the computing power of available multi-core processors. In this paper, we propose two parallel SIFT algorithms and present some optimization techniques to improve the implementation ´s performance on multi-core systems. The result shows our improved parallel SIFT implementation can process general video images in super-real-time on a dual-socket, quad-core system, and the speed is much faster than the implementation on GPUs. We also conduct a detailed scalability and memory performance analysison the 8-core system and on a 32-core chip multiprocessor (CMP) simulator. The analysis helps us identify possible causes of bottlenecks, and we suggest avenues for scalability improvement to make this application more powerful on future large-scale multi- core systems.
  • Keywords
    computer vision; feature extraction; image matching; microprocessor chips; 32-core chip multiprocessor; SIFT feature extraction; computer vision; memory performance analysis; multi-core systems; scale invariant feature transform; Acceleration; Analytical models; Computer vision; Face recognition; Feature extraction; Large-scale systems; Multicore processing; Object detection; Performance analysis; Scalability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on
  • Conference_Location
    Miami, FL
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-4244-1693-6
  • Electronic_ISBN
    1530-2075
  • Type

    conf

  • DOI
    10.1109/IPDPS.2008.4536131
  • Filename
    4536131