Title : 
GPU acceleration of object detection on video stream using CUDA
         
        
            Author : 
Dawwd, Shefa A. ; Salim, Ula T.
         
        
            Author_Institution : 
Comput. Eng. Dept., Univ. of Mosul, Mosul, Iraq
         
        
        
        
        
        
            Abstract : 
Object detection is an important application of computer vision and of image and video processing. However, achieving high accuracy together with fast, invariant detection under changing object states (position, scale, illumination, and noise) is a central problem of object detection in video frames and images, and it cannot be realized with sequential processing on a single-core General Purpose Central Processing Unit (GPCPU). In this paper, to address these problems and speed up the computationally intensive calculations required, a simple and efficient template matching algorithm for object detection is proposed. It slides a window across the video frame and applies two similarity measures as the window function: the Sum of Absolute Difference (SAD) and a pyramid downscale multi-resolution Sum of Absolute Difference, called PSAD. The implementation runs on a Graphics Processing Unit (GPU) and relies on parallel processing, Data Level Parallelism (DLP), and single instruction multiple data (SIMD) operations with the Compute Unified Device Architecture (CUDA). Speedups of 161x for SAD and 97x for PSAD are achieved for an image size of 768×567 in the MATLAB environment.
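
The sliding-window matching described in the abstract can be illustrated with a minimal sequential sketch. It is not the authors' MATLAB/CUDA implementation. The function names (`sad`, `match_sad`, `downscale`, `match_psad`), the single pyramid level, the 2x2 averaging, and the `radius` refinement parameter are all assumptions made for illustration; the abstract does not specify how PSAD is built, so the coarse-to-fine search here is one plausible reading.

```python
def sad(frame, template, top, left):
    """Sum of absolute differences between the template and the
    frame window whose upper-left corner is (top, left)."""
    return sum(
        abs(frame[top + r][left + c] - template[r][c])
        for r in range(len(template))
        for c in range(len(template[0]))
    )


def search(frame, template, tops, lefts):
    """Evaluate SAD at each candidate position; return (best_sad, top, left)."""
    best = None
    for top in tops:
        for left in lefts:
            score = sad(frame, template, top, left)
            if best is None or score < best[0]:
                best = (score, top, left)
    return best


def match_sad(frame, template):
    """Exhaustive sliding-window search over every valid position."""
    max_top = len(frame) - len(template)
    max_left = len(frame[0]) - len(template[0])
    return search(frame, template, range(max_top + 1), range(max_left + 1))


def downscale(img):
    """Halve the resolution by averaging 2x2 blocks
    (an odd trailing row or column is dropped)."""
    return [
        [
            (img[r][c] + img[r][c + 1] + img[r + 1][c] + img[r + 1][c + 1]) // 4
            for c in range(0, len(img[0]) - 1, 2)
        ]
        for r in range(0, len(img) - 1, 2)
    ]


def match_psad(frame, template, radius=2):
    """Assumed coarse-to-fine scheme: match at half resolution, then
    refine at full resolution within `radius` pixels of the coarse hit."""
    _, ctop, cleft = match_sad(downscale(frame), downscale(template))
    max_top = len(frame) - len(template)
    max_left = len(frame[0]) - len(template[0])
    tops = range(max(0, 2 * ctop - radius), min(max_top, 2 * ctop + radius) + 1)
    lefts = range(max(0, 2 * cleft - radius), min(max_left, 2 * cleft + radius) + 1)
    return search(frame, template, tops, lefts)
```

Each window position is scored independently of the others, which is the property that makes the search data-parallel; a GPU version could, for instance, assign one thread per window position. The coarse pass in `match_psad` evaluates roughly a quarter of the positions on images with a quarter of the pixels, at the cost of possibly missing a match that the downscaling blurs away.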
         
        
            Keywords : 
graphics processing units; image matching; object detection; parallel architectures; video streaming; CUDA; DLP; GPCPU; GPU; MATLAB environment; PSAD; SIMD operations; changing object states; compute unified device architecture; computer vision; data level parallelism; fast invariant detecting function; graphic processing unit; image processing; parallel processing technique; pyramid downscale multiresolution sum of absolute difference; similarity measures; single core general purpose central processing unit; single instruction multiple data operations; sliding window; template matching algorithm; video frames; video processing; window function; Computer architecture; Computers; Instruction sets; MATLAB; Parallel processing; SAD; template matching; video stream
         
        
        
        
            Conference_Title : 
Electrical, Communication, Computer, Power, and Control Engineering (ICECCPCE), 2013 International Conference on
         
        
            Conference_Location : 
Mosul
         
        
        
            DOI : 
10.1109/ICECCPCE.2013.6998761