Title :
Accelerating Applications Using GPUs on Embedded Systems and Mobile Devices
Author :
Miaoqing Huang ; Chenggang Lai
Author_Institution :
Dept. of Comput. Sci. & Comput. Eng., Univ. of Arkansas, Fayetteville, AR, USA
Abstract :
Graphics processing units (GPUs) are capable of achieving remarkable performance improvements for a broad range of applications. However, they have not been widely adopted in embedded systems and mobile devices as accelerators mainly due to their relatively higher power consumption compared with embedded microprocessors. In this work, we conduct a comprehensive analysis regarding the feasibility and potential of accelerating applications using GPUs in low-power domains. We use two different categories of benchmarks: (1) the Level 3 BLAS subroutines, and (2) the computer vision algorithms, i.e., mean shift image segmentation and scale-invariant feature transform (SIFT). We carried out our experiments on the Nvidia CARMA development kit, which consists of a Nvidia Tegra 3 quad-core CPU and a Nvidia Quadro 1000M GPU. It is found that the GPU can deliver a remarkable performance speedup compared with the CPU while using a significantly less energy for most benchmarks. Further we propose a hybrid approach to developing applications on platform with GPU accelerators. This approach optimally distributes workload between the parallel GPU and the sequential CPU to achieve the best performance while using the least energy.
Keywords :
computer vision; embedded systems; graphics processing units; low-power electronics; multiprocessing systems; GPU accelerators; GPUs; Nvidia CARMA development kit; Nvidia Quadro 1000M GPU; Nvidia Tegra 3 quad-core CPU; SIFT; computer vision algorithms; embedded microprocessors; embedded systems; graphics processing units; level 3 BLAS subroutines; low-power domains; mean shift image segmentation; mobile devices; parallel GPU; power consumption; scale-invariant feature transform; sequential CPU; Algorithms; Embedded systems; Graphics processing units; Image segmentation; Mobile handsets; Performance evaluation; Power demand;
Conference_Titel :
High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (HPCC_EUC), 2013 IEEE 10th International Conference on
Conference_Location :
Zhangjiajie
DOI :
10.1109/HPCC.and.EUC.2013.146