DocumentCode
688255
Title
Accelerating Applications Using GPUs on Embedded Systems and Mobile Devices
Author
Miaoqing Huang ; Chenggang Lai
Author_Institution
Dept. of Comput. Sci. & Comput. Eng., Univ. of Arkansas, Fayetteville, AR, USA
fYear
2013
fDate
13-15 Nov. 2013
Firstpage
1031
Lastpage
1038
Abstract
Graphics processing units (GPUs) are capable of achieving remarkable performance improvements for a broad range of applications. However, they have not been widely adopted in embedded systems and mobile devices as accelerators mainly due to their relatively higher power consumption compared with embedded microprocessors. In this work, we conduct a comprehensive analysis regarding the feasibility and potential of accelerating applications using GPUs in low-power domains. We use two different categories of benchmarks: (1) the Level 3 BLAS subroutines, and (2) the computer vision algorithms, i.e., mean shift image segmentation and scale-invariant feature transform (SIFT). We carried out our experiments on the Nvidia CARMA development kit, which consists of a Nvidia Tegra 3 quad-core CPU and a Nvidia Quadro 1000M GPU. It is found that the GPU can deliver a remarkable performance speedup compared with the CPU while using a significantly less energy for most benchmarks. Further we propose a hybrid approach to developing applications on platform with GPU accelerators. This approach optimally distributes workload between the parallel GPU and the sequential CPU to achieve the best performance while using the least energy.
Keywords
computer vision; embedded systems; graphics processing units; low-power electronics; multiprocessing systems; GPU accelerators; GPUs; Nvidia CARMA development kit; Nvidia Quadro 1000M GPU; Nvidia Tegra 3 quad-core CPU; SIFT; computer vision algorithms; embedded microprocessors; embedded systems; graphics processing units; level 3 BLAS subroutines; low-power domains; mean shift image segmentation; mobile devices; parallel GPU; power consumption; scale-invariant feature transform; sequential CPU; Algorithms; Embedded systems; Graphics processing units; Image segmentation; Mobile handsets; Performance evaluation; Power demand;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (HPCC_EUC), 2013 IEEE 10th International Conference on
Conference_Location
Zhangjiajie
Type
conf
DOI
10.1109/HPCC.and.EUC.2013.146
Filename
6832028
Link To Document