• DocumentCode
    688255
  • Title

    Accelerating Applications Using GPUs on Embedded Systems and Mobile Devices

  • Author

    Miaoqing Huang ; Chenggang Lai

  • Author_Institution
    Dept. of Comput. Sci. & Comput. Eng., Univ. of Arkansas, Fayetteville, AR, USA
  • fYear
    2013
  • fDate
    13-15 Nov. 2013
  • Firstpage
    1031
  • Lastpage
    1038
  • Abstract
    Graphics processing units (GPUs) are capable of achieving remarkable performance improvements for a broad range of applications. However, they have not been widely adopted in embedded systems and mobile devices as accelerators mainly due to their relatively higher power consumption compared with embedded microprocessors. In this work, we conduct a comprehensive analysis regarding the feasibility and potential of accelerating applications using GPUs in low-power domains. We use two different categories of benchmarks: (1) the Level 3 BLAS subroutines, and (2) the computer vision algorithms, i.e., mean shift image segmentation and scale-invariant feature transform (SIFT). We carried out our experiments on the Nvidia CARMA development kit, which consists of a Nvidia Tegra 3 quad-core CPU and a Nvidia Quadro 1000M GPU. It is found that the GPU can deliver a remarkable performance speedup compared with the CPU while using a significantly less energy for most benchmarks. Further we propose a hybrid approach to developing applications on platform with GPU accelerators. This approach optimally distributes workload between the parallel GPU and the sequential CPU to achieve the best performance while using the least energy.
  • Keywords
    computer vision; embedded systems; graphics processing units; low-power electronics; multiprocessing systems; GPU accelerators; GPUs; Nvidia CARMA development kit; Nvidia Quadro 1000M GPU; Nvidia Tegra 3 quad-core CPU; SIFT; computer vision algorithms; embedded microprocessors; embedded systems; graphics processing units; level 3 BLAS subroutines; low-power domains; mean shift image segmentation; mobile devices; parallel GPU; power consumption; scale-invariant feature transform; sequential CPU; Algorithms; Embedded systems; Graphics processing units; Image segmentation; Mobile handsets; Performance evaluation; Power demand;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (HPCC_EUC), 2013 IEEE 10th International Conference on
  • Conference_Location
    Zhangjiajie
  • Type

    conf

  • DOI
    10.1109/HPCC.and.EUC.2013.146
  • Filename
    6832028