Title :
Analyzing communication models for distributed thread-collaborative processors in terms of energy and time
Author :
Klenk, Benjamin ; Oden, Lena ; Froning, Holger
Author_Institution :
Inst. of Comput. Eng. Heidelberg, Univ. of Heidelberg, Heidelberg, Germany
Abstract :
Accelerated computing has become pervasive for increasing the computational power and energy efficiency in terms of GFLOPs/Watt. For application areas with highest demands, for instance high performance computing, data warehousing and high performance analytics, accelerators like GPUs or Intel´s MICs are distributed throughout the cluster. Since current analyses and predictions show that data movement will be the main contributor to energy consumption, we are entering an era of communication-centric heterogeneous systems that are operating with hard power constraints. In this work, we analyze data movement optimizations for distributed heterogeneous systems based on CPUs and GPUs. Thread-collaborative processors like GPUs differ significantly in their execution model from generalpurpose processors like CPUs, but available communication models are still designed and optimized for CPUs. Similar to heterogeneity in processing, heterogeneity in communication can have a huge impact on energy and time. To analyze this impact, we use multiple workloads with distinct properties regarding computational intensity and communication characteristics. We show for which workloads tailored communication models are essential, not only reducing execution time but also saving energy. Exposing the impact in terms of energy and time for communication-centric heterogeneous systems is crucial for future optimizations, and this work is a first step in this direction.
Keywords :
data warehouses; graphics processing units; parallel processing; power aware computing; CPU; GPU; Intel MIC; accelerated computing; communication characteristics; communication models; communication-centric heterogeneous systems; computational intensity; computational power; data movement; data movement optimizations; data warehousing; distributed thread-collaborative processors; energy efficiency; general-purpose processors; hard power constraints; high performance analytics; instance high performance computing; tailored communication models; Bandwidth; Benchmark testing; Computational modeling; Data transfer; Graphics processing units; Instruction sets;
Conference_Titel :
Performance Analysis of Systems and Software (ISPASS), 2015 IEEE International Symposium on
Conference_Location :
Philadelphia, PA
DOI :
10.1109/ISPASS.2015.7095817