• DocumentCode
    1914964
  • Title

    Improving Energy Efficiency through Parallelization and Vectorization on Intel Core i5 and i7 Processors

  • Author

    Cebrian, Juan M. ; Natvig, Lasse ; Meyer, Jan Christian

  • Author_Institution
    Dept. of Comput. & Inf. Sci. (IDI, NTNU, Trondheim, Norway
  • fYear
    2012
  • fDate
    10-16 Nov. 2012
  • Firstpage
    675
  • Lastpage
    684
  • Abstract
    Driven by the utilization wall and the Dark Silicon effect, energy efficiency has become a key research area in microprocessor design. Vectorization, parallelization, specialization and heterogeneity are the key design points to deal with the utilization wall. Heterogeneous architectures are enhanced with architectural optimizations, such as vectorization, to further increase the energy efficiency of the processor, reducing the number of instructions that go through the pipeline and leveraging the usage of the memory hierarchy. AMD® FusionTM or Intel Core i5 and i7 are commercial examples of this new generation of microprocessors. Still, there is a question to be answered: How can software developers maximize energy efficiency of these architectures? In this paper, we evaluate the energy efficiency of different processors from the Intel Core i5 and i7 family, using selected benchmarks from the PARSEC suite with variable core counts and vectorization techniques to quantify energy efficiency under the Thermal Design Power (TDP). Results show that software developers should prioritize vectorization over parallelization whenever possible, as it is much better in terms of energy efficiency. When using vectorization and parallelization simultaneously, scalability of the application can be reduced drastically, and may require different development strategies to maximize resource utilization in order to increase energy efficiency. This is especially true in the server market, where we can find more than one processor per board. Finally, when comparing on-chip and “at the wall” energy savings, we can see variations from 5 to 20%, depending on the benchmark and system. This high variability shows the need to develop a more detailed model to predict system power based on on-chip power information.
  • Keywords
    energy conservation; microprocessor chips; multiprocessing systems; parallel processing; power aware computing; resource allocation; AMD Fusion; Intel Core i5 processor; Intel Core i7 processor; PARSEC suite; TDP; application scalability; core count; dark silicon effect; energy efficiency; energy savings; heterogeneity; microprocessor design; on-chip power information; parallelization; resource utilization; specialization; thermal design power; utilization wall; vectorization; AVX; Energy Efficiency; PARSEC; SSE; Vectorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing, Networking, Storage and Analysis (SCC), 2012 SC Companion:
  • Conference_Location
    Salt Lake City, UT
  • Print_ISBN
    978-1-4673-6218-4
  • Type

    conf

  • DOI
    10.1109/SC.Companion.2012.93
  • Filename
    6495875