DocumentCode :
167406
Title :
Exploiting DMA for Performance and Energy Optimized STREAM on a DSP
Author :
Netzer, Gilbert ; Johnsson, Lennart ; Ahlin, Daniel ; Stotzer, Eric ; Varis, Pekka ; Laure, Erwin
Author_Institution :
KTH R. Inst. of Technol., Stockholm, Sweden
fYear :
2014
fDate :
19-23 May 2014
Firstpage :
805
Lastpage :
814
Abstract :
Energy efficiency is of major concern in HPC. DSP architectures have the potential to offer highly competitive energy efficiency for applications requiring 64-bit floating-point precision. For STREAM, we achieved 1.47GB/J energy efficiency and 96% DDR3 memory bandwidth utilization on the Texas Instruments TMS320C6678 DSP by using its DMA engines for prefetching to avoid cache misses, which cause pipeline stalls in the DSP´s cores, and to prevent write-allocate loads, which would significantly reduce performance. The DMA engines were also used to coordinate the DSPs cores and schedule main memory accesses to improve DDR3 bandwidth utilization. We briefly describe the instrumentation that we designed and implemented for accurate measurement of the core-related, on-chip memory, and DDR3 power consumption and the effectiveness of the DSP´s power saving mechanisms to trade-off performance and energy efficiency.
Keywords :
digital signal processing chips; energy conservation; floating point arithmetic; optimisation; parallel processing; DDR3 memory bandwidth utilization; DMA engines; HPC; STREAM optimization techniques; Texas Instruments TMS320C6678 DSP; energy efficiency; floating-point precision; Bandwidth; Digital signal processing; Engines; Multicore processing; Pipeline processing; Prefetching; System-on-chip; STREAM benchmark; digital signal processor; direct memory access; energy efficiency; high-performance computing; power measurement; streaming memory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel & Distributed Processing Symposium Workshops (IPDPSW), 2014 IEEE International
Conference_Location :
Phoenix, AZ
Print_ISBN :
978-1-4799-4117-9
Type :
conf
DOI :
10.1109/IPDPSW.2014.92
Filename :
6969464
Link To Document :
بازگشت