Title :
Double Throughput Multiply-Accumulate unit for FlexCore processor enhancements
Author :
Hoang, Tung Thanh ; Själander, Magnus ; Larsson-Edefors, Per
Author_Institution :
Dept. of Comput. Sci. & Eng., Chalmers Univ. of Technol., Gothenburg, Sweden
Abstract :
As a simple five-stage General-Purpose Processor (GPP), the baseline FlexCore processor has a limited set of datapath units. By utilizing a flexible datapath interconnect and a wide control word, a FlexCore processor is explicitly designed to support integration of special units that, on demand, can accelerate certain data-intensive applications. In this paper, we propose the integration of a novel Double Throughput Multiply-Accumulate (DTMAC) unit, whose different operating modes allow for on-thefly optimization of computational precision. For the two EEMBC benchmarks considered, the FlexCore processor performance is significantly enhanced when one DTMAC accelerator is included, translating into reduced execution time and energy dissipation. In comparison to the 32-bit GPP reference, the accelerated 32-bit FlexCore processor shows a 4.37times improvement in execution time and a 3.92times reduction in energy dissipation, for a benchmark with many consecutive 16-bit MAC operations.
Keywords :
microprocessor chips; system-on-chip; FlexCore processor enhancements; datapath units; double throughput multiply-accumulate unit; energy dissipation reduction; flexible datapath interconnect; general-purpose processor; Acceleration; Computer science; Data engineering; Energy dissipation; Integrated circuit interconnections; LAN interconnection; Microarchitecture; Process design; System-on-a-chip; Throughput;
Conference_Titel :
Parallel & Distributed Processing, 2009. IPDPS 2009. IEEE International Symposium on
Conference_Location :
Rome
Print_ISBN :
978-1-4244-3751-1
Electronic_ISBN :
1530-2075
DOI :
10.1109/IPDPS.2009.5161212