DocumentCode :
124078
Title :
Comparing soft and hard vector processing in FPGA-based embedded systems
Author :
Soh Jun Jie ; Kapre, Nachiket
Author_Institution :
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
fYear :
2014
fDate :
2-4 Sept. 2014
Firstpage :
1
Lastpage :
7
Abstract :
Soft vector processors can augment and extend the capability of embedded hard vector processors in FPGA-based SoCs such as the Xilinx Zynq. We develop a compiler framework and an auto-tuning runtime that optimizes and executes data-parallel computation either on the scalar ARM processor, the embedded NEON engine or the Vectorblox MXP soft vector processor as appropriate. We consider computational conditions such as precision, vector length, chunk size, IO requirements under which soft vector processing can outperform scalar cores and hard vector blocks. Across a range of data-parallel benchmarks, we show that the MXP soft vector processor can outperform the NEON engine by up to 3.95× while saving 9% dynamic power (0.1W absolute). Our compilation and runtime framework is also able to outperform the gcc NEON vectorizer under certain conditions by explicit generation of NEON intrinsics and performance tuning of the auto-generated data-parallel code.
Keywords :
embedded systems; field programmable gate arrays; system-on-chip; FPGA-based SoC; FPGA-based embedded systems; MXP soft vector processor; Vectorblox MXP; Xilinx Zynq; auto-generated data-parallel code; auto-tuning runtime; computational conditions; data-parallel benchmarks; data-parallel computation; embedded NEON engine; embedded hard vector processors; hard vector blocks; hard vector processing; scalar ARM processor; scalar cores; soft vector processing; Kernel; Measurement; Radio frequency; System-on-chip; Throughput; Tuning; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Field Programmable Logic and Applications (FPL), 2014 24th International Conference on
Conference_Location :
Munich
Type :
conf
DOI :
10.1109/FPL.2014.6927467
Filename :
6927467
Link To Document :
بازگشت