DocumentCode :
626978
Title :
High performance 3D-FFT implementation
Author :
Nidhi, U. ; Paul, Kolin ; Hemani, Ahmed ; Kumar, Ajit
Author_Institution :
Dept. of Comput. Sci. & Eng, IIT Delhi, New Delhi, India
fYear :
2013
fDate :
19-23 May 2013
Firstpage :
2227
Lastpage :
2230
Abstract :
3D FFT is a very data and compute intensive kernel encountered in many applications. We report a high performance design and implementation of 3D-FFT on a CGRA which supports partial reconfiguration. The hardware software multi clock design uses dynamic reconfiguration to reduce the required communication bandwidth to achieve a sustained throughput of 40 GOPS on a wordsize of 48 bits. Performance metrics including overheads and speed over software for implementations of up to 256 point 3D-FFT have been presented in the paper.
Keywords :
fast Fourier transforms; logic design; parallel architectures; reconfigurable architectures; CGRA; GOPS; coarse grain reconfigurable architectures; dynamic reconfiguration; hardware-software multiclock design; high performance 3D-FFT implementation; partial reconfiguration; Computer architecture; Hardware; Kernel; Program processors; Runtime; Tiles;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Circuits and Systems (ISCAS), 2013 IEEE International Symposium on
Conference_Location :
Beijing
ISSN :
0271-4302
Print_ISBN :
978-1-4673-5760-9
Type :
conf
DOI :
10.1109/ISCAS.2013.6572319
Filename :
6572319
Link To Document :
بازگشت