DocumentCode
646645
Title
Re-Introduction of communication-avoiding FMM-accelerated FFTs with GPU acceleration
Author
Langston, M. Harper ; Baskaran, Mani ; Meister, B. ; Vasilache, N. ; Lethin, R.
Author_Institution
Reservoir Labs. Inc. New York, New York, NY, USA
fYear
2013
fDate
10-12 Sept. 2013
Firstpage
1
Lastpage
6
Abstract
As distributed memory systems grow larger, communication demands have increased. Unfortunately, while the costs of arithmetic operations continue to decrease rapidly, communication costs have not. As a result, there has been a growing interest in communication-avoiding algorithms for some of the classic problems in numerical computing, including communication-avoiding Fast Fourier Transforms (FFTs). A previously-developed low-communication FFT, however, has remained largely out of the picture, partially due to its reliance on the Fast Multipole Method (FMM), an algorithm that typically aids in accelerating dense computations. We have begun an algorithmic investigation and re-implementation design for the FMM-accelerated FFT, which exploits the ability to tune precision of the result (due to the mathematical nature of the FMM) to reduce power-burning communication and computation, the potential benefit of which is to reduce the energy required for the fundamental transform of digital signal processing. We reintroduce this algorithm as well as discuss new innovations for separating the distinct portions of the FMM into a CPU-dedicated process, relying on inter-processor communication for approximate interactions, and a GPU-dedicated process for dense interactions with no communication1.
Keywords
digital arithmetic; distributed memory systems; fast Fourier transforms; graphics processing units; signal processing; CPU-dedicated process; GPU acceleration; GPU-dedicated process; arithmetic operations; communication-avoiding FMM-accelerated FFT; communication-avoiding algorithms; communication-avoiding fast Fourier transforms; digital signal processing; distributed memory systems; fast multipole method; interprocessor communication; low-communication FFT; power-burning communication; Accuracy; Feedback amplifier; Graphics processing units; Noise measurement; Signal processing algorithms; Standards;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Extreme Computing Conference (HPEC), 2013 IEEE
Conference_Location
Waltham, MA
Print_ISBN
978-1-4799-1364-0
Type
conf
DOI
10.1109/HPEC.2013.6670352
Filename
6670352
Link To Document