DocumentCode
1997187
Title
Accelerating the 3D Elastic Wave Forward Modeling on GPU and MIC
Author
Yang You ; Haohuan Fu ; Xiaomeng Huang ; Guojie Song ; Lin Gan ; Wenjian Yu ; Guangwen Yang
Author_Institution
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
fYear
2013
fDate
20-24 May 2013
Firstpage
1088
Lastpage
1096
Abstract
The forward modeling of wave propagation is a widely-used computational method in oil and gas exploration. Its iterative stencil loops also have broad applications in scientific computing. However, the time-consuming iterative stencil loops greatly limit the exploration efficiency. In this paper, we accelerate the forward modeling on a number of different parallel architectures such as multi-core CPUs, NVIDIA Tesla GPUs (448-core Fermi and 2496 SP-cores & 832 DP-units Kepler) and the latest Intel MIC (61-core Knights Corner). For the GPU platform, we propose a 5-slice scheme to handle the problem of limited block registers and we design two parallel strategies to explore the applicabilities of Fermi and Kepler architectures; for CPUs and MIC, the SIMD vectorization scheme plays the most important role in acceleration. Although our complex stencil poses a great challenge for the optimization on GPU, we manage to achieve 4.31x and 6.03x speedups by Fermi and Kepler compared with a parallel CPU version that runs on 12 cores. The best speedup on MIC is 3.76x over the parallel CPU version. We also give a detailed comparison between the GPU and the MIC architectures. Our analysis on their advantages and constraints could be served as scenarios for scientists and developers to find the suitable accelerators towards target applications.
Keywords
elastic waves; graphics processing units; iterative methods; multiprocessing systems; parallel architectures; 3D elastic wave forward modeling; 5-slice scheme; CPUs; Fermi architecture; GPU architecture; Kepler architecture; MIC architecture; SIMD vectorization scheme; block registers; complex stencil; gas exploration; oil exploration; parallel architecture; parallel strategies; scientific computing; target applications; time-consuming iterative stencil loops; wave propagation forward modeling; Acceleration; Computer architecture; Graphics processing units; Instruction sets; Mathematical model; Microwave integrated circuits; Registers; Kepler; Knights-Corner; forward-model; stencil;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International
Conference_Location
Cambridge, MA
Print_ISBN
978-0-7695-4979-8
Type
conf
DOI
10.1109/IPDPSW.2013.216
Filename
6650994
Link To Document