DocumentCode :
2176004
Title :
Implementing Parallel LU Factorization with Pipelining on a MultiCore Using OpenMP
Author :
Michailidis, Panagiotis D. ; Margaritis, Konstantinos G.
Author_Institution :
Univ. of Western Macedonia, Florina, Greece
fYear :
2010
fDate :
11-13 Dec. 2010
Firstpage :
253
Lastpage :
260
Abstract :
Recent developments in high performance computer architecture have a significant effect on all fields of scientific computing. Linear algebra, and especially the solution of linear systems of equations, lies at the heart of many applications in scientific computing. This paper describes and analyzes three parallel versions of the dense LU factorization method used in linear system solving on a multicore using the OpenMP interface. More specifically, we present two naive parallel algorithms based on row block and row cyclic data distribution, and we put special emphasis on presenting a third parallel algorithm based on the pipeline technique. Further, we propose an implementation of the pipelining technique in OpenMP. Experimental results on a multicore CPU show that the proposed OpenMP pipeline implementation achieves good overall performance compared to the other two naive parallel methods. Finally, in this work we propose a simple, fast and reasonably analytical model to predict the performance of the LU decomposition method with the pipelining technique.
Keywords :
linear algebra; message passing; multiprocessing systems; parallel algorithms; parallel architectures; LU decomposition; OpenMP interface; OpenMP pipeline; dense LU factorization; high performance computer architecture; linear algebra; linear system solving; linear systems; multicore CPU; multicore processing; naive parallel algorithm; naive parallel method; parallel LU factorization; pipelining technique; row block; row cyclic data distribution; scientific computing; third parallel algorithm; Arrays; Instruction sets; Multicore processing; Parallel algorithms; Pipeline processing; Pipelines; LU; Linear algebra; OpenMP; factorization; multicore; parallel algorithms;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computational Science and Engineering (CSE), 2010 IEEE 13th International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-9591-7
Electronic_ISBN :
978-0-7695-4323-9
Type :
conf
DOI :
10.1109/CSE.2010.39
Filename :
5692483