DocumentCode :
234595
Title :
Exploiting ILP, DLP, TLP, and MPI to accelerate matrix multiplication on Xeon processors
Author :
Soliman, Mostafa I. ; Ahmed, Fatma S.
Author_Institution :
Comput. Sci. & Inf. Dept., Taibah Univ., Al-Madinah, Saudi Arabia
fYear :
2014
fDate :
19-20 April 2014
Firstpage :
1
Lastpage :
6
Abstract :
Matrix multiplication is one of the most important kernels used in dense linear algebra codes. It is a computationally intensive kernel that demands exploiting all available forms of parallelism to improve its performance. In this paper, ILP, DLP, TLP, and MPI are exploited to accelerate the execution of matrix multiplication on a cluster of computers with Intel Xeon processors. In addition, Strassen's algorithm, which reduces the arithmetic operations of matrix multiplication from O(n^3) to O(n^2.807), is implemented using multi-threading, SIMD, blocking, and MPI techniques. Our results show that the average speedups of the traditional matrix multiplication algorithm on large matrices (from 6000×6000 to 9000×9000) are 14.9, 24.1, 24, 22.8, and 20.7 on 2, 4, 6, 8, and 10 computers, respectively. For Strassen's algorithm, however, the average speedups are 3.9, 6.8, 6.8, and 10.3 on 2, 4, 6, and 7 computers, respectively, when the same parallel processing techniques are applied.
Keywords :
computational complexity; matrix multiplication; message passing; multi-threading; parallel processing; DLP; ILP; Intel Xeon processors; MPI; SIMD; Strassen algorithm; TLP; computational intensive kernel; data-level parallelism; dense linear algebra codes; instruction-level parallelism; message passing interface; multithreading; parallel processing techniques; single instruction for processing multiple data; thread-level parallelism; Algebra; Strassen's algorithm; blocking
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Engineering and Technology (ICET), 2014 International Conference on
Conference_Location :
Cairo
Type :
conf
DOI :
10.1109/ICEngTechnol.2014.7016779
Filename :
7016779