DocumentCode :
3042737
Title :
An O(n) Time-Complexity Matrix Transpose on Torus Array Processor
Author :
Ravankar, Abhijeet A. ; Sedukhin, Stanislav G.
Author_Institution :
Grad. Sch. of Comput. Sci. & Eng., Univ. of Aizu, Aizu-Wakamatsu, Japan
fYear :
2011
fDate :
Nov. 30 2011-Dec. 2 2011
Firstpage :
242
Lastpage :
247
Abstract :
Matrix transpose is an essential operation in many applications like signal processing (ex. linear transforms) etc. and an efficient matrix transpose algorithm can speed up many applications. In this paper, we propose a new algorithm for n x n matrix transposition on array processors connected in torus network. The algorithm has O(n) time complexity. The algorithm uses matrix-matrix multiply-add (MMA) operation for transposing the matrix. We show how to align data and give algorithm for generating permutation matrices. The entire n x n matrix transposition is carried out in 5n time-steps. This approach does not require any dedicated connections for matrix transposition. Both input and output matrices are in canonical (natural and not skewed) layout. We also discuss blocked matrix transposition.
Keywords :
computational complexity; matrix algebra; parallel processing; MMA operation; O(n) time-complexity matrix transpose; array processors; blocked matrix transposition; canonical layout; input matrices; linear transforms; matrix transpose algorithm; matrix-matrix multiply-add operation; output matrices; permutation matrices; signal processing; torus array processor; torus network; Arrays; Image sensors; Layout; Registers; Routing; Sensors; Signal processing algorithms; Matrix transpose; matrix multiplication; permutation matrices; torus array processors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Networking and Computing (ICNC), 2011 Second International Conference on
Conference_Location :
Osaka
Print_ISBN :
978-1-4577-1796-3
Type :
conf
DOI :
10.1109/ICNC.2011.43
Filename :
6131813
Link To Document :
بازگشت