DocumentCode :
2540422
Title :
Memory hierarchy considerations for fast transpose and bit-reversals
Author :
Gatlin, Kang Su ; Carter, Larry
Author_Institution :
Dept. of Comput. Sci. & Eng., California Univ., San Diego, La Jolla, CA, USA
fYear :
1999
fDate :
9-13 Jan 1999
Firstpage :
33
Lastpage :
42
Abstract :
This paper explores the interplay between algorithm design and a computer´s memory hierarchy. Matrix transpose and the bit-reversal reordering are important scientific subroutines which often exhibit severe performance degradation due to cache and TLB associativity problems. We give lower bounds that show for typical memory hierarchy designs, extra data movement is unavoidable. We also prescribe characteristics of various levels of the memory hierarchy needed to perform efficient bit-reversals. Insight gained from our analysis leads to the design of a near optimal bit-reversal algorithm. This Cache Optimal Bit Reverse Algorithm (COBRA) is implemented on the Digital Alpha 21164, Sun Ultrasparc 2, and IBM Power2. We show that COBRA is near optimal with respect to execution time on these machines and performs much better than previous best known algorithms
Keywords :
cache storage; matrix algebra; subroutines; Cache Optimal Bit Reverse Algorith; Digital Alpha 21164; IBM Power2; Sun Ultrasparc 2; algorithm design; bit-reversal reordering; computer memory hierarchy; execution time; extra data movement; fast transpose; matrix transpose; near optimal bit-reversal algorithm; performance degradation; scientific subroutines; Algorithm design and analysis; Application software; Arithmetic; Computer science; Drives; Linear algebra; Optimizing compilers; Ores; Read only memory; Sun;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High-Performance Computer Architecture, 1999. Proceedings. Fifth International Symposium On
Conference_Location :
Orlando, FL
Print_ISBN :
0-7695-0004-8
Type :
conf
DOI :
10.1109/HPCA.1999.744320
Filename :
744320
Link To Document :
بازگشت