DocumentCode
649083
Title
Bit matrix transpose with tensor product and perfect shuffling
Author
Jui-Chieh Lin ; Yu Hen Hu
Author_Institution
Dept. of Electr. & Comput. Eng., Univ. of Wisconsin Madison, Madison, WI, USA
fYear
2013
fDate
16-18 Oct. 2013
Firstpage
389
Lastpage
394
Abstract
Efficient implementation of bit-level matrix transpose operations using a new family of perfect shuffle instructions in a word-based micro-processor architecture is investigated. Using a tensor representation of the perfect shuffle index mapping, we show how bit matrix of arbitrary size may be transposed using a minimal sequence of shuffle and pack instructions, and prove the correctness of the code so generated. Specifically, a shuffle decomposition approach, and a block matrix transpose approach are proposed and the performance of these approaches are compared. Implemented with Texas Instruments C64X instruction set, significant performance enhancement is observed.
Keywords
instruction sets; matrix algebra; microprocessor chips; radio transceivers; software radio; tensors; Texas Instruments C64X instruction set; bit-level matrix transpose operations; block matrix transpose approach; minimal pack instruction sequence; minimal shuffle instruction sequence; perfect shuffle index mapping; performance enhancement; shuffle decomposition approach; software defined radio; tensor product; tensor representation; wireless communication stan- dards; wireless transceivers; word-based microprocessor architecture; Bit matrix transpose; perfect shuffle; tensor;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Systems (SiPS), 2013 IEEE Workshop on
Conference_Location
Taipei City
ISSN
2162-3562
Type
conf
DOI
10.1109/SiPS.2013.6674538
Filename
6674538
Link To Document