• DocumentCode
    385652
  • Title

    Subword sorting with versatile permutation instructions

  • Author

    Shi, Zhijie ; Lee, Ruby B.

  • Author_Institution
    Dept. of Electr. Eng., Princeton Univ., NJ, USA
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    234
  • Lastpage
    241
  • Abstract
    Subword parallelism has succeeded in accelerating many multimedia applications. Subword permutation instructions have been proposed to efficiently rearrange subwords in or among registers. Bit-level permutation instructions have also been proposed recently for their importance in cryptography. However, important algorithms, especially those with many conditional control dependencies such as sorting, have not exploited the advantage of subword parallel instructions. In this paper, we show how one of the bit permutation instructions, GRP, can be used for fast sorting. In the process, we demonstrate the versatility of this permutation instruction for uses other than bit permutations. This versatility is important in considering the addition of a new instruction to a general-purpose processor. The results show that our sorting methods have a significant speedup even when compared with the fastest sorting algorithms. We also discuss the hardware implementation of the GRP instruction and compare its latency to a typical processor´s cycle time.
  • Keywords
    cryptography; instruction sets; microprocessor chips; multimedia computing; parallel algorithms; sorting; GRP; bit permutation instructions; bit-level permutation instructions; conditional control dependencies; cryptography; fast sorting; general-purpose processor; hardware implementation; latency; multimedia applications; processor cycle time; registers; speedup; subword parallel instructions; subword parallelism; subword permutation instructions; subword rearrangement; subword sorting; versatile permutation instructions; Computer architecture; Concurrent computing; Cryptography; Instruction sets; Microprocessors; Parallel processing; Performance gain; Reduced instruction set computing; Registers; Sorting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Design: VLSI in Computers and Processors, 2002. Proceedings. 2002 IEEE International Conference on
  • ISSN
    1063-6404
  • Print_ISBN
    0-7695-1700-5
  • Type

    conf

  • DOI
    10.1109/ICCD.2002.1106776
  • Filename
    1106776