• DocumentCode
    1954761
  • Title

    Efficient parallel merge sort for fixed and variable length keys

  • Author

    Davidson, Andrew ; Tarjan, David ; Garland, Michael ; Owens, John D.

  • Author_Institution
    Univ. of California, Davis, Davis, CA, USA
  • fYear
    2012
  • fDate
    13-14 May 2012
  • Firstpage
    1
  • Lastpage
    9
  • Abstract
    We design a high-performance parallel merge sort for highly parallel systems. Our merge sort is designed to use more register communication (not shared memory), and does not suffer from over-segmentation as opposed to previous comparison based sorts. Using these techniques we are able to achieve a sorting rate of 250 MKeys/sec, which is about 2.5 times faster than Thrust merge sort performance, and 70% faster than non-stable state-of-the-art GPU merge sorts. Building on this sorting algorithm, we develop a scheme for sorting variable-length key/value pairs, with a special emphasis on string keys. Sorting non-uniform, unaligned data such as strings is a fundamental step in a variety of algorithms, yet it has received comparatively little attention. To our knowledge, our system is the first published description of an efficient string sort for GPUs. We are able to sort strings at a rate of 70 MStrings/s on an NVidia GTX 580 on one dataset, and up to 1.25 GB/s on another dataset.
  • Keywords
    graphics processing units; parallel processing; sorting; GPU merge sorts; MStrings/s; NVidia GTX 580; efficient parallel merge sort; fixed length keys; shared memory; sorting algorithm; variable length keys; Abstracts; Registers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Innovative Parallel Computing (InPar), 2012
  • Conference_Location
    San Jose, CA
  • Print_ISBN
    978-1-4673-2632-2
  • Electronic_ISBN
    978-1-4673-2631-5
  • Type

    conf

  • DOI
    10.1109/InPar.2012.6339592
  • Filename
    6339592