• DocumentCode
    2092664
  • Title

    A Partitioning Algorithm for Parallel Sorting on Distributed Memory Systems

  • Author

    Hofmann, Michael ; Rünger, Gudula

  • Author_Institution
    Dept. of Comput. Sci., Chemnitz Univ. of Technol., Chemnitz, Germany
  • fYear
    2011
  • fDate
    2-4 Sept. 2011
  • Firstpage
    402
  • Lastpage
    411
  • Abstract
    Parallel sorting methods for distributed memory systems often use partitioning algorithms to prepare the redistribution of data items. This article proposes a partitioning algorithm that calculates a redistribution specified by the number of data items to be finally located on each process. This partitioning algorithm can also be used for data items with weights, which might express a computational load to be expected, and to produce a redistribution with an individual accumulated weight of data items specified for each process. Another important feature is that data sets with duplicated data keys can be handled. Parallel sorting with those properties is often needed for parallel scientific application codes, such as particle simulations, in which the dynamics of the simulated system may destroy locality and load balance required for an efficient computation. It is applied to random sample data and to a particle simulation code requiring a sorting. Performance results have been obtained on an IBM Blue Gene/P platform with up to 32768 cores. The results show that the proposed parallel sorting method performs well in comparison to other existing algorithms.
  • Keywords
    data analysis; distributed memory systems; parallel algorithms; resource allocation; set theory; sorting; IBM Blue Gene-P platform; computational load; data item redistribution; data sets; distributed memory system; duplicated data key; individual accumulated weight; load balance; parallel scientific application code; parallel sorting method; particle simulation code; partitioning algorithm; random sample data; Distributed databases; Load management; Load modeling; Partitioning algorithms; Process control; Sorting; Upper bound; data redistribution; load balancing; parallel sorting; particle simulations; performance optimization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing and Communications (HPCC), 2011 IEEE 13th International Conference on
  • Conference_Location
    Banff, AB
  • Print_ISBN
    978-1-4577-1564-8
  • Electronic_ISBN
    978-0-7695-4538-7
  • Type

    conf

  • DOI
    10.1109/HPCC.2011.59
  • Filename
    6063018