• DocumentCode
    723686
  • Title

    Hierarchical DAG Scheduling for Hybrid Distributed Systems

  • Author

    Wei Wu ; Bouteiller, Aurelien ; Bosilca, George ; Faverge, Mathieu ; Dongarra, Jack

  • Author_Institution
    Univ. of Tennessee, Knoxville, TN, USA
  • fYear
    2015
  • fDate
    25-29 May 2015
  • Firstpage
    156
  • Lastpage
    165
  • Abstract
    Accelerator-enhanced computing platforms have drawn a lot of attention due to their massive peak commutational capacity. Despite significant advances in the programming interfaces to such hybrid architectures, traditional programming paradigms struggle with mapping the resulting multi-dimensional heterogeneity and the expression of algorithm parallelism, resulting in sub-optimal effective performance. Task-based programming paradigms have the capability to alleviate some of the programming challenges on distributed hybrid many-core architectures. In this paper we take this concept a step further by showing that the potential of task-based programming paradigms can be greatly increased with minimal modification of the underlying runtime combined with the right algorithmic changes. We propose two novel recursive algorithmic variants for one-sided factorizations and describe the changes to the PaRSEC task-scheduling runtime to build a framework where the task granularity is dynamically adjusted to adapt the degree of available parallelism and kernel efficiency according to runtime conditions. Based on an extensive set of results, we show that, with one-sided factorizations, i.e. Colicky, and QR, a carefully written algorithm, supported by an adaptive tasks-based runtime, is capable of reaching a degree of performance and scalability never achieved before in distributed hybrid environments.
  • Keywords
    multiprocessing systems; parallel algorithms; parallel programming; processor scheduling; PaRSEC task-scheduling runtime; accelerator-enhanced computing platforms; adaptive tasks-based runtime; algorithm parallelism; distributed hybrid environments; distributed hybrid many-core architectures; hierarchical DAG scheduling; hybrid architectures; hybrid distributed systems; massive peak commutational capacity; multidimensional heterogeneity; one-sided factorizations; programming interfaces; recursive algorithmic variants; sub-optimal effective performance; task granularity; task-based programming paradigms; traditional programming paradigms; Graphics processing units; Kernel; Linear algebra; Processor scheduling; Programming; Runtime; GPU; PaRSEC runtime; dense linear algebra; heterogeneous architecture;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium (IPDPS), 2015 IEEE International
  • Conference_Location
    Hyderabad
  • ISSN
    1530-2075
  • Type

    conf

  • DOI
    10.1109/IPDPS.2015.56
  • Filename
    7161505