• DocumentCode
    3144944
  • Title

    DAGuE: A Generic Distributed DAG Engine for High Performance Computing

  • Author

    Bosilca, George ; Bouteiller, Aurelien ; Danalis, Anthony ; Herault, Thomas ; Lemarinier, Pierre ; Dongarra, Jack

  • Author_Institution
    Innovative Comput. Lab., Univ. of Tennessee, Nashville, TN, USA
  • fYear
    2011
  • fDate
    16-20 May 2011
  • Firstpage
    1151
  • Lastpage
    1158
  • Abstract
    The frenetic development of the current architectures places a strain on the current state-of-the-art programming environments. Harnessing the full potential of such architectures has been a tremendous task for the whole scientific computing community. We present DAGuE a generic framework for architecture aware scheduling and management of micro-tasks on distributed many-core heterogeneous architectures. Applications we consider can be represented as a Direct Acyclic Graph of tasks with labeled edges designating data dependencies. DAGs are represented in a compact, problem-size independent format that can be queried on-demand to discover data dependencies, in a totally distributed fashion. DAGuE assigns computation threads to the cores, overlaps communications and computations and uses a dynamic, fully-distributed scheduler based on cache awareness, data-locality and task priority. We demonstrate the efficiency of our approach, using several micro-benchmarks to analyze the performance of different components of the framework, and a Linear Algebra factorization as a use case.
  • Keywords
    cache storage; directed graphs; linear algebra; matrix decomposition; parallel architectures; processor scheduling; programming environments; scientific information systems; DAGuE; architecture aware scheduling; cache awareness; data dependency; data-locality; direct acyclic graph; distributed many-core heterogeneous architectures; frenetic development; fully-distributed scheduler; generic distributed DAG engine; high performance computing; linear algebra factorization; micro-benchmarks; micro-tasks management; problem-size independent format; scientific computing community; state-of-the-art programming environments; task priority; totally distributed fashion; Benchmark testing; Computer architecture; Engines; Instruction sets; Niobium; Processor scheduling; Tiles;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Workshops and Phd Forum (IPDPSW), 2011 IEEE International Symposium on
  • Conference_Location
    Shanghai
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-61284-425-1
  • Electronic_ISBN
    1530-2075
  • Type

    conf

  • DOI
    10.1109/IPDPS.2011.281
  • Filename
    6008964