DocumentCode :
3144944
Title :
DAGuE: A Generic Distributed DAG Engine for High Performance Computing
Author :
Bosilca, George ; Bouteiller, Aurelien ; Danalis, Anthony ; Herault, Thomas ; Lemarinier, Pierre ; Dongarra, Jack
Author_Institution :
Innovative Comput. Lab., Univ. of Tennessee, Nashville, TN, USA
fYear :
2011
fDate :
16-20 May 2011
Firstpage :
1151
Lastpage :
1158
Abstract :
The frenetic development of the current architectures places a strain on the current state-of-the-art programming environments. Harnessing the full potential of such architectures has been a tremendous task for the whole scientific computing community. We present DAGuE a generic framework for architecture aware scheduling and management of micro-tasks on distributed many-core heterogeneous architectures. Applications we consider can be represented as a Direct Acyclic Graph of tasks with labeled edges designating data dependencies. DAGs are represented in a compact, problem-size independent format that can be queried on-demand to discover data dependencies, in a totally distributed fashion. DAGuE assigns computation threads to the cores, overlaps communications and computations and uses a dynamic, fully-distributed scheduler based on cache awareness, data-locality and task priority. We demonstrate the efficiency of our approach, using several micro-benchmarks to analyze the performance of different components of the framework, and a Linear Algebra factorization as a use case.
Keywords :
cache storage; directed graphs; linear algebra; matrix decomposition; parallel architectures; processor scheduling; programming environments; scientific information systems; DAGuE; architecture aware scheduling; cache awareness; data dependency; data-locality; direct acyclic graph; distributed many-core heterogeneous architectures; frenetic development; fully-distributed scheduler; generic distributed DAG engine; high performance computing; linear algebra factorization; micro-benchmarks; micro-tasks management; problem-size independent format; scientific computing community; state-of-the-art programming environments; task priority; totally distributed fashion; Benchmark testing; Computer architecture; Engines; Instruction sets; Niobium; Processor scheduling; Tiles;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Workshops and Phd Forum (IPDPSW), 2011 IEEE International Symposium on
Conference_Location :
Shanghai
ISSN :
1530-2075
Print_ISBN :
978-1-61284-425-1
Electronic_ISBN :
1530-2075
Type :
conf
DOI :
10.1109/IPDPS.2011.281
Filename :
6008964
Link To Document :
بازگشت