DocumentCode :
2149832
Title :
Enabling fine-grained OpenMP tasking on tightly-coupled shared memory clusters
Author :
Burgio, Paolo ; Tagliavini, Giuseppe ; Marongiu, Andrea ; Benini, Luca
Author_Institution :
DEIS - Universitá degli Studi di Bologna - Viale Risorgimento 2, 40136 - Italy
fYear :
2013
fDate :
18-22 March 2013
Firstpage :
1504
Lastpage :
1509
Abstract :
Cluster-based architectures are increasingly being adopted to design embedded many-cores. These platforms can deliver very high peak performance within a contained power envelope, provided that programmers can make effective use the available parallel cores. This is becoming an extremely difficult task, as embedded applications are growing in complexity and exhibit irregular and dynamic parallelism. The OpenMP tasking extensions represent a powerful abstraction to capture this form of parallelism. However, efficiently supporting it on cluster-based embedded SoCs is not easy, because the fine-grained parallel workload present in embedded applications can not tolerate high memory and run-time overheads. In this paper we present our design of the runtime support layer to OpenMP tasking for an embedded shared memory cluster, identifying key aspects to achieving performance and discussing important architectural support to removing major bottlenecks.
Keywords :
Computer architecture; Instruction sets; Parallel processing; Programming; Runtime; Switches;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Design, Automation & Test in Europe Conference & Exhibition (DATE), 2013
Conference_Location :
Grenoble, France
ISSN :
1530-1591
Print_ISBN :
978-1-4673-5071-6
Type :
conf
DOI :
10.7873/DATE.2013.306
Filename :
6513752
Link To Document :
بازگشت