DocumentCode :
3546841
Title :
Dataflow language compilation for a single chip massively parallel processor
Author :
de Dinechin, Benoit Dupont
Author_Institution :
CTO, Kalray, France
fYear :
2013
fDate :
7-7 Sept. 2013
Firstpage :
1
Lastpage :
1
Abstract :
Summary form only given. The Kalray MPPA-256 processor (Multi-Purpose Processing Array) integrates 256 processing engine (PE) cores and 32 resource management (RM) cores on a single 28nm CMOS chip. These cores are distributed across 16 compute clusters and 4 I/O subsystems. On-chip communications and synchronizations are supported by an explicitly addressed dual network-on-chip (NoC), with one node per compute cluster and 4 nodes per 4 I/O subsystem. The Kalray MPPA software development kit includes a complete programming environment for a C-based dataflow language, whose compiler fully automates the distributed execution of tasks across the processing, memory, communication and synchronization resources of the MPPA architecture. We first introduce the model of computation of the Kalray dataflow language, which is based on cyclostatic dataflow with extensions such as the firing thresholds of Karp & Miller computation graphs. We then describe the main steps of dataflow compilation to a distributed execution platform. These include: task sequencing, communication buffer sizing, task clustering, DMA engine exploitation, place & route, NoC bandwidth allocation, and generation of run-time tables. Finally, we discuss the suitability and restrictions of this and related static dataflow models of computations with regards to the dynamic and real-time requirements of embedded applications targeted by the MPPA processor.
Keywords :
CMOS integrated circuits; network-on-chip; parallel architectures; C-based dataflow language; CMOS chip; DMA engine exploitation; Kalray MPPA software development; Kalray MPPA-256 processor; Kalray dataflow language; MPPA architecture; NoC bandwidth allocation; communication buffer sizing; cyclostatic dataflow; dataflow language compilation; distributed execution platform; dual network-on-chip; multipurpose processing array; place & route; processing engine cores; resource management cores; single chip massively parallel processor; size 28 nm; task clustering; task sequencing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multi-/Many-core Computing Systems (MuCoCoS), 2013 IEEE 6th International Workshop on
Conference_Location :
Edinburgh
Type :
conf
DOI :
10.1109/MuCoCoS.2013.6633597
Filename :
6633597
Link To Document :
بازگشت