مرکز منطقه ای اطلاع رساني علوم و فناوري - Dataflow language compilation for a single chip massively parallel processor

DocumentCode :

3546841

Title :

Dataflow language compilation for a single chip massively parallel processor

Author :

de Dinechin, Benoit Dupont

Author_Institution :

CTO, Kalray, France

fYear :

2013

fDate :

7-7 Sept. 2013

Firstpage :

Lastpage :

Abstract :

Summary form only given. The Kalray MPPA-256 processor (Multi-Purpose Processing Array) integrates 256 processing engine (PE) cores and 32 resource management (RM) cores on a single 28nm CMOS chip. These cores are distributed across 16 compute clusters and 4 I/O subsystems. On-chip communications and synchronizations are supported by an explicitly addressed dual network-on-chip (NoC), with one node per compute cluster and 4 nodes per 4 I/O subsystem. The Kalray MPPA software development kit includes a complete programming environment for a C-based dataflow language, whose compiler fully automates the distributed execution of tasks across the processing, memory, communication and synchronization resources of the MPPA architecture. We first introduce the model of computation of the Kalray dataflow language, which is based on cyclostatic dataflow with extensions such as the firing thresholds of Karp & Miller computation graphs. We then describe the main steps of dataflow compilation to a distributed execution platform. These include: task sequencing, communication buffer sizing, task clustering, DMA engine exploitation, place & route, NoC bandwidth allocation, and generation of run-time tables. Finally, we discuss the suitability and restrictions of this and related static dataflow models of computations with regards to the dynamic and real-time requirements of embedded applications targeted by the MPPA processor.

Keywords :

CMOS integrated circuits; network-on-chip; parallel architectures; C-based dataflow language; CMOS chip; DMA engine exploitation; Kalray MPPA software development; Kalray MPPA-256 processor; Kalray dataflow language; MPPA architecture; NoC bandwidth allocation; communication buffer sizing; cyclostatic dataflow; dataflow language compilation; distributed execution platform; dual network-on-chip; multipurpose processing array; place & route; processing engine cores; resource management cores; single chip massively parallel processor; size 28 nm; task clustering; task sequencing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Multi-/Many-core Computing Systems (MuCoCoS), 2013 IEEE 6th International Workshop on

Conference_Location :

Edinburgh

Type :

conf

DOI :

10.1109/MuCoCoS.2013.6633597

Filename :

6633597

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3546841