DocumentCode :
565266
Title :
Platform 2012, a many-core computing accelerator for embedded SoCs: Performance evaluation of visual analytics applications
Author :
Melpignano, Diego ; Benini, Luca ; Flamand, Eric ; Jego, Bruno ; Lepley, Thierry ; Haugou, Germain ; Clermidy, Fabien ; Dutoit, Denis
Author_Institution :
AST, STMicroelectron., Grenoble, France
fYear :
2012
fDate :
3-7 June 2012
Firstpage :
1137
Lastpage :
1142
Abstract :
P2012 is an area- and power-efficient many-core computing accelerator based on multiple globally asynchronous, locally synchronous processor clusters. Each cluster features up to 16 processors with independent instruction streams sharing a multi-banked one-cycle access L1 data memory, a multi-channel DMA engine and specialized hardware for synchronization and aggressive power management. P2012 is 3D stacking ready and can be customized to achieve extreme area and energy efficiency by adding domain-specific HW IPs to the cluster. The first P2012 SoC prototype in 28nm CMOS will sample in Q3, featuring four 16-processor clusters, a 1MB L2 memory and delivering 80GOPS (with 32 bit single precision floating point support) in 18mm2 with 2W power consumption (worst-case). P2012 can run standard OpenCL™ and proprietary Native Programming Model SW components to achieve the highest level of control on application-to-resource mapping. A dedicated version of the OpenCV vision library is provided in the P2012 SW Development Kit to enable visual analytics acceleration. This paper will discuss preliminary performance measurements of common feature extraction and tracking algorithms, parallelized on P2012, versus sequential execution on ARM CPUs.
Keywords :
CMOS memory circuits; embedded systems; energy conservation; feature extraction; file organisation; microprocessor chips; multiprocessing systems; parallel architectures; performance evaluation; power aware computing; synchronisation; system-on-chip; 16-processor clusters; 3D stacking; ARM CPU; CMOS; L2 memory; OpenCV vision library; P2012; P2012 SW Development Kit; P2012 SoC prototype; application-to-resource mapping; area-efficient many-core computing accelerator; domain-specific HW IP; embedded SoC; energy efficiency; feature extraction; instruction streams; multibanked one-cycle access L1 data memory; multichannel DMA engine; multiple globally asynchronous locally synchronous processor clusters; native programming model; performance evaluation; performance measurements; power 2 W; power management; power-efficient many-core computing accelerator; sequential execution; size 28 nm; synchronization; tracking algorithms; visual analytics applications; Acceleration; Computer architecture; Fabrics; Hardware; Programming; Software; System-on-a-chip; 3D stacking; Low-power; SoC; computer vision; feature extraction; many-core; process aware;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Design Automation Conference (DAC), 2012 49th ACM/EDAC/IEEE
Conference_Location :
San Francisco, CA
ISSN :
0738-100X
Print_ISBN :
978-1-4503-1199-1
Type :
conf
Filename :
6241648
Link To Document :
بازگشت