Title :
Exploiting the Inter-cluster Record Reuse for Stream Processors
Author :
Ying Zhang ; Gen Li ; Caixia Sun ; Hongwei Zhou ; Fayuan Wang
Author_Institution :
Sch. of Comput., Nat. Univ. of Defense Technol., Changsha, China
Abstract :
Memory accesses limit the performance of stream processors. The stream compiler exploits the reuse of records distributed across different ALU clusters by introducing inter-cluster communication, which degrades program performance. This paper presents the Stream Transpose (ST) approach to exploiting such reuse. By reorganizing the data, the approach places records that would otherwise be distributed across neighboring ALU clusters on the same ALU cluster, and so exploits the reuse without any inter-cluster communication. Experimental results show that the approach can exploit the reuse of records distributed among ALU clusters without any inter-cluster communication and without degrading stream accesses, and that it achieves a speedup of up to 1.46 over the approach that uses inter-cluster communication.
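The data reorganization described in the abstract can be illustrated with a small sketch. It is not the authors' implementation. It assumes that a stream is dealt to the ALU clusters in interleaved order (the record at stream position p goes to cluster p mod C), and that the stream length is a multiple of the cluster count C. The function names `records_per_cluster` and `stream_transpose` are hypothetical, chosen only for this illustration.

```python
def records_per_cluster(stream, num_clusters):
    """Show which records each cluster receives.

    Assumption: the record at position p is processed by cluster
    p % num_clusters (interleaved distribution).
    """
    return [stream[c::num_clusters] for c in range(num_clusters)]


def stream_transpose(stream, num_clusters):
    """Reorder a stream so each cluster receives a contiguous block.

    The stream is viewed as a (num_clusters x rows) row-major matrix and
    emitted in transposed order. Hypothetical helper, not the paper's API.
    """
    n = len(stream)
    if n % num_clusters != 0:
        raise ValueError("stream length must be a multiple of num_clusters")
    rows = n // num_clusters  # records handled by each cluster
    return [
        stream[c * rows + r]
        for r in range(rows)
        for c in range(num_clusters)
    ]


if __name__ == "__main__":
    stream = list(range(8))
    # Before: neighboring records 0 and 1 sit on different clusters.
    print(records_per_cluster(stream, 4))
    # After: neighboring records 0 and 1 sit on the same cluster.
    print(records_per_cluster(stream_transpose(stream, 4), 4))
```

In this toy layout, records at the edges of two blocks (for example 1 and 2 above) still land on different clusters; how such boundaries are handled is not stated in the abstract and is left out of the sketch.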
Keywords :
microprocessor chips; multiprocessing systems; program compilers; storage management; ALU cluster; intercluster communication; intercluster record reuse; memory access; stream processor; stream transpose approach; Arrays; Clustering algorithms; Kernel; Optimization; Program processors; Streaming media
Conference_Title :
High Performance Computing and Communications, 2014 IEEE 6th Intl Symp on Cyberspace Safety and Security, 2014 IEEE 11th Intl Conf on Embedded Software and Syst (HPCC,CSS,ICESS), 2014 IEEE Intl Conf on
Print_ISBN :
978-1-4799-6122-1
DOI :
10.1109/HPCC.2014.159