DocumentCode :
2342038
Title :
An Algorithm for Parallel Execution of Loops in Chip Multiprocessor Caches
Author :
Subha, S.
Author_Institution :
Santa Clara Univ., Santa Clara, CA, USA
fYear :
2009
fDate :
27-28 Oct. 2009
Firstpage :
85
Lastpage :
89
Abstract :
Loops are predominant in computer programs. Data dependencies in loops dictate their execution time. An algorithm to execute loops in parallel based on the preordering of data in is presented in this paper. This algorithm can be applied to chip multiprocessors. The algorithm performs a fair share allocation of the loops to the available processors. Data accessed for loops accessed in a processor are fetched into its level one cache. The preordered data in the level one cache improve the performance of the cache. An example is simulated for the proposed algorithm and a performance improvement of 92% is observed in the overall execution time.
Keywords :
cache storage; microprocessor chips; multiprocessing systems; parallel processing; program control structures; chip multiprocessor caches; computer programs; data dependencies; data preordering; fair share allocation; parallel loops execution; Communications technology; Concurrent computing; Partitioning algorithms; Yarn; Loops are predominant in computer programs;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advances in Recent Technologies in Communication and Computing, 2009. ARTCom '09. International Conference on
Conference_Location :
Kottayam, Kerala
Print_ISBN :
978-1-4244-5104-3
Electronic_ISBN :
978-0-7695-3845-7
Type :
conf
DOI :
10.1109/ARTCom.2009.60
Filename :
5328072
Link To Document :
بازگشت