DocumentCode
2404100
Title
Instruction replication for clustered microarchitectures
Author
Aletà, Alex ; Codina, Josep M. ; González, Antonio ; Kaeli, David
Author_Institution
Dept. of Comput. Archit., UPC, Barcelona, Spain
fYear
2003
fDate
3-5 Dec. 2003
Firstpage
326
Lastpage
335
Abstract
This work presents a new compilation technique that uses instruction replication in order to reduce the number of communications executed on a clustered microarchitecture. For such architectures, the need to communicate values between clusters can result in a significant performance loss. Inter-cluster communications can be reduced by selectively replicating an appropriate set of instructions. However, instruction replication must be done carefully since it may also degrade performance due to the increased contention it can place on processor resources. The proposed scheme is built on top of a previously proposed state-of-the-art modulo scheduling algorithm that effectively reduces communications. Results show that the number of communications can decrease using replication, which results in significant speed-ups. IPC is increased by 25% on average for a 4-cluster microarchitecture and by as mush as 70% for selected programs.
Keywords
instruction sets; parallel architectures; parallelising compilers; clustered microarchitectures; compilation technique; instruction replication; instructions set; intercluster communications; processor resources; scheduling algorithm; Computer architecture; Degradation; Delay; Digital signal processing; Microarchitecture; Performance loss; Power dissipation; Processor scheduling; Scheduling algorithm; Wire;
fLanguage
English
Publisher
ieee
Conference_Titel
Microarchitecture, 2003. MICRO-36. Proceedings. 36th Annual IEEE/ACM International Symposium on
Print_ISBN
0-7695-2043-X
Type
conf
DOI
10.1109/MICRO.2003.1253206
Filename
1253206
Link To Document