DocumentCode
3219262
Title
Impact on performance of fused multiply-add units in aggressive VLIW architectures
Author
López, David ; Llosa, Josep ; Ayguadé, Eduard ; Valero, Mateo
Author_Institution
Dept. d´´Arquitectura de Computadors, Univ. Politecnica de Catalunya, Barcelona, Spain
fYear
1999
fDate
1999
Firstpage
22
Lastpage
29
Abstract
Loops are the main time consuming part of programs based on floating point computations. The performance of the loops is limited either by recurrences in the computation or by the resources offered by the architecture. Several general-purpose superscalar microprocessors have been implemented with multiply-add fused floating-point units, that reduces the latency of the combined operation and the number of resources used. This paper analyses the influence of these two factors in the instruction-level parallelism exploitable from loops executed on a broad set of future aggressive processor configurations. The estimation of implementation costs (area and cycle time) enables a fair comparison of these configurations in terms of final performance and implementation feasibility. The paper performs technological projection for the next years in order to foresee the possible implementable alternatives. From this study we conclude that multiply-add fused units may have a deep impact in raising the performance of future processor architectures with a reasonable increase in cost
Keywords
floating point arithmetic; parallel architectures; performance evaluation; aggressive VLIW architectures; aggressive processor configurations; floating point computations; fused multiply-add units; implementation feasibility; instruction-level parallelism; performance; superscalar microprocessors; Arithmetic; Computer architecture; Costs; Delay; Estimation theory; Optimal scheduling; Processor scheduling; Registers; VLIW;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel Processing, 1999. Proceedings. 1999 International Conference on
Conference_Location
Aizu-Wakamatsu City
ISSN
0190-3918
Print_ISBN
0-7695-0350-0
Type
conf
DOI
10.1109/ICPP.1999.797384
Filename
797384
Link To Document