Title :
Register organization for media processing
Author :
Rixner, Scott ; Dally, William J. ; Khailany, Brucek ; Mattson, Peter ; Kapasi, Ujval J. ; Owens, John D.
fDate :
6/22/1905 12:00:00 AM
Abstract :
Processor architectures with tens to hundreds of arithmetic units are emerging to handle media processing applications. These applications, such as image coding, image synthesis and image understanding, require arithmetic rates of up to 1011 operations per second. As the number of arithmetic units in a processor increases to meet these demands, register storage and communication between the arithmetic units dominate the area, delay and power of the arithmetic units. In this paper, we show that partitioning the register file along three axes reduces the cost of register storage and communication without significantly impacting performance. We develop a taxonomy of register architectures by partitioning across the data-parallel, instruction-level-parallel and memory-hierarchy axes, and by optimizing the hierarchical register organization for operation on streams of data. Compared to a centralized global register file, the most compact of these organizations reduces the register file area, delay and power dissipation of a media processor by factors of 195, 230 and 430 respectively. This reduction in cost is achieved with a performance degradation of only 8% on a representative set of media processing benchmarks
Keywords :
delays; digital arithmetic; file organisation; image processing; parallel memories; performance evaluation; arithmetic unit communication; cost reduction; data parallelism; data streams; delay; hierarchical register organization optimization; image coding; image synthesis; image understanding; instruction-level parallelism; media processing; memory hierarchy; partitioning; performance; performance degradation; power dissipation; processor architectures; register architecture taxonomy; register file area; register file partitioning; register storage; Decision support systems; Virtual reality;
Conference_Titel :
High-Performance Computer Architecture, 2000. HPCA-6. Proceedings. Sixth International Symposium on
Conference_Location :
Touluse
Print_ISBN :
0-7695-0550-3
DOI :
10.1109/HPCA.2000.824366