• DocumentCode
    2404074
  • Title

    Universal mechanisms for data-parallel architectures

  • Author

    Sankaralingam, Karthikeyan ; Keckler, Stephen W. ; Mark, William R. ; Burger, Doug

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Texas at Austin, TX, USA
  • fYear
    2003
  • fDate
    3-5 Dec. 2003
  • Firstpage
    303
  • Lastpage
    314
  • Abstract
    Data-parallel programs are both growing in importance and increasing in diversity, resulting in specialized processors targeted at specific classes of these programs. This paper presents a classification scheme for data-parallel program attributes, and proposes micro-architectural mechanisms to support applications with diverse behavior for using a single reconfigurable architecture. We focus on the following four broad kinds of data-parallel programs - DSP/multimedia, scientific, networking, and real-time graphics workloads. While all of these programs exhibit high computational intensity, coarse-grain regular control behavior, and some regular memory access behavior, they show wide variance in the computation requirements, fine grain control behavior, and the frequency of other types of memory accesses. Based on this study of application attributes, this paper proposes a set of general micro-architectural mechanisms that enable a baseline architecture to be dynamically tailored to the demands of a particular application. These mechanisms provide efficient execution across a spectrum of data-parallel application and can be applied to diverse architectures ranging from vector cores to conventional superscalar cores. Our results using a baseline TRIPS processor show that the configurability of the architecture to the application demands provides harmonic mean performance improvement of 5%-55% over scalable yet less flexible architectures, and performs competitively against specialized architectures.
  • Keywords
    classification; memory architecture; parallel architectures; parallel programming; reconfigurable architectures; signal processing; TRIPS processor; classification scheme; computational intensity; data-parallel architectures; data-parallel programs; digital signal processing; flexible architectures; memory access; microarchitectural mechanisms; multimedia; networking; real-time graphics; reconfigurable architecture; Application software; Communication system control; Computational modeling; Computer architecture; Digital signal processing chips; Engines; Graphics; Hardware; Laboratories; Reconfigurable architectures;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Microarchitecture, 2003. MICRO-36. Proceedings. 36th Annual IEEE/ACM International Symposium on
  • Print_ISBN
    0-7695-2043-X
  • Type

    conf

  • DOI
    10.1109/MICRO.2003.1253204
  • Filename
    1253204