• DocumentCode
    3261374
  • Title

    An operation placement and scheduling scheme for cache and communication localities in fine-grain parallel architectures

  • Author

    Chen, Yen-Kuang ; Kung, S.Y.

  • Author_Institution
    Dept. of Electr. Eng., Princeton Univ., NJ, USA
  • fYear
    1997
  • fDate
    18-20 Dec 1997
  • Firstpage
    390
  • Lastpage
    396
  • Abstract
    With increasing on-chip hardware, concurrency is a way to bridge the gap between the computational power demanded by the applications and that afforded by the computer platforms. Although parallel systems are increasingly popular they remain very difficult to program. In fact, most compilers require the programmer to specify how to partition data or map program code to the system´s processors. To ensure an effective program, cache locality is important because of the large speed gap between microprocessors and memory systems. It is also important to make use of local communication whenever possible, since it is cheaper faster and less power hungry than global communication. In order to exploit these locality properties, we present a systematic operation placement and scheduling scheme for fine-grain parallel architectures. The key advantages are twofolds: (1) This multiprojection method, which deals with multidimensional parallelism systematically, can alleviate the burden of the programmer in coding and data partitioning. (2) it addresses the memory/communication bandwidth bottleneck, and can lend to faster program execution. On a special design example of the motion estimation block-matching algorithm, which requires the most intensive computation and memory accesses in video coding, our method lends to a reduction of external memory accesses by two to three orders of magnitude
  • Keywords
    cache storage; concurrency control; parallel architectures; processor scheduling; cache; communication localities; concurrency; fine-grain parallel architectures; local communication; multiprojection method; operation placement; parallel architectures; placement and scheduling; scheduling; Application software; Bridges; Concurrent computing; Global communication; Hardware; Microprocessors; Parallel architectures; Processor scheduling; Program processors; Programming profession;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel Architectures, Algorithms, and Networks, 1997. (I-SPAN '97) Proceedings., Third International Symposium on
  • Conference_Location
    Taipei
  • ISSN
    1087-4089
  • Print_ISBN
    0-8186-8259-6
  • Type

    conf

  • DOI
    10.1109/ISPAN.1997.645125
  • Filename
    645125