• DocumentCode
    2285673
  • Title

    Architecture and compiler co-optimization for high performance computing

  • Author

    Nakamura, Hiroshi ; Kondo, Masaaki ; Ohneda, Taku ; Fujita, Motonobu ; Chiba, Shigeru ; Sato, Mitsuhisa ; Boku, Taisuke

  • Author_Institution
    Res. Center for Adv. Sci. & Technol., Univ. of Tokyo, Japan
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    50
  • Lastpage
    56
  • Abstract
    The performance gap between processor and memory is very serious problem in high-performance computing because effective performance is limited by memory ability. In order to overcome this problem, it is indispensable to make good use of wide on-chip memory bandwidth. For this purpose, architecture and compiler co-optimization is a promising approach because most data access is regular and/or predictable in high performance computing. Thus, we propose a new VLSI architecture called SCIMA as a platform of the co-optimization. SCIMA integrates software controllable memory (SCM) into a processor chip in addition to ordinary data cache. SCM and cache can be reconfigured by software during computation. Hence, the memory hierarchy itself is the target of compiler optimization. In this sense, architecture and compiler co-optimization is realized in SCIMA. Towards the co-optimization, we have developed a directive-based compiler and an algorithm of SCM usage to insert directives automatically. In this paper, we present the directives and the outline of the algorithm for automatic optimization.
  • Keywords
    VLSI; cache storage; memory architecture; optimising compilers; parallel architectures; performance evaluation; SCIMA; VLSI architecture; architecture compiler co-optimization; automatic optimization; cache reconfiguration; compiler optimization; directive-based compiler; high performance computing; memory hierarchy; performance gap; processor chip; software controllable memory; wide on-chip memory bandwidth; Bandwidth; Computer architecture; Delay; Electronic mail; Hardware; High performance computing; Optimizing compilers; Prefetching; Throughput; Very large scale integration;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Innovative Architecture for Future Generation High-Performance Processors and Systems, 2002. International Workshop on
  • ISSN
    1537-3223
  • Print_ISBN
    0-7695-1635-1
  • Type

    conf

  • DOI
    10.1109/IWIA.2002.1035018
  • Filename
    1035018