• DocumentCode
    1772626
  • Title

    An approach of processor core customization for stencil computation

  • Author

    Yanhua Li ; Youhui Zhang ; Jianfeng Yang ; Luk, Wayne ; Guangwen Yang ; Weimin Zheng

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
  • fYear
    2014
  • fDate
    18-20 June 2014
  • Firstpage
    182
  • Lastpage
    183
  • Abstract
    Architecture customization is believed as one of the most promising methods to meet ever-increasing computing needs and power density limitations. This paper presents an approach to enhance a preliminary customizable core with some common architecture features, to adapt to the specific applications while keeping the programming flexibility. Those features include several effective software/hardware co-optimizing strategies, such as loop tiling, pre-fetching, cache customization, customized Single Instruction Multiple Data (SIMD) and Direct Memory Access (DMA), as well as the necessary ISA extensions. Currently we select stencil computation as the research target. Detailed tests of power-efficiency to evaluate the effect of all these optimizations comprehensively shows impressive performance speedup and power efficiency, even compared to X86, GPU and FPGA platforms. All these proposed customizations here could be applied to other computing applications.
  • Keywords
    computer architecture; field programmable gate arrays; file organisation; graphics processing units; microprocessor chips; multiprocessing systems; parallel processing; DMA; FPGA platforms; GPU platforms; SIMD; X86; architecture customization; architecture features; cache customization; customizable core; customized single instruction multiple data; direct memory access; loop tiling; power density limitations; prefetching; processor core customization approach; programming flexibility; software/hardware cooptimizing strategies; stencil computation; Computer architecture; Energy consumption; Field programmable gate arrays; Graphics processing units; Hardware; Optimization; customizable processor; high-performance computing; software/hardware co-design; stencil computation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Application-specific Systems, Architectures and Processors (ASAP), 2014 IEEE 25th International Conference on
  • Conference_Location
    Zurich
  • Type

    conf

  • DOI
    10.1109/ASAP.2014.6868656
  • Filename
    6868656