• DocumentCode
    413108
  • Title

    Priority-driven active data prefetching

  • Author

    Zhu, Ming ; Narravula, Harsha ; Katsinis, Constantine ; Hecht, Diana

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Drexel Univ., Philadelphia, PA, USA
  • fYear
    2004
  • fDate
    26-30 April 2004
  • Firstpage
    256
  • Abstract
    Summary form only given. Data cache misses reduce the performance of wide-issue processors by stalling the data supply to the processor. It is especially worse in the DSM environment. Prefetching data for the critical data address misses is one way to tolerate the cache miss latencies. But current applications with irregular access patterns make it difficult to prefetch data sufficiently early to mask large cache miss latencies, especially in multithreaded applications. To complement prefetching in a multithreaded environment, this paper proposes an approach to prefetch data addresses by a priority-driven method. The method introduced in this paper is a novel approach for dynamically identifying and precompiling the data addresses of the instructions marked as in a higher priority critical path of an application. The critical path can be identified at compile-time or run-time. A separate engine calculates the data addresses of the identified instructions in the critical path and prefetches early enough, the data that are used in the next critical instruction. Preliminary results show that a priority-driven prefetching is useful. It reduces the completion time of an application significantly. The approach improved the overall performance in three experiments conducted with active prefetching, over traditional prefetching, especially in the matrix-matrix multiplication, in our simulator.
  • Keywords
    cache storage; distributed shared memory systems; multi-threading; critical path; data cache miss latencies; matrix-matrix multiplication; multithreaded environment; priority-driven active data prefetching; Application software; Computer aided instruction; Concurrent computing; Delay; Distributed computing; Engines; Prefetching; Processor scheduling; Runtime; Yarn;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium, 2004. Proceedings. 18th International
  • Print_ISBN
    0-7695-2132-0
  • Type

    conf

  • DOI
    10.1109/IPDPS.2004.1303322
  • Filename
    1303322