DocumentCode
413108
Title
Priority-driven active data prefetching
Author
Zhu, Ming ; Narravula, Harsha ; Katsinis, Constantine ; Hecht, Diana
Author_Institution
Dept. of Electr. & Comput. Eng., Drexel Univ., Philadelphia, PA, USA
fYear
2004
fDate
26-30 April 2004
Firstpage
256
Abstract
Summary form only given. Data cache misses reduce the performance of wide-issue processors by stalling the data supply to the processor. It is especially worse in the DSM environment. Prefetching data for the critical data address misses is one way to tolerate the cache miss latencies. But current applications with irregular access patterns make it difficult to prefetch data sufficiently early to mask large cache miss latencies, especially in multithreaded applications. To complement prefetching in a multithreaded environment, this paper proposes an approach to prefetch data addresses by a priority-driven method. The method introduced in this paper is a novel approach for dynamically identifying and precompiling the data addresses of the instructions marked as in a higher priority critical path of an application. The critical path can be identified at compile-time or run-time. A separate engine calculates the data addresses of the identified instructions in the critical path and prefetches early enough, the data that are used in the next critical instruction. Preliminary results show that a priority-driven prefetching is useful. It reduces the completion time of an application significantly. The approach improved the overall performance in three experiments conducted with active prefetching, over traditional prefetching, especially in the matrix-matrix multiplication, in our simulator.
Keywords
cache storage; distributed shared memory systems; multi-threading; critical path; data cache miss latencies; matrix-matrix multiplication; multithreaded environment; priority-driven active data prefetching; Application software; Computer aided instruction; Concurrent computing; Delay; Distributed computing; Engines; Prefetching; Processor scheduling; Runtime; Yarn;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Processing Symposium, 2004. Proceedings. 18th International
Print_ISBN
0-7695-2132-0
Type
conf
DOI
10.1109/IPDPS.2004.1303322
Filename
1303322
Link To Document