DocumentCode :
413108
Title :
Priority-driven active data prefetching
Author :
Zhu, Ming ; Narravula, Harsha ; Katsinis, Constantine ; Hecht, Diana
Author_Institution :
Dept. of Electr. & Comput. Eng., Drexel Univ., Philadelphia, PA, USA
fYear :
2004
fDate :
26-30 April 2004
Firstpage :
256
Abstract :
Summary form only given. Data cache misses reduce the performance of wide-issue processors by stalling the data supply to the processor. It is especially worse in the DSM environment. Prefetching data for the critical data address misses is one way to tolerate the cache miss latencies. But current applications with irregular access patterns make it difficult to prefetch data sufficiently early to mask large cache miss latencies, especially in multithreaded applications. To complement prefetching in a multithreaded environment, this paper proposes an approach to prefetch data addresses by a priority-driven method. The method introduced in this paper is a novel approach for dynamically identifying and precompiling the data addresses of the instructions marked as in a higher priority critical path of an application. The critical path can be identified at compile-time or run-time. A separate engine calculates the data addresses of the identified instructions in the critical path and prefetches early enough, the data that are used in the next critical instruction. Preliminary results show that a priority-driven prefetching is useful. It reduces the completion time of an application significantly. The approach improved the overall performance in three experiments conducted with active prefetching, over traditional prefetching, especially in the matrix-matrix multiplication, in our simulator.
Keywords :
cache storage; distributed shared memory systems; multi-threading; critical path; data cache miss latencies; matrix-matrix multiplication; multithreaded environment; priority-driven active data prefetching; Application software; Computer aided instruction; Concurrent computing; Delay; Distributed computing; Engines; Prefetching; Processor scheduling; Runtime; Yarn;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium, 2004. Proceedings. 18th International
Print_ISBN :
0-7695-2132-0
Type :
conf
DOI :
10.1109/IPDPS.2004.1303322
Filename :
1303322
Link To Document :
بازگشت