DocumentCode :
2064017
Title :
A Load/Store Unit for a Memcpy Hardware Accelerator
Author :
Vassiliadis, Stamatis ; Duarte, Filipa ; Wong, Stephan
Author_Institution :
Delft Univ. of Technol., Delft
fYear :
2007
fDate :
27-29 Aug. 2007
Firstpage :
537
Lastpage :
541
Abstract :
Recently, a dedicated hardware accelerator was proposed that works in conjunction with caches found next to modern-day microprocessors, to speedup the commonly utilized memcpy operation. The main assumption of the proposal was that the to-be-memcpy-ed data has to reside inside the cache, which is not always valid. In this paper, we present a dedicated load/store unit and its implementation which cooperates with the previously proposed memcpy hardware accelerator and cache to ensure that data becomes available in the cache. Experimental results, using synthetic benchmarks, show that the load/store unit in conjunction with the memcpy hardware accelerator is capable of reducing the memcpy latencies by 85% (when the data is not present in the cache) compared to a highly optimized, hand-coded in assembly software solution.
Keywords :
cache storage; storage allocation; assembly software solution; load-store unit; memcpy hardware accelerator; synthetic benchmarks; Acceleration; Application software; Assembly; Bluetooth; Delay; Field programmable gate arrays; Hardware; Microprocessors; Proposals; Software libraries;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Field Programmable Logic and Applications, 2007. FPL 2007. International Conference on
Conference_Location :
Amsterdam
Print_ISBN :
978-1-4244-1060-6
Electronic_ISBN :
978-1-4244-1060-6
Type :
conf
DOI :
10.1109/FPL.2007.4380711
Filename :
4380711
Link To Document :
بازگشت