Title :
A Load/Store Unit for a Memcpy Hardware Accelerator
Author :
Vassiliadis, Stamatis ; Duarte, Filipa ; Wong, Stephan
Author_Institution :
Delft Univ. of Technol., Delft
Abstract :
Recently, a dedicated hardware accelerator was proposed that works in conjunction with caches found next to modern-day microprocessors, to speedup the commonly utilized memcpy operation. The main assumption of the proposal was that the to-be-memcpy-ed data has to reside inside the cache, which is not always valid. In this paper, we present a dedicated load/store unit and its implementation which cooperates with the previously proposed memcpy hardware accelerator and cache to ensure that data becomes available in the cache. Experimental results, using synthetic benchmarks, show that the load/store unit in conjunction with the memcpy hardware accelerator is capable of reducing the memcpy latencies by 85% (when the data is not present in the cache) compared to a highly optimized, hand-coded in assembly software solution.
Keywords :
cache storage; storage allocation; assembly software solution; load-store unit; memcpy hardware accelerator; synthetic benchmarks; Acceleration; Application software; Assembly; Bluetooth; Delay; Field programmable gate arrays; Hardware; Microprocessors; Proposals; Software libraries;
Conference_Titel :
Field Programmable Logic and Applications, 2007. FPL 2007. International Conference on
Conference_Location :
Amsterdam
Print_ISBN :
978-1-4244-1060-6
Electronic_ISBN :
978-1-4244-1060-6
DOI :
10.1109/FPL.2007.4380711