Title :
Overcoming the memory wall in packet processing
Author :
Mudigonda, J. ; Vin, H.M. ; Yavatkar, R.
Author_Institution :
Dept. of Comput. Sci., Univ. of Texas at Austin, Austin, TX
Abstract :
Overhead of memory accesses limits the performance of packet processing applications. To overcome this bottleneck, today´s network processors can utilize a wide-range of mechanisms - such as multi-level memory hierarchy, wide-word accesses, special-purpose result-caches, asynchronous memory, and hardware multi-threading. However, supporting all of these mechanisms complicates programmability and hardware design, and wastes system resources. In this paper, we address the following fundamental question: what minimal set of hardware mechanisms must a network processor support to achieve the twin goals of simplified programmability and high packet throughput? We show that no single mechanism sufficies; the minimal set must include data-caches and multi-threading. Data-caches and multi-threading are complementary; whereas data- caches exploit locality to reduce the number of context-switches and the off-chip memory bandwidth requirement, multi-threading exploits parallelism to hide long cache-miss latencies.
Keywords :
internetworking; multi-threading; asynchronous memory; hardware multi-threading; memory access overhead; memory wall; multi-level memory hierarchy; network processor; off-chip memory bandwidth requirement; packet processing; special-purpose result-caches; wide-word accesses; Application software; Bandwidth; Delay; Hardware; Internetworking; Multithreading; Network address translation; Parallel processing; Permission; Throughput; data-caches; multithreading; network processors;
Conference_Titel :
Architecture for networking and communications systems, 2005. ANCS 2005. Symposium on
Conference_Location :
Princeton, NJ
Print_ISBN :
978-1-59593-082-8