DocumentCode :
3414072
Title :
Coherent block data transfer in the FLASH multiprocessor
Author :
Heinlein, John ; Bosch, Robert P., Jr. ; Gharachorloo, Kourosh ; Rosenblum, Mendel ; Gupta, Anoop
Author_Institution :
Comput. Syst. Lab., Stanford Univ., CA, USA
fYear :
1997
fDate :
1-5 Apr 1997
Firstpage :
18
Lastpage :
27
Abstract :
A key goal of the Stanford FLASH project is to explore the integration of multiple communication protocols in a single multiprocessor architecture. To achieve this goal, FLASH includes a programmable node controller called MAGIC, which contains an embedded protocol processor capable of implementing multiple protocols. In this paper we present a specialized protocol for block data transfer integrated with a conventional cache coherence protocol. Block transfer forms the basis for message passing implementations on top of shared memory, occurs in important workloads such as databases, and is frequently used by the operating system. We discuss the issues that arise in designing a fully integrated protocol and its interactions with cache coherence. Using microbenchmarks, MPI communication primitives, and an application running on the operating system, we compare our protocol with standard bcopy and bcopy augmented with prefetches. Our results show that integrated block transfer can accelerate communication between nodes while off-loading the task from the main processor utilizing the network more efficiently, and reducing the associated cache pollution. Given the aggressive support for prefetching in FLASH, prefetched bcopy is able to achieve competitive performance in many cases but lacks the other three advantages of our protocol
Keywords :
cache storage; memory protocols; multiprocessing systems; shared memory systems; FLASH; FLASH multiprocessor; MAGIC; block data transfer; cache coherence protocol; embedded protocol processor; multiple communication protocols; multiprocessor architecture; prefetching; protocol; shared memory; Acceleration; Communication system control; Computer architecture; Databases; Laboratories; Message passing; Operating systems; Pollution; Prefetching; Protocols;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Processing Symposium, 1997. Proceedings., 11th International
Conference_Location :
Genva
ISSN :
1063-7133
Print_ISBN :
0-8186-7793-7
Type :
conf
DOI :
10.1109/IPPS.1997.580836
Filename :
580836
Link To Document :
بازگشت