DocumentCode :
3221842
Title :
Parallel execution of radix sort program using fine-grain communication
Author :
Kodama, Y. ; Sato, M. ; Sakane, H. ; Sakai, S. ; Hanpei, K. ; Yamaguchi, Y.
Author_Institution :
Comput. Sci. Div., Electrotech. Lab., Tsukuba, Japan
fYear :
1997
fDate :
10-14 Nov 1997
Firstpage :
136
Lastpage :
145
Abstract :
The report presents empirical results of fine-grain communication on the 80-processor EM-X distributed-memory multiprocessor. EM-X has hardware support for low latency, high throughput fine-grain communication-this hardware support includes packet generation integrated into the instruction execution pipeline for single-cycle communication overhead, direct memory access for remote references, and rapid context switching for latency tolerance. The authors study the fine-grain communication performance of integer radix sort, a code with irregular communication, on EM-X, and compare it to the Fujitsu AP1000+ and the Cray Server CS6400. The experimental results indicate that EM-X achieves high throughput and low overhead for fine-grain communication. Whereas EM-X´s communication performance scales perfectly as one increases the number of processors, other coarse-grain message-passing machines exhibit fluctuation and performance degradation for larger configurations due to network contention
Keywords :
digital arithmetic; distributed memory systems; file organisation; message passing; parallel algorithms; parallel machines; performance evaluation; pipeline arithmetic; shared memory systems; sorting; Cray Server CS6400; EM-X distributed-memory multiprocessor; Fujitsu AP1000+; coarse-grain message-passing machines; communication performance; direct memory access; hardware support; instruction execution pipeline; integer radix sort; irregular communication code; latency tolerance; low latency high throughput fine-grain communication; network contention; packet generation; parallel execution; radix sort program; rapid context switching; remote references; single-cycle communication overhead; Communication switching; Context; Degradation; Delay; Fluctuations; Hardware; Network servers; Packet switching; Pipelines; Throughput;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Architectures and Compilation Techniques., 1997. Proceedings., 1997 International Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
0-8186-8090-3
Type :
conf
DOI :
10.1109/PACT.1997.644010
Filename :
644010
Link To Document :
بازگشت