DocumentCode
154126
Title
TRAM: Optimizing Fine-Grained Communication with Topological Routing and Aggregation of Messages
Author
Wesolowski, Lukasz ; Venkataraman, R. ; Gupta, Arpan ; Jae-Seung Yeom ; Bisset, Keith ; Yanhua Sun ; Jetley, Pritish ; Quinn, Thomas R. ; Kale, Laxmikant V.
Author_Institution
Dept. of Comput. Sci., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
fYear
2014
fDate
9-12 Sept. 2014
Firstpage
211
Lastpage
220
Abstract
Fine-grained communication in supercomputing applications often limits performance through high communication overhead and poor utilization of network bandwidth. This paper presents Topological Routing and Aggregation Module (TRAM), a library that optimizes fine-grained communication performance by routing and dynamically combining short messages. TRAM collects units of fine-grained communication from the application and combines them into aggregated messages with a common intermediate destination. It routes these messages along a virtual mesh topology mapped onto the physical topology of the network. TRAM improves network bandwidth utilization and reduces communication overhead. It is particularly effective in optimizing patterns with global communication and large message counts, such as all-to-all and many-to-many, as well as sparse, irregular, dynamic or data dependent patterns. We demonstrate how TRAM improves performance through theoretical analysis and experimental verification using benchmarks and scientific applications. We present speedups on petascale systems of 6x for communication benchmarks and up to 4x for applications.
Keywords
bandwidth allocation; message passing; TRAM; fine-grained communication; message aggregation; network bandwidth utilization; petascale system; supercomputing application; topological routing and aggregation module; virtual mesh topology; Aggregates; Bandwidth; Libraries; Network topology; Routing; Runtime; Topology; Communication Optimization; Interconnection Networks; Message Aggregation;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel Processing (ICPP), 2014 43rd International Conference on
Conference_Location
Minneapolis MN
ISSN
0190-3918
Type
conf
DOI
10.1109/ICPP.2014.30
Filename
6957230
Link To Document