DocumentCode :
2846461
Title :
ParColl: Partitioned Collective I/O on the Cray XT
Author :
Weikuan Yu ; Vetter, Jeffrey
Author_Institution :
Comput. Sci. & Math., Oak Ridge Nat. Lab., Oak Ridge, TN
fYear :
2008
fDate :
9-12 Sept. 2008
Firstpage :
562
Lastpage :
569
Abstract :
Collective I/O orchestrates I/O from parallel processes by aggregating fine-grained requests into large ones. However, its performance is typically a fraction of the potential I/O bandwidth on large scale platforms such as Cray XT. Based on our analysis, the time spent in global process synchronization dominates the actual time in file reads/writes, which imposes a ´collective wall´ on the performance of collective I/O. In this paper, we introduce a novel technique called partitioned collective I/O (ParColl). ParColl augments the original two-phase collective I/O protocol with new mechanisms for file area partitioning, I/O aggregator distribution and intermediate file views. Through these mechanisms, a group of processes and their targeted file are consistently divided into a collection of small subgroups, each performing I/O aggregation in a disjoint manner. File consistency is maintained through intermediate file views when necessary. Together, these mechanisms greatly reduce the cost of global synchronization. Our experimental results demonstrate that ParColl significantly improves the performance and the scalability of collective I/O. In one case, we show a 416% improvement on 1024 processes for a visualization I/O benchmark. We also show that the I/O patterns in scientific applications can benefit significantly from this technique, e.g. BT-I/O and Flash I/O.
Keywords :
file organisation; parallel processing; synchronisation; Cray XT; I-O aggregator distribution; ParColl; file area partitioning; file consistency; global process synchronization; intermediate file views; parallel processes; partitioned collective I-O; Bandwidth; Computer science; Costs; Laboratories; Large-scale systems; Mathematics; Parallel processing; Protocols; Scalability; Throughput; Collective Wall; Cray XT; Partitioned Collective I/O;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Processing, 2008. ICPP '08. 37th International Conference on
Conference_Location :
Portland, OR
ISSN :
0190-3918
Print_ISBN :
978-0-7695-3374-2
Electronic_ISBN :
0190-3918
Type :
conf
DOI :
10.1109/ICPP.2008.76
Filename :
4625894
Link To Document :
بازگشت