Title :
Optimization of Parallel I/O for Cannon's Algorithm Based on Lustre
Author :
Li, Yunchun ; Li, Hongda
Author_Institution :
Comput. Sci. Dept., Beihang Univ., Beijing, China
Abstract :
Matrix multiplication is one of the most important operations in linear algebra and is widely used in many fields of science and engineering. Cannon's algorithm is a classical distributed algorithm for matrix multiplication on two-dimensional meshes. Its I/O is generally handled with MPI-IO. However, it has been well documented that MPI-IO performs poorly in a Lustre file system environment. As the scale of the matrix multiplication increases, this problem becomes more serious and turns into a key factor limiting the performance of the program. To improve the collective I/O performance of a Cannon's algorithm program, we propose a new aggregation pattern, the stripe-continuous aggregation pattern, which fully takes into account the striping mechanism and lock protocol of the Lustre file system. Theoretical analysis and experimental results show that, compared with other patterns, this pattern makes full use of the capacity of the Lustre file system and effectively improves the I/O performance of the Cannon's algorithm program.
Keywords :
file organisation; input-output programs; mathematics computing; matrix multiplication; message passing; parallel processing; 2D mesh; Cannon algorithm; Lustre file system environment; MPI-IO; linear algebra; lock protocol; matrix multiplication; message passing interface; parallel input-output optimization; stripe-continuous aggregation pattern; striping mechanism; two-dimensional mesh; Computer architecture; File systems; Program processors; Protocols; Servers; Sparse matrices; Throughput; Cannon's algorithm; Collective I/O; Lustre file system; MPI-IO; Parallel I/O;
Conference_Title :
Distributed Computing and Applications to Business, Engineering & Science (DCABES), 2012 11th International Symposium on
Conference_Location :
Guilin
Print_ISBN :
978-1-4673-2630-8
DOI :
10.1109/DCABES.2012.61