Title :
A Mapreduce programming framework using message passing
Author :
Ho, Yu-Fan ; Chen, Sih-Wei ; Chen, Chang-Yi ; Hsu, Yung-Ching ; Liu, Pangfeng
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Abstract :
MapReduce is a very popular parallel programming model for processing large data sets. This paper discusses strategies in implementing a MapReduce runtime system using Message Passing Interface (MPI) library. The implementation uses blocking communication function in MPI, e.g. MPI_Send and MPI_Recv, to transfer intermediate data, so as to make the communication between mappers and reducers in MapReduce model much more efficient. Experiment results indicate that our MPI implementation performs better than Hadoop when the data volume is below 60MB, and perform five times better then native Hadoop when the input size is below 5MB.
Keywords :
message passing; parallel programming; very large databases; MapReduce runtime system; Mapreduce programming; blocking communication function; data volume; large data sets; message passing interface library; parallel programming; Computational modeling; Computer architecture; File systems; Google; Message passing; Programming; Runtime; MPI; MapReduce;
Conference_Titel :
Computer Symposium (ICS), 2010 International
Conference_Location :
Tainan
Print_ISBN :
978-1-4244-7639-8
DOI :
10.1109/COMPSYM.2010.5685386