DocumentCode :
656202
Title :
ConMR: Concurrent MapReduce Programming Model for Large Scale Shared-Data Applications
Author :
Fan Zhang ; Malluhi, Qutaibah M. ; Elsayed, Tamer M.
Author_Institution :
Dept. of Comput. Sci., Carnegie Mellon Univ. Qatar, Doha, Qatar
fYear :
2013
fDate :
1-4 Oct. 2013
Firstpage :
671
Lastpage :
679
Abstract :
The rapid growth of large-data processing has brought in the MapReduce programming model as a widely accepted solution. However, MapReduce limits itself to a one map-to-one-reduce framework. It also lacks built-in support and optimization for input datasets that are shared among concurrent applications and/or jobs. Performance might improve when shared and frequently accessed data is read from the local file system instead of the distributed file system. To enhance the performance of big data applications, this paper presents Concurrent MapReduce, a new programming model built on top of MapReduce that deals with large amounts of shared data items. Concurrent MapReduce provides support for processing heterogeneous sources of input datasets and offers optimization when the datasets are partially or fully shared. Experimental evaluation has shown an execution runtime speedup of 4X compared to a traditional nonconcurrent MapReduce implementation, with a manageable time overhead.
Keywords :
concurrency control; data handling; parallel programming; ConMR model; Concurrent MapReduce programming model; distributed file system; large scale shared-data applications; large-data processing; map-to-one-reduce framework; time overhead; Data models; Distributed databases; Erbium; File systems; Optimization; Programming; Remuneration; Concurrency; MapReduce; Programming Model
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Parallel Processing (ICPP), 2013 42nd International Conference on
Conference_Location :
Lyon
ISSN :
0190-3918
Type :
conf
DOI :
10.1109/ICPP.2013.134
Filename :
6687405