Title :
In-Map/In-Reduce: Concurrent Job Execution in MapReduce
Author :
Idris, Muhammad ; Hussain, Shiraz ; Sungyoung Lee
Author_Institution :
Coll. of Electron. & Inf., Kyung Hee Univ., Seoul, South Korea
Abstract :
Hadoop based Map Reduce (MR) has emerged as big data processing mechanism in terms of its data intensive applications. In data intensive systems, analysis and visualizations as a result of various algorithms can lead to differentiable and comparable results. Current implementations of MR facilitates to reuse the results of MR jobs in other MR jobs and to distribute the cloud resources among jobs. However, very little work is done in terms of using same data for multiple algorithms at the same time in a single job using either shared resources or dynamic resource allocation based on the data and scheduling of Map Reduce jobs. In this paper we propose a method to execute multiple algorithms on same data in HDFS concurrently and to use the same available resources by dynamically managing the task assignment and results aggregation. Our proposed approach reduces the execution time and supports multiple algorithms execution in parallel. In-Map/In-Reduce shows 200% decrease in execution time.
Keywords :
Big Data; cloud computing; parallel programming; public domain software; resource allocation; scheduling; Big Data processing mechanism; HDFS; Hadoop based Map Reduce; In-Map/In-Reduce; MR; MapReduce jobs scheduling; cloud resources; concurrent job execution; data intensive systems; dynamic resource allocation; multiple parallel algorithms; task assignment; Algorithm design and analysis; Big data; Clustering algorithms; Distributed databases; Educational institutions; Partitioning algorithms; Big Data; Data Intensive Computing; HDFS; Hadoop; MapReduce;
Conference_Titel :
Trust, Security and Privacy in Computing and Communications (TrustCom), 2014 IEEE 13th International Conference on
Conference_Location :
Beijing
DOI :
10.1109/TrustCom.2014.100