DocumentCode
3581363
Title
An approach for optimization of resource management in Hadoop
Author
Raj, R. Sandeep ; Raju, G. Prabhakar
Author_Institution
Dept. of Comput. Sci. & Eng., Anurag Group of Instn., Hyderabad, India
fYear
2014
Firstpage
1
Lastpage
5
Abstract
Many tools and frameworks have been developed to process data on distributed data centers. MapReduce most prominent among such frameworks has emerged as a popular distributed data processing model for processing vast amount of data in parallel on large clusters of commodity machines. The JobTracker in MapReduce framework is responsible for both managing the cluster´s resources and executing the MapReduce jobs, a constraint that limits scalability, resource utilization. YARN the next-generation execution layer for Hadoop splits processing and resource management capabilities of JobTracker into separate entities and eliminates the dependency of Hadoop on MapReduce. This new model is more isolated and scalable compared to MapReduce, providing improved features and functionality. This paper discusses the design of YARN and significant advantages over traditional MapReduce.
Keywords
computer centres; data handling; optimisation; parallel processing; resource allocation; Hadoop; JobTracker; MapReduce framework; MapReduce jobs; YARN design; commodity machines; distributed data centers; distributed data processing model; next-generation execution layer; resource management optimization; resource utilization; Computational modeling; Containers; Distributed databases; Programming; Resource management; Scalability; Yarn; BigData; Hadoop; MapReduce; Scalability; YARN;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer and Communications Technologies (ICCCT), 2014 International Conference on
Type
conf
DOI
10.1109/ICCCT2.2014.7066747
Filename
7066747
Link To Document