DocumentCode :
3191587
Title :
Dual-JT: Toward the high availability of JobTracker in Hadoop
Author :
Jian Wan ; Minggang Liu ; Xixiang Hu ; Zujie Ren ; Jilin Zhang ; Weisong Shi ; Wei Wu
Author_Institution :
Sch. of Comput. Sci. & Technol., Hangzhou Dianzi Univ., Hangzhou, China
fYear :
2012
fDate :
3-6 Dec. 2012
Firstpage :
263
Lastpage :
268
Abstract :
MapReduce is a state-of-the-art computation paradigm that is becoming widely used for processing large-scale datasets. Hadoop is an open-source implementation of MapReduce and follows a master-slave architecture. This architecture makes Hadoop suffer from a single point of failure in the JobTracker. In this paper, we design a solution that removes the JobTracker as a single point of failure and thereby enhances its availability. In this solution, a standby JobTracker is introduced to act as a hot backup node for the active JobTracker. The standby JobTracker synchronizes the job execution process with the active JobTracker by collecting and parsing the job log. If the active JobTracker fails, the standby JobTracker can take over quickly. This solution is implemented in Hadoop 0.20.x. Extensive experiments illustrate that this solution effectively enhances the availability of the JobTracker. A large production cluster at a large e-commerce company has adopted this solution, which avoids interrupting job submission and execution when the JobTracker fails or restarts.
Keywords :
distributed processing; electronic commerce; public domain software; Dual-JT; Hadoop 0.20.x; JobTracker; MapReduce; computation paradigm; e-commerce company; hot backup node; job execution process; job log collection; job log parsing; large-scale dataset processing; master-slave architecture; open-source implementation; single point of failure; Availability; Computer architecture; Conferences; Delay; IP networks; Real-time systems; Synchronization; Hadoop; High Available; MapReduce; Single Point of Failure
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Cloud Computing Technology and Science (CloudCom), 2012 IEEE 4th International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4673-4511-8
Electronic_ISBN :
978-1-4673-4509-5
Type :
conf
DOI :
10.1109/CloudCom.2012.6427485
Filename :
6427485