DocumentCode :
685820
Title :
An approach for log analysis based failure monitoring in Hadoop cluster
Author :
Mohandas, Madhury ; Dhanya, P.M.
Author_Institution :
Dept. of Comput. Sci. & Eng., Rajagiri Sch. of Eng. & Technol., Kochi, India
fYear :
2013
fDate :
12-14 Dec. 2013
Firstpage :
861
Lastpage :
867
Abstract :
Massive and gargantuan amount of data is produced on per day basis. Such scenario elevates the need for apposite storage, supervision and processing of data. The massive use of Distributed framework calls for faster analysis and diagnosis of failures. Due to the distributed nature of processing, it is difficult for cluster administrator to isolate the failures and failed nodes. Many contributions have been done for failure monitoring, analysis etc in the last few years. Apache Hadoop´s Jobtracker, Namenode, Secondary Namenode, Datanode and Tasktracker all generate logs. This paper aims at building a failure monitoring system from the scratch, by parsing and analyzing the Hadoop log files generated in the cluster. The monitoring system gives all relevant details related to the application, and points out the specific reason for failure, that is, whether an application failure or a network failure (these are the most common failures in the cluster).
Keywords :
Java; fault diagnosis; program diagnostics; public domain software; Apache Hadoop Jobtracker; Datanode; Hadoop cluster; Namenode; Secondary Namenode; Tasktracker; distributed framework; failure analysis; failure diagnosis; failure monitoring system; log analysis based failure monitoring; open source Java software framework; Computational modeling; Computer architecture; File systems; Google; History; Monitoring; BigData; Failure Monitoring; HDFS; Hadoop; Log Analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Green Computing, Communication and Conservation of Energy (ICGCE), 2013 International Conference on
Conference_Location :
Chennai
Type :
conf
DOI :
10.1109/ICGCE.2013.6823555
Filename :
6823555
Link To Document :
بازگشت