DocumentCode :
548528
Title :
Filtering error log as time series in complex service-based storage systems
Author :
Rao, Xiang ; Yin, Gang ; Wang, Huaimin ; Shi, Dianxi ; Zhu, Yanxu
Author_Institution :
Nat. Lab. for Parallel & Distrib. Process., Nat. Univ. of Defense Technol., Changsha, China
fYear :
2011
fDate :
21-23 June 2011
Firstpage :
226
Lastpage :
231
Abstract :
Mining log patterns to analyze faults in large-scale distributed systems is hampered by redundant and ambiguous noisy error logs. Existing approaches compress logs at a coarse granularity from temporal and spatial views to remove redundancy, but they fail to preserve ambiguous logs that may truly relate to a fault, which misleads the fault characterization result. By modeling error logs as time series and examining the similarity between a trash error log template and a target error log, the ambiguous error logs are kept and the affected patterns can be effectively removed. Experiments on a practical complex service-based storage system show that up to 92% of the affected patterns can be filtered.
Keywords :
data mining; distributed databases; information filtering; large-scale systems; storage allocation; time series; affected patterns; ambiguous error logs; ambiguous logs; ambiguous noisy error logs; coarse granularity; complex service-based storage systems; fault characterizing result; filtering error log; large scale distributed system; mining log pattern; redundant error logs; target error log; time series; trash error log template; Approximation methods; Computer crashes; Libraries; Matched filters; Time series analysis; Transforms; Service-based storage system; log filtering; trash error logs;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Networked Computing and Advanced Information Management (NCM), 2011 7th International Conference on
Conference_Location :
Gyeongju
Print_ISBN :
978-1-4577-0185-6
Electronic_ISBN :
978-89-88678-37-4
Type :
conf
Filename :
5967550