Title :
Filtering error log as time series in complex service-based storage systems
Author :
Rao, Xiang ; Yin, Gang ; Wang, Huaimin ; Shi, Dianxi ; Zhu, Yanxu
Author_Institution :
Nat. Lab. for Parallel & Distrib. Process., Nat. Univ. of Defense Technol., Changsha, China
Abstract :
Mining log pattern to analyze the faults in large scale distributed system is affected by the existence of redundant and ambiguous noisy error logs. While existing works try to compress logs in a coarse granularity from temporal and spatial view to remove the redundancy, they fail to reserve those ambiguous logs that might truly relate to a fault, which misleads the fault characterizing result. By modeling error logs as time series and examining the similarity between trash error log template and target error log, the ambiguous error logs are kept and the affected patterns can be effectively removed. Experiments in a practical complex service-based storage show that up to 92% of the affected patterns can be filtered.
Keywords :
data mining; distributed databases; information filtering; large-scale systems; storage allocation; time series; affected patterns; ambiguous error logs; ambiguous logs; ambiguous noisy error logs; coarse granularity; complex service-based storage systems; fault characterizing result; filtering error log; large scale distributed system; mining log pattern; redundant error logs; target error log; time series; trash error log template; Approximation methods; Computer crashes; Libraries; Matched filters; Time series analysis; Transforms; Service-based storage system; log filtering; trash error logs;
Conference_Titel :
Networked Computing and Advanced Information Management (NCM), 2011 7th International Conference on
Conference_Location :
Gyeongju
Print_ISBN :
978-1-4577-0185-6
Electronic_ISBN :
978-89-88678-37-4