Title :
Spatio-temporal factorization of log data for understanding network events
Author :
Kimura, Tomohiro ; Ishibashi, Koji ; Mori, Takayoshi ; Sawada, Hideyuki ; Toyono, Tsuyoshi ; Nishimatsu, Ken ; Watanabe, Atsuyori ; Shimoda, Akihiro ; Shiomoto, Kohei
Author_Institution :
NTT Network Technol. Labs., NTT Corp., Musashino, Japan
fDate :
April 27 2014-May 2 2014
Abstract :
Understanding the impacts and patterns of network events such as link flaps or hardware errors is crucial for diagnosing network anomalies. In large production networks, analyzing the log messages that record network events has become a challenging task due to the following two reasons. First, the log messages are composed of unstructured text messages generated by vendor-specific rules. Second, network equipment such as routers, switches, and RADIUS severs generate various log messages induced by network events that span across several geographical locations, network layers, protocols, and services. In this paper, we have tackled these obstacles by building two novel techniques: statistical template extraction (STE) and log tensor factorization (LTF). STE leverages a statistical clustering technique to automatically extract primary templates from unstructured log messages. LTF aims to build a statistical model that captures spatial-temporal patterns of log messages. Such spatial-temporal patterns provide useful insights into understanding the impacts and root cause of hidden network events. This paper first formulates our problem in a mathematical way. We then validate our techniques using massive amount of network log messages collected from a large operating network. We also demonstrate several case studies that validate the usefulness of our technique.
Keywords :
IP networks; electronic messaging; matrix decomposition; protocols; spatiotemporal phenomena; statistics; telecommunication network management; telecommunication network routing; tensors; LTF; RADIUS severs; STE; geographical locations; hardware errors; link flaps; log data; log tensor factorization; network anomalies; network equipment; network events; network layers; network log messages; production networks; protocols; spatial-temporal patterns; spatio-temporal factorization; statistical clustering technique; statistical model; statistical template extraction; unstructured text messages; vendor-specific rules; Computers; Conferences; Data mining; Hidden Markov models; Mathematical model; Protocols; Tensile stress;
Conference_Titel :
INFOCOM, 2014 Proceedings IEEE
Conference_Location :
Toronto, ON
DOI :
10.1109/INFOCOM.2014.6847986