DocumentCode
170454
Title
Spatio-temporal factorization of log data for understanding network events
Author
Kimura, Tomohiro ; Ishibashi, Koji ; Mori, Takayoshi ; Sawada, Hideyuki ; Toyono, Tsuyoshi ; Nishimatsu, Ken ; Watanabe, Atsuyori ; Shimoda, Akihiro ; Shiomoto, Kohei
Author_Institution
NTT Network Technol. Labs., NTT Corp., Musashino, Japan
fYear
2014
fDate
April 27 2014-May 2 2014
Firstpage
610
Lastpage
618
Abstract
Understanding the impacts and patterns of network events such as link flaps or hardware errors is crucial for diagnosing network anomalies. In large production networks, analyzing the log messages that record network events has become a challenging task due to the following two reasons. First, the log messages are composed of unstructured text messages generated by vendor-specific rules. Second, network equipment such as routers, switches, and RADIUS severs generate various log messages induced by network events that span across several geographical locations, network layers, protocols, and services. In this paper, we have tackled these obstacles by building two novel techniques: statistical template extraction (STE) and log tensor factorization (LTF). STE leverages a statistical clustering technique to automatically extract primary templates from unstructured log messages. LTF aims to build a statistical model that captures spatial-temporal patterns of log messages. Such spatial-temporal patterns provide useful insights into understanding the impacts and root cause of hidden network events. This paper first formulates our problem in a mathematical way. We then validate our techniques using massive amount of network log messages collected from a large operating network. We also demonstrate several case studies that validate the usefulness of our technique.
Keywords
IP networks; electronic messaging; matrix decomposition; protocols; spatiotemporal phenomena; statistics; telecommunication network management; telecommunication network routing; tensors; LTF; RADIUS severs; STE; geographical locations; hardware errors; link flaps; log data; log tensor factorization; network anomalies; network equipment; network events; network layers; network log messages; production networks; protocols; spatial-temporal patterns; spatio-temporal factorization; statistical clustering technique; statistical model; statistical template extraction; unstructured text messages; vendor-specific rules; Computers; Conferences; Data mining; Hidden Markov models; Mathematical model; Protocols; Tensile stress;
fLanguage
English
Publisher
ieee
Conference_Titel
INFOCOM, 2014 Proceedings IEEE
Conference_Location
Toronto, ON
Type
conf
DOI
10.1109/INFOCOM.2014.6847986
Filename
6847986
Link To Document