• DocumentCode
    170454
  • Title

    Spatio-temporal factorization of log data for understanding network events

  • Author

    Kimura, Tomohiro ; Ishibashi, Koji ; Mori, Takayoshi ; Sawada, Hideyuki ; Toyono, Tsuyoshi ; Nishimatsu, Ken ; Watanabe, Atsuyori ; Shimoda, Akihiro ; Shiomoto, Kohei

  • Author_Institution
    NTT Network Technol. Labs., NTT Corp., Musashino, Japan
  • fYear
    2014
  • fDate
    April 27 2014-May 2 2014
  • Firstpage
    610
  • Lastpage
    618
  • Abstract
    Understanding the impacts and patterns of network events such as link flaps or hardware errors is crucial for diagnosing network anomalies. In large production networks, analyzing the log messages that record network events has become a challenging task due to the following two reasons. First, the log messages are composed of unstructured text messages generated by vendor-specific rules. Second, network equipment such as routers, switches, and RADIUS severs generate various log messages induced by network events that span across several geographical locations, network layers, protocols, and services. In this paper, we have tackled these obstacles by building two novel techniques: statistical template extraction (STE) and log tensor factorization (LTF). STE leverages a statistical clustering technique to automatically extract primary templates from unstructured log messages. LTF aims to build a statistical model that captures spatial-temporal patterns of log messages. Such spatial-temporal patterns provide useful insights into understanding the impacts and root cause of hidden network events. This paper first formulates our problem in a mathematical way. We then validate our techniques using massive amount of network log messages collected from a large operating network. We also demonstrate several case studies that validate the usefulness of our technique.
  • Keywords
    IP networks; electronic messaging; matrix decomposition; protocols; spatiotemporal phenomena; statistics; telecommunication network management; telecommunication network routing; tensors; LTF; RADIUS severs; STE; geographical locations; hardware errors; link flaps; log data; log tensor factorization; network anomalies; network equipment; network events; network layers; network log messages; production networks; protocols; spatial-temporal patterns; spatio-temporal factorization; statistical clustering technique; statistical model; statistical template extraction; unstructured text messages; vendor-specific rules; Computers; Conferences; Data mining; Hidden Markov models; Mathematical model; Protocols; Tensile stress;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    INFOCOM, 2014 Proceedings IEEE
  • Conference_Location
    Toronto, ON
  • Type

    conf

  • DOI
    10.1109/INFOCOM.2014.6847986
  • Filename
    6847986