• DocumentCode
    1261785
  • Title

    A Lightweight Algorithm for Message Type Extraction in System Application Logs

  • Author

    Makanju, Adetokunbo ; Zincir-Heywood, A. Nur ; Milios, Evangelos E.

  • Author_Institution
    Fac. of Comput. Sci., Dalhousie Univ., Halifax, NS, Canada
  • Volume
    24
  • Issue
    11
  • fYear
    2012
  • Firstpage
    1921
  • Lastpage
    1936
  • Abstract
    Message type or message cluster extraction is an important task in the analysis of system logs in computer networks. Defining these message types automatically facilitates the automatic analysis of system logs. When the message types that exist in a log file are represented explicitly, they can form the basis for carrying out other automatic application log analysis tasks. In this paper, we introduce a novel algorithm for carrying out message type extraction from event log files. IPLoM, which stands for Iterative Partitioning Log Mining, works through a 4-step process. The first three steps hierarchically partition the event log into groups of event log messages or event clusters. In its fourth and final stage, IPLoM produces a message type description or line format for each of the message clusters. IPLoM is able to find clusters in data irrespective of the frequency of its instances in the data, it scales gracefully in the case of long message type patterns and produces message type descriptions at a level of abstraction, which is preferred by a human observer. Evaluations show that IPLoM outperforms similar algorithms statistically significantly.
  • Keywords
    data mining; fault tolerant computing; iterative methods; pattern clustering; IPLoM; automatic application log analysis tasks; autonomic computing; computer networks; human observer; iterative partitioning log mining; lightweight algorithm; message cluster extraction; message type extraction; message type patterns; system application logs; Buildings; Clustering algorithms; Data mining; Humans; Kernel; Observers; Partitioning algorithms; Algorithms; clustering; event log mining; experimentation; fault management;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2011.138
  • Filename
    5936060