• DocumentCode
    843631
  • Title

    A unifying framework for detecting outliers and change points from time series

  • Author

    Takeuchi, Jun-Ichi ; Yamanishi, Kenji

  • Author_Institution
    Internet Syst. Res. Labs., NEC Corp., Kanngawa, Japan
  • Volume
    18
  • Issue
    4
  • fYear
    2006
  • fDate
    4/1/2006 12:00:00 AM
  • Firstpage
    482
  • Lastpage
    492
  • Abstract
    We are concerned with the issue of detecting outliers and change points from time series. In the area of data mining, there have been increased interest in these issues since outlier detection is related to fraud detection, rare event discovery, etc., while change-point detection is related to event/trend change detection, activity monitoring, etc. Although, in most previous work, outlier detection and change point detection have not been related explicitly, this paper presents a unifying framework for dealing with both of them. In this framework, a probabilistic model of time series is incrementally learned using an online discounting learning algorithm, which can track a drifting data source adaptively by forgetting out-of-date statistics gradually. A score for any given data is calculated in terms of its deviation from the learned model, with a higher score indicating a high possibility of being an outlier. By taking an average of the scores over a window of a fixed length and sliding the window, we may obtain a new time series consisting of moving-averaged scores. Change point detection is then reduced to the issue of detecting outliers in that time series. We compare the performance of our framework with those of conventional methods to demonstrate its validity through simulation and experimental applications to incidents detection in network security.
  • Keywords
    data mining; learning (artificial intelligence); probability; security of data; time series; change-point detection; data mining; incidents detection; network security; online discounting learning algorithm; outlier detection framework; probabilistic model; time series; Change detection algorithms; Data mining; Data security; Event detection; Histograms; Intrusion detection; Monitoring; Statistics; AR model.; Time series; change point; data mining; network security;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2006.1599387
  • Filename
    1599387