• DocumentCode
    2864705
  • Title

    HOT SAX: efficiently finding the most unusual time series subsequence

  • Author

    Keogh, Eamonn ; Lin, Jessica ; Fu, Ada

  • Author_Institution
    Dept. of Comput. Sci. & Eng., California Univ., Riverside, CA, USA
  • fYear
    2005
  • fDate
    27-30 Nov. 2005
  • Abstract
    In this work, we introduce the new problem of finding time series discords. Time series discords are subsequences of a longer time series that are maximally different to all the rest of the time series subsequences. They thus capture the sense of the most unusual subsequence within a time series. Time series discords have many uses for data mining, including improving the quality of clustering, data cleaning, summarization, and anomaly detection. Discords are particularly attractive as anomaly detectors because they only require one intuitive parameter (the length of the subsequence) unlike most anomaly detection algorithms that typically require many parameters. We evaluate our work with a comprehensive set of experiments. In particular, we demonstrate the utility of discords with objective experiments on domains as diverse as Space Shuttle telemetry monitoring, medicine, surveillance, and industry, and we demonstrate the effectiveness of our discord discovery algorithm with more than one million experiments, on 82 different datasets from diverse domains.
  • Keywords
    data mining; time series; HOT SAX; anomaly detection; clustering quality; data cleaning; data mining; summarization; time series discords; Aerospace industry; Cleaning; Computer science; Data mining; Detection algorithms; Detectors; Monitoring; Space shuttles; Surveillance; Telemetry; Anomaly Detection; Clustering; Time Series Data Mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, Fifth IEEE International Conference on
  • ISSN
    1550-4786
  • Print_ISBN
    0-7695-2278-5
  • Type

    conf

  • DOI
    10.1109/ICDM.2005.79
  • Filename
    1565683