• DocumentCode
    468290
  • Title
    Estimating Similarity over Data Streams Based on Dynamic Time Warping
  • Author
    Guo, Jian-Kui ; Wang, Qing ; Huang, Zhenhua ; Sun, Shengli ; Zhu, Yang-yong
  • Author_Institution
    Fudan Univ., Shanghai
  • Volume
    3
  • fYear
    2007
  • fDate
    24-27 Aug. 2007
  • Firstpage
    53
  • Lastpage
    57
  • Abstract
    Estimating similarity over data streams has many applications in data stream environments, such as network intrusion detection, data analysis in sensor networks, clustering, and k-nearest neighbor queries. However, there has been only a little research on similarity evaluation in data stream contexts. The main reason is the inherent nature of data streams: they are large, continuous, and can be scanned only once, which makes it hard to find an efficient method to evaluate similarity over them. In this paper, we propose a new algorithm, ESDS (estimating similarity over data streams), which not only estimates similarity over data streams efficiently under the time warping distance but is also the first to use the DTW (dynamic time warping) distance based on a sliding window for similarity evaluation over data streams. To the best of our knowledge, this is the first paper to address this problem. To evaluate the efficiency of our algorithm, we present a simple but efficient method to represent the original stream data. For computing the DTW distance between data streams by dynamic programming, we also introduce a new DTW distance that can compute the similarity over data streams efficiently. Experiments on many real and synthetic data sets show that our algorithm can evaluate similarity over data streams efficiently, a problem that has not been studied in previous research.
  • Keywords
    data handling; dynamic programming; security of data; time warp simulation; data streams; dynamic programming; dynamic time warping; intrusion detection; k-nearest neighbor queries; Algorithm design and analysis; Application software; Computer networks; Data analysis; Dynamic programming; Electronic mail; Electrostatic discharge; Intrusion detection; Spatial databases; Sun;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
  • Conference_Location
    Haikou
  • Print_ISBN
    978-0-7695-2874-8
  • Type
    conf
  • DOI
    10.1109/FSKD.2007.274
  • Filename
    4406201
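
The abstract refers to computing the DTW distance between data streams by dynamic programming over a sliding window. The sketch below illustrates only that general idea with the classic textbook DTW recurrence; it is not the ESDS algorithm, nor the new DTW distance or stream representation the paper proposes, none of which are specified in this record. The names `dtw_distance`, `SlidingWindowDTW`, and `window_size` are hypothetical, and the absolute difference is assumed as the point-wise cost.

```python
from collections import deque
from math import inf


def dtw_distance(a, b):
    """Classic DTW distance between two numeric sequences.

    Textbook dynamic-programming recurrence, O(len(a) * len(b)) time,
    using |x - y| as the assumed point-wise cost.
    """
    n, m = len(a), len(b)
    if n == 0 or m == 0:
        return 0.0 if n == m else inf
    # prev[j] is the best warping cost aligning the rows seen so far with b[:j].
    prev = [0.0] + [inf] * m
    for i in range(1, n + 1):
        curr = [inf] * (m + 1)
        for j in range(1, m + 1):
            cost = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of the three admissible predecessor cells.
            curr[j] = cost + min(prev[j], curr[j - 1], prev[j - 1])
        prev = curr
    return prev[m]


class SlidingWindowDTW:
    """Keeps the latest `window_size` values of two streams (hypothetical helper).

    Each update recomputes DTW from scratch over the current windows, which is
    the naive baseline rather than an efficient streaming estimate.
    """

    def __init__(self, window_size):
        self.x = deque(maxlen=window_size)
        self.y = deque(maxlen=window_size)

    def update(self, x_value, y_value):
        # deque(maxlen=...) silently drops the oldest value once the window is full.
        self.x.append(x_value)
        self.y.append(y_value)
        return dtw_distance(list(self.x), list(self.y))
```

A caller would feed one value from each stream per time step, for example `monitor = SlidingWindowDTW(window_size=64)` followed by `monitor.update(x_t, y_t)`, and compare the returned distance against an application-specific threshold. Recomputing the full table on every arrival costs time quadratic in the window size, which is the kind of overhead that motivates more efficient estimation schemes.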