• DocumentCode
    244919
  • Title

    An Examination of Multivariate Time Series Hashing with Applications to Health Care

  • Author

    Kale, David C. ; Dian Gong ; Zhengping Che ; Yan Liu ; Medioni, Gerard ; Wetzel, Randall ; Ross, Patrick

  • Author_Institution
    Comput. Sci. Dept., Univ. of Southern California, Los Angeles, CA, USA
  • fYear
    2014
  • fDate
    14-17 Dec. 2014
  • Firstpage
    260
  • Lastpage
    269
  • Abstract
    As large-scale multivariate time series data become increasingly common in application domains, such as health care and traffic analysis, researchers are challenged to build efficient tools to analyze it and provide useful insights. Similarity search, as a basic operator for many machine learning and data mining algorithms, has been extensively studied before, leading to several efficient solutions. However, similarity search for multivariate time series data is intrinsically challenging because (1) there is no conclusive agreement on what is a good similarity metric for multivariate time series data and (2) calculating similarity scores between two time series is often computationally expensive. In this paper, we address this problem by applying a generalized hashing framework, namely kernelized locality sensitive hashing, to accelerate time series similarity search with a series of representative similarity metrics. Experiment results on three large-scale clinical data sets demonstrate the effectiveness of the proposed approach.
  • Keywords
    data mining; health care; learning (artificial intelligence); time series; data mining algorithms; health care; kernelized locality sensitive hashing; large-scale clinical data sets; large-scale multivariate time series data; machine learning; multivariate time series hashing; representative similarity metrics; time series similarity search; Databases; Euclidean distance; Kernel; Time measurement; Time series analysis; Vectors; alignment; dynamic time warping; hashing; kernel methods; nearest neighbor; search; similarity; time series;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2014 IEEE International Conference on
  • Conference_Location
    Shenzhen
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4799-4303-6
  • Type

    conf

  • DOI
    10.1109/ICDM.2014.153
  • Filename
    7023343