• DocumentCode
    1758951
  • Title

    Efficient Motif Discovery for Large-Scale Time Series in Healthcare

  • Author

    Bo Liu ; Jianqiang Li ; Cheng Chen ; Wei Tan ; Qiang Chen ; Mengchu Zhou

  • Author_Institution
    Dept. of Autom., Tsinghua Univ., Beijing, China
  • Volume
    11
  • Issue
    3
  • fYear
    2015
  • fDate
    42156
  • Firstpage
    583
  • Lastpage
    590
  • Abstract
    Analyzing time series data can reveal the temporal behavior of the underlying mechanism producing the data. Time series motifs, which are similar subsequences or frequently occurring patterns, have significant meanings for researchers especially in medical domain. With the fast growth of time series data, traditional methods for motif discovery are inefficient and not applicable to large-scale data. This work proposes an efficient Motif Discovery method for Large-scale time series (MDLats). By computing standard motifs, MDLats eliminates a majority of redundant computation in the related arts and reuses existing information to the maximum. All the motif types and subsequences are generated for subsequent analysis and classification. Our system is implemented on a Hadoop platform and deployed in a hospital for clinical electrocardiography classification. The experiments on real-world healthcare data show that MDLats outperform the state-of-the-art methods even in large time series.
  • Keywords
    data mining; electrocardiography; health care; medical signal processing; parallel processing; signal classification; time series; Hadoop platform; clinical electrocardiography classification; data analysis; data mining; healthcare; large-scale time series; motif discovery method; redundant computation; Algorithm design and analysis; Approximation algorithms; Electrocardiography; Electronic mail; Euclidean distance; Standards; Time series analysis; Data mining; Motif; Pattern discovery; Time series; motif; pattern discovery; time series;
  • fLanguage
    English
  • Journal_Title
    Industrial Informatics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1551-3203
  • Type

    jour

  • DOI
    10.1109/TII.2015.2411226
  • Filename
    7056438