• DocumentCode
    1698765
  • Title

    Finding approximate frequent patterns in streaming medical data

  • Author

    Lin, Jessica ; Li, Yuan

  • Author_Institution
    Comput. Sci. Dept., George Mason Univ., Fairfax, VA, USA
  • fYear
    2010
  • Firstpage
    13
  • Lastpage
    18
  • Abstract
    Time series data is ubiquitous and plays an important role in virtually every domain. For example, in medicine, the advancement of computer technology has enabled more sophisticated patients monitoring, either on-site or remotely. Such monitoring produces massive amount of time series data, which contain valuable information for pattern learning and knowledge discovery. In this paper, we explore the problem of identifying frequently occurring patterns, or motifs, in streaming medical data. The problem of frequent patterns mining has many potential applications, including compression, summarization, and event prediction. We propose a novel approach based on grammar induction that allows the discovery of approximate, variable-length motifs in streaming data. The preliminary results show that the grammar-based approach is able to find some important motifs in some medical data, and suggest that using grammar-based algorithms for time series pattern discovery might be worth exploring.
  • Keywords
    data compression; data mining; grammars; medical information systems; pattern classification; time series; approximate frequent pattern mining; computer technology; grammar-based approach; knowledge discovery; medical data streaming; pattern learning; sophisticated patient monitoring; time series pattern discovery; variable-length motifs; Algorithm design and analysis; Approximation algorithms; Compression algorithms; Data mining; Grammar; Probabilistic logic; Time series analysis; Frequent Patterns; Grammar Induction; Time Series;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer-Based Medical Systems (CBMS), 2010 IEEE 23rd International Symposium on
  • Conference_Location
    Perth, WA
  • ISSN
    1063-7125
  • Print_ISBN
    978-1-4244-9167-4
  • Type

    conf

  • DOI
    10.1109/CBMS.2010.6042675
  • Filename
    6042675