• DocumentCode
    476420
  • Title

    An efficient single pass approach to frequent episode discovery in sequence data

  • Author

    Savla, S. ; Chakravarthy, Srinath

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of Texas at Arlington, Arlington, TX
  • fYear
    2008
  • fDate
    21-22 July 2008
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    There is a considerable body of work on sequence mining of transactional data. Most of the related work on point data make several passes over the entire dataset in order to discover frequently occurring episodes/patterns. The OnePass frequent episode discovery (or FED) algorithm, proposed in this paper, takes a different approach than the traditional apriori class of pattern detection algorithms. In our approach, significant intervals for each event (or device) are computed first (independently) and are used for detecting frequent patterns along with the interval in which they occur. The advantage of this approach is that the data set is compressed substantially in the first step thereby reducing the size of input used and hence the computation. Also, each event/device can be processed individually allowing for parallel computation of individual events. The OnePass FED algorithm then works on these significant intervals to discover interesting episodes in a single pass as compared to the apriori class of algorithms. Our approach is significantly more efficient and scales as well as compared to traditional mining algorithms. Extensive experimental analysis establishes its efficiency and scalability.
  • Keywords
    data mining; pattern recognition; FED algorithm; frequent episode discovery; pattern detection algorithms; sequence mining; single pass approach; transactional data;
  • fLanguage
    English
  • Publisher
    iet
  • Conference_Titel
    Intelligent Environments, 2008 IET 4th International Conference on
  • Conference_Location
    Seattle, WA
  • ISSN
    0537-9989
  • Print_ISBN
    978-0-86341-894-5
  • Type

    conf

  • Filename
    4629753