• DocumentCode
    2130918
  • Title

    An Efficient Sequential Pattern Mining Algorithm Based on the 2-Sequence Matrix

  • Author

    Hsieh, Chia-Ying ; Yang, Don-Lin ; Wu, Jungpin

  • Author_Institution
    Dept. of Inf., Eng. & Comput. Sci., Feng Chia Univ., Taichung
  • fYear
    2008
  • fDate
    15-19 Dec. 2008
  • Firstpage
    583
  • Lastpage
    591
  • Abstract
    Sequential pattern mining has become more and more popular in recent years due to its wide applications and the fact that it can find more information than association rules. Two famous algorithms in sequential pattern mining are AprioriAll and PrefixSpan. These two algorithms not only need to scan a database or projected-databases many times, but also require setting a minimal support threshold to prune infrequent data to obtain useful sequential patterns efficiently. In addition, they must rescan the database if new items or sequences are added. In this paper, we propose a novel algorithm called efficient sequential pattern enumeration (ESPE) to solve the above problems. In addition, our method can be applied in many applications, such as for the itemsets appearing at the same time in a sequence. In our experiments, we show that the performance of ESPE is better than the other two methods using various datasets.
  • Keywords
    data mining; matrix algebra; 2-sequence matrix; AprioriAll; PrefixSpan; association rules; efficient sequential pattern enumeration; sequential pattern mining algorithm; Application software; Association rules; Bioinformatics; Computer science; Conferences; Data engineering; Data mining; Databases; Itemsets; Statistics; Sequential pattern; association rule; candidate enumeration; data mining; minimum support;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshops, 2008. ICDMW '08. IEEE International Conference on
  • Conference_Location
    Pisa
  • Print_ISBN
    978-0-7695-3503-6
  • Electronic_ISBN
    978-0-7695-3503-6
  • Type

    conf

  • DOI
    10.1109/ICDMW.2008.82
  • Filename
    4733982