• DocumentCode
    3249644
  • Title

    SLPMiner: an algorithm for finding frequent sequential patterns using length-decreasing support constraint

  • Author

    Seno, Masakazu ; Karypis, George

  • Author_Institution
    Dept. of Comput. Sci., Minnesota Univ., Minneapolis, MN, USA
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    418
  • Lastpage
    425
  • Abstract
    Over the years, a variety of algorithms for finding frequent sequential patterns in very large sequential databases have been developed. The key feature in most of these algorithms is that they use a constant support constraint to control the inherently exponential complexity of the problem. In general, patterns that contain only a few items will tend to be interesting if they have good support, whereas long patterns can still be interesting even if their support is relatively small. Ideally, we need an algorithm that finds all the frequent patterns whose support decreases as a function of their length. In this paper we present an algorithm called SLPMiner that finds all sequential patterns that satisfy a length-decreasing support constraint. Our experimental evaluation shows that SLPMiner achieves up to two orders of magnitude of speedup by effectively exploiting the length-decreasing support constraint, and that its runtime increases gradually as the average length of the sequences (and the discovered frequent patterns) increases.
  • Keywords
    data mining; sequences; very large databases; SLPMiner; algorithms; constant support constraint; exponential complexity; frequent sequential pattern finding; length-decreasing support constraint; runtime; sequences; speedup; very large sequential databases; Association rules; Computer science; Contracts; Data mining; High performance computing; Itemsets; Runtime; Spatial databases; US Department of Energy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2002. ICDM 2003. Proceedings. 2002 IEEE International Conference on
  • Print_ISBN
    0-7695-1754-4
  • Type

    conf

  • DOI
    10.1109/ICDM.2002.1183937
  • Filename
    1183937