• DocumentCode
    2291146
  • Title

    Mining sequential patterns using graph search techniques

  • Author

    Huang, Yin-Fu ; Lin, Shao-Yuan

  • Author_Institution
    Inst. of Electron. & Inf. Eng., National Yunlin Univ. of Sci. & Technol., Taiwan
  • fYear
    2003
  • fDate
    3-6 Nov. 2003
  • Firstpage
    4
  • Lastpage
    9
  • Abstract
    Sequential patterns discovery had emerged as an important problem in data mining. In this paper, we propose an effective GST algorithm for mining sequential patterns in a large transaction database. Different from the apriori-like algorithms, the GST algorithm can out of order find large k-sequences (k >= 3);i.e., we can find large k-sequences not directly through large (k-1)-sequences. This leads to that our algorithm has much better performance than the Apriori-like algorithms. Besides, we also propose the method to find new sequential patterns by scanning only new transactions since the database was increased. Through several comprehensive experiments, the GST algorithm gains a significant performance improvement over the Apriori-like algorithms. Also we found as long as the ratio of the items purchased in new transactions is always much better than scanning the entire database.
  • Keywords
    data mining; graph theory; pattern recognition; transaction processing; very large databases; Apriori-like algorithms; GST algorithm; data mining; graph search techniques; k-sequences; performance improvement; sequential patterns discovery; sequential patterns mining; transaction database; Association rules; Books; Councils; Data engineering; Data mining; Marketing and sales; Out of order; Performance gain; Software algorithms; Transaction databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Software and Applications Conference, 2003. COMPSAC 2003. Proceedings. 27th Annual International
  • ISSN
    0730-3157
  • Print_ISBN
    0-7695-2020-0
  • Type

    conf

  • DOI
    10.1109/CMPSAC.2003.1245314
  • Filename
    1245314