DocumentCode
2291146
Title
Mining sequential patterns using graph search techniques
Author
Huang, Yin-Fu ; Lin, Shao-Yuan
Author_Institution
Inst. of Electron. & Inf. Eng., National Yunlin Univ. of Sci. & Technol., Taiwan
fYear
2003
fDate
3-6 Nov. 2003
Firstpage
4
Lastpage
9
Abstract
Sequential patterns discovery had emerged as an important problem in data mining. In this paper, we propose an effective GST algorithm for mining sequential patterns in a large transaction database. Different from the apriori-like algorithms, the GST algorithm can out of order find large k-sequences (k >= 3);i.e., we can find large k-sequences not directly through large (k-1)-sequences. This leads to that our algorithm has much better performance than the Apriori-like algorithms. Besides, we also propose the method to find new sequential patterns by scanning only new transactions since the database was increased. Through several comprehensive experiments, the GST algorithm gains a significant performance improvement over the Apriori-like algorithms. Also we found as long as the ratio of the items purchased in new transactions is always much better than scanning the entire database.
Keywords
data mining; graph theory; pattern recognition; transaction processing; very large databases; Apriori-like algorithms; GST algorithm; data mining; graph search techniques; k-sequences; performance improvement; sequential patterns discovery; sequential patterns mining; transaction database; Association rules; Books; Councils; Data engineering; Data mining; Marketing and sales; Out of order; Performance gain; Software algorithms; Transaction databases;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Software and Applications Conference, 2003. COMPSAC 2003. Proceedings. 27th Annual International
ISSN
0730-3157
Print_ISBN
0-7695-2020-0
Type
conf
DOI
10.1109/CMPSAC.2003.1245314
Filename
1245314
Link To Document