DocumentCode :
2291146
Title :
Mining sequential patterns using graph search techniques
Author :
Huang, Yin-Fu ; Lin, Shao-Yuan
Author_Institution :
Inst. of Electron. & Inf. Eng., National Yunlin Univ. of Sci. & Technol., Taiwan
fYear :
2003
fDate :
3-6 Nov. 2003
Firstpage :
4
Lastpage :
9
Abstract :
Sequential patterns discovery had emerged as an important problem in data mining. In this paper, we propose an effective GST algorithm for mining sequential patterns in a large transaction database. Different from the apriori-like algorithms, the GST algorithm can out of order find large k-sequences (k >= 3);i.e., we can find large k-sequences not directly through large (k-1)-sequences. This leads to that our algorithm has much better performance than the Apriori-like algorithms. Besides, we also propose the method to find new sequential patterns by scanning only new transactions since the database was increased. Through several comprehensive experiments, the GST algorithm gains a significant performance improvement over the Apriori-like algorithms. Also we found as long as the ratio of the items purchased in new transactions is always much better than scanning the entire database.
Keywords :
data mining; graph theory; pattern recognition; transaction processing; very large databases; Apriori-like algorithms; GST algorithm; data mining; graph search techniques; k-sequences; performance improvement; sequential patterns discovery; sequential patterns mining; transaction database; Association rules; Books; Councils; Data engineering; Data mining; Marketing and sales; Out of order; Performance gain; Software algorithms; Transaction databases;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Software and Applications Conference, 2003. COMPSAC 2003. Proceedings. 27th Annual International
ISSN :
0730-3157
Print_ISBN :
0-7695-2020-0
Type :
conf
DOI :
10.1109/CMPSAC.2003.1245314
Filename :
1245314
Link To Document :
بازگشت