DocumentCode :
2369791
Title :
TSP: mining top-K closed sequential patterns
Author :
Tzvetkov, Petre ; Yan, Xifeng ; Han, Jiawei
Author_Institution :
Illinois Univ., Urbana, IL, USA
fYear :
2003
fDate :
19-22 Nov. 2003
Firstpage :
347
Lastpage :
354
Abstract :
Sequential pattern mining has been studied extensively in data mining community. Most previous studies require the specification of a minimum support threshold to perform the mining. However, it is difficult for users to provide an appropriate threshold in practice. To overcome this difficulty, we propose an alternative task: mining top-k frequent closed sequential patterns of length no less than min-l, where k is the desired number of closed sequential patterns to be mined, and minl, is the minimum length of each pattern. We mine closed patterns since they are compact representations of frequent patterns. We developed an efficient algorithm, called TSP, which makes use of the length constraint and the properties of top-k closed sequential patterns to perform dynamic support-raising and projected database-pruning. Our extensive performance study shows that TSP outperforms the closed sequential pattern mining algorithm even when the latter is running with the best tuned minimum support threshold.
Keywords :
data mining; minimisation; sequences; very large databases; TSP algorithm; data mining; dynamic support-raising; minimum support threshold specification; projected database-pruning; sequential pattern mining; top-k frequent closed sequential pattern; Computer science; Data mining; Databases; Frequency; Itemsets; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
Print_ISBN :
0-7695-1978-4
Type :
conf
DOI :
10.1109/ICDM.2003.1250939
Filename :
1250939
Link To Document :
بازگشت