DocumentCode :
2362295
Title :
Fast and Effective Generation of Candidate-Sequences for Sequential Pattern Mining
Author :
Liao, Wei-Cheng ; Yang, Don-Lin ; Wu, Jungpin ; Hung, Ming-Chuan
Author_Institution :
Dept. of Inf. Eng. & Comput. Sci., Feng Chia Univ., Taichung, Taiwan
fYear :
2009
fDate :
25-27 Aug. 2009
Firstpage :
2006
Lastpage :
2009
Abstract :
The existing sequential pattern mining algorithms fall into two categories. One is the candidate-generation-and-test approach such as GSP, and the other is the pattern-growth approach such as PrefixSpan. Both GSP and PrefixSpan require setting the minimum support before their execution. We propose a new approach, called fast and effective generation of candidate-sequences (FEGC), to mine sequential patterns without predetermining the minimum support threshold. The main contribution is to scan all transactions in the database once and generate all the subsequences with their support counters. The experiments show that our algorithm performs well in various datasets.
Keywords :
data mining; GSP; PrefixSpan; fast and effective generation of candidate-sequences; sequential pattern mining algorithms; Computer science; Conference management; Counting circuits; Engineering management; Industrial engineering; Itemsets; Spatial databases; Statistics; Transaction databases; candidate generation; data mining; minimum support; sequential pattern;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
INC, IMS and IDC, 2009. NCM '09. Fifth International Joint Conference on
Conference_Location :
Seoul
Print_ISBN :
978-1-4244-5209-5
Electronic_ISBN :
978-0-7695-3769-6
Type :
conf
DOI :
10.1109/NCM.2009.266
Filename :
5331548
Link To Document :
بازگشت