DocumentCode
2130918
Title
An Efficient Sequential Pattern Mining Algorithm Based on the 2-Sequence Matrix
Author
Hsieh, Chia-Ying ; Yang, Don-Lin ; Wu, Jungpin
Author_Institution
Dept. of Inf., Eng. & Comput. Sci., Feng Chia Univ., Taichung
fYear
2008
fDate
15-19 Dec. 2008
Firstpage
583
Lastpage
591
Abstract
Sequential pattern mining has become more and more popular in recent years due to its wide applications and the fact that it can find more information than association rules. Two famous algorithms in sequential pattern mining are AprioriAll and PrefixSpan. These two algorithms not only need to scan a database or projected-databases many times, but also require setting a minimal support threshold to prune infrequent data to obtain useful sequential patterns efficiently. In addition, they must rescan the database if new items or sequences are added. In this paper, we propose a novel algorithm called efficient sequential pattern enumeration (ESPE) to solve the above problems. In addition, our method can be applied in many applications, such as for the itemsets appearing at the same time in a sequence. In our experiments, we show that the performance of ESPE is better than the other two methods using various datasets.
Keywords
data mining; matrix algebra; 2-sequence matrix; AprioriAll; PrefixSpan; association rules; efficient sequential pattern enumeration; sequential pattern mining algorithm; Application software; Association rules; Bioinformatics; Computer science; Conferences; Data engineering; Data mining; Databases; Itemsets; Statistics; Sequential pattern; association rule; candidate enumeration; data mining; minimum support;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining Workshops, 2008. ICDMW '08. IEEE International Conference on
Conference_Location
Pisa
Print_ISBN
978-0-7695-3503-6
Electronic_ISBN
978-0-7695-3503-6
Type
conf
DOI
10.1109/ICDMW.2008.82
Filename
4733982
Link To Document