DocumentCode
2865495
Title
Sequential pattern mining in multiple streams
Author
Chen, Gong ; Wu, Xindong ; Zhu, Xingquan
Author_Institution
Dept. of Comput. Sci., Vermont, Burlington Univ., VT, USA
fYear
2005
fDate
27-30 Nov. 2005
Abstract
In this paper, we deal with mining sequential patterns in multiple data streams. Building on a state-of-the-art sequential pattern mining algorithm PrefixSpan for mining transaction databases, we propose MILE, an efficient algorithm to facilitate the mining process. MILE recursively utilizes the knowledge of existing patterns to avoid redundant data scanning, and can therefore effectively speed up the new patterns´ discovery process. Another unique feature of MILE is that it can incorporate some prior knowledge of the data distribution in data streams into the mining process to further improve the performance. Extensive empirical results show that MILE is significantly faster than PrefixSpan. As MILE consumes more memory than PrefixSpan, we also present a solution to balance the memory usage and time efficiency in memory constrained environments.
Keywords
data mining; pattern classification; transaction processing; MILE algorithm; PrefixSpan algorithm; data distribution; multiple data streams; pattern discovery; sequential pattern mining; transaction database mining; Computer science; Heart rate; History; Pattern matching; Steady-state; Transaction databases;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, Fifth IEEE International Conference on
ISSN
1550-4786
Print_ISBN
0-7695-2278-5
Type
conf
DOI
10.1109/ICDM.2005.130
Filename
1565732
Link To Document