Title :
Evolving Sequential Patterns Mining Model over Click Stream with Levenshtein-Automata
Author :
Li, Haifeng ; Chen, Hong
Author_Institution :
Key Lab. of Data Eng. & Knowledge Eng., Renmin Univ. of China, Beijing
Abstract :
Sequential pattern mining is an important problem in continuous, fast, dynamic and unlimited stream mining. Recently approximate mining algorithms are proposed which spend too many system resources and can only obtain the partial feature of stream. In this paper, a multi-level evolving sequential pattern mining model ESPMM is presented to address this problem thus the mostly entire stream feature is obtained. Furthermore, because of the smaller support of sequential patterns in each level, a mining method BMLA based on Levenshtein-Automata is proposed which builds state conversion model to compute sequences´ similarity in linear time. The experiment results show this model is effective and efficient.
Keywords :
data mining; finite automata; Levenshtein automata; approximate mining algorithm; click stream; multilevel evolving sequential pattern mining model; sequential patterns mining model; state conversion model; stream mining; Automata; Costs; Data engineering; Data security; Data structures; Databases; Knowledge engineering; Laboratories; Pattern analysis; Web mining;
Conference_Titel :
Innovative Computing Information and Control, 2008. ICICIC '08. 3rd International Conference on
Conference_Location :
Dalian, Liaoning
Print_ISBN :
978-0-7695-3161-8
Electronic_ISBN :
978-0-7695-3161-8
DOI :
10.1109/ICICIC.2008.262