Title :
Mining Top-K Closed Frequent Traversal Sequences from Session Streams
Author :
Zhang, Xiaojian ; Peng, Huili ; Shao, Chao
Author_Institution :
Dept. of Comput. Sci., Henan Univ. of Finance & Econ., Zhengzhou
Abstract :
Approximate mining top-k closed frequent traversal sequence (Topk_CFTS) has two methods, namely, false-positive oriented and false-negative oriented. Most false-positive approaches require relaxation ratio rho to control memory consumption and mining accuracy. However, it is difficult for users to provide a proper rho value. A higher rho may reduce output precision, and a smaller rho will make memory consumption large. To resolve the conflict, this paper designs a regulatory factor to adjust rho value, and proposes an algorithm, TStream, based on false-negative method to maintain the set of Topk_CFTS in session streams with time-sensitive sliding windows. Experiments show that TStream algorithm performs much better than many established algorithms for mining Topk_CFTS.
Keywords :
data mining; TStream; false-negative oriented; false-positive oriented; mining top-k closed frequent traversal sequences; Algorithm design and analysis; Chaos; Computer science; Data mining; Databases; Finance; Fuzzy systems; Itemsets; Sensor phenomena and characterization; Web pages;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2008. FSKD '08. Fifth International Conference on
Conference_Location :
Jinan Shandong
Print_ISBN :
978-0-7695-3305-6
DOI :
10.1109/FSKD.2008.83